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METHODS OF DIAGNOSIS OF LUNG CANCER, COMPOSITIONS AND METHODS 
OF SCREENING FOR MODULATORS OF LUNG CANCER 

5 

CROSS-REFERENCES TO RELATED APPLICATIONS 

This application is related to USSN 60/284,770, filed April 18, 2001; USSN 
60/290,492, filed May 10, 2001; USSN 60/334,370, filed November 29, 2001; USSN 
60/339,245, filed November 9, 2001; USSN 60/350,666, filed November 13, 2001; and 
10 USSN 60/xxx,xxx, filed April 12, 2002 (Docket OMNI-002P); each of which is incorporated 
herein by reference in its entirety. 

FIELD OF THE INVENTION 
The invention relates to the identification of nucleic acid and protein expression 
15 profiles and nucleic acids, products, and antibodies thereto that are involved in lung cancer; 
and to the use of such expression profiles and compositions in diagnosis and therapy of lung 
cancer. The invention further relates to methods for identifying and using agents and/or 
targets that inhibit lung cancer or related conditions. 

20 BACKGROUND OF THE INVENTION 

Lung cancer is the second most commonly occurring cancer in the United States and 
is the leading cause of cancer-related death. It is estimated that there are over 160,000 new 
cases of lung cancer in the United States every year. Of those who are diagnosed with lung 
cancer, 86 percent will die within five years. Lung cancer is the most common visceral 

25 cancer in men and accounts for nearly one third of all cancer deaths in both men and women. 
In fact, lung cancer accounts for 7% of all deaths, due to any cause, in both men and women. 

Smoking is the primary cause of lung cancer, with more than 80% of lung cancers 
resulting from smoking. About 400 to 500 separate gaseous substances are present in the 
smoke of a non-filter cigarette. The most noteworthy substances include nitrogen oxides, 

30 hydrogen cyanide, formaldehyde, benzene, and toluene. The particles present in cigarette 
smoke contain at least 3,500 individual compounds such as nicotine, tobacco alkaloids 
(nornicotine, anatabine, anabasine), polycyclic aromatic hydrocarbons (e.g., benzo(a)pyrene, 
B(a)P), naphthalenes, aromatic amines, phenols, and tobacco-specific nitrosamines. 

1 
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Tobacco-specific nitrosamines are formed during tobacco curing and processing, and 
are suspected of causing lung cancer in humans. In rodent studies, regardless of the where or 
how it is applied, the tobacco-specific nitrosamine known as NNK produces lung adenomas 
and lung adenocarcinomas. The tobacco-specific nitrosamine known as NNAL also produces 
5 lung adenocarcinomas in rodents. 

Many of the chemicals found in cigarette smoke also affect the nonsmoker inhaling 
"secondhand" or sidestream smoke. Indeed, the smoke inhaled by non-smokers has a 
chemical composition similar to the smoke inhaled by smokers, but, importantly, the 
concentrations of the carcinogenic tobacco-specific nitrosamines are present in higher 
10 concentrations in second hand smoke. For this and other reasons, "passive smoking" is an 
important cause of lung cancer, causing as many as 3,000 lung cancer deaths in nonsmokers 
each year. 

In addition to smoking, other factors thought to be causes of lung cancer include on- 
the-job exposure to carcinogens such as asbestos and uranium, exposure to chemical hazards 

15 such as radon, polycyclic aromatic hydrocarbons, chromium, nickel, and inorganic arsenic, 
genetic factors, and diet. 

Histological classification of various lung cancers define the types of cancer that 
begin in the lung. See, e.g., Travis, et al. (1999) Histological Typing of L ung and Pleural 
Tumours (International Histological Classification of Tumours, No 1 . Four major cell types 

20 make up more than 88% of all primary lung neoplasms. These are: squamous or epidermoid 
carcinoma, small cell (also called oat cell) carcinoma, adenocarcinoma, and large cell (also 
called large cell anaplastic) carcinoma. The remainder include undifferentiated carcinomas, 
carcinoids, bronchial gland tumors, and other rarer types. The various cell types have 
different natural histories and responses to therapy, and, thus, a correct histologic diagnosis is 

25 the first step of effective treatment. 

Small cell lung cancer (SCLC) accounts for 18-25% of all lung cancers, and occurs 
less frequently than non-small cell lung cancers, and generally spread to distant organs more 
rapidly than non-small cell lung cancer. In general, at the time of presentation small cell lung 
cancers have already spread beyond the beyond the bounds where surgery and curative intent 

30 can be undertaken. Hoever, if identified early enough, these cancers are often responsive to 
chemotherapy and thoracic radiation treatment. 

Non-small cell lung cancers (NSCLC) are the more frequently occurring form of lung 
cancer. They comprise squamous cell carcinoma, adenocarcinoma, and large cell carcinoma 
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and account for more than 75% of all lung cancers. Non-small cell tumors that are localized 
at the time of presentation can sometimes be cured with surgery and/or radiotherapy, but 
usually are not identified until significant metastasis has occurred, which are typically not 
very responsive to surgical, chemotherapy, or radiation treatment.. 
5 The screening of asymptomatic persons at high risk for lung cancer has often proven 

ineffective. In general, only 5 to 15 percent of lung cancer patients have their disease 
detected while they are asymptomatic. Of course, early detection and treatment are critical 
factors in the fight against lung cancer. The average survival rate is 49% for those whose 
cancer is detected early, before the cancer has spread from the lung. Lung cancer often 

10 spreads outside of the lung, and it may have spread to the bones or brain by the time it is 
diagnosed. While the prognosis may be better for lung cancers that are detected early, 
because of the lack ofV effective curative treatments, early detection does not necessarily alter 
the total death rate from lung cancer. 

Thus, methods for diagnosis and prognosis of lung cancer and effective treatment of 

1 5 lung cancer would be desirable. Accordingly, provided herein are methods that can be used 
in diagnosis and prognosis of lung cancer. Further provided are methods that can be used to 
screen candidate therapeutic agents for the ability to modulate, e.g., treat, lung cancer. 
Additionally, provided herein are molecular targets and compositions for therapeutic 
intervention in lung disease and other metastatic cancers. 

20 

SUMMARY OF THE INVENTION 
The present invention provides nucleotide sequences of genes that are up- and down- 
regulated in lung cancer cells. Such genes are useful for diagnostic purposes, and also as 
targets for screening for therapeutic compounds that modulate lung cancer, such as 

25 antibodies. The methods of detecting nucleic acids of the invention or their encoded proteins 
can be used for a number of purposes. Examples include early detection of lung cancers, 
monitoring and early detection of relapse following treatment of lung cancers, monitoring 
response to therapy of lung cancers, determining prognosis of lung cancers, directing therapy 
of lung cancers, selecting patients for postoperative chemotherapy or radiation therapy, 

30 selecting therapy, determining tumor prognosis, treatment, or response to treatment, and early 
detection of precancerous lesions of the lung. Examples of benign or precancerous lesions 
include: atelectasis, emphysema, brochitis, chronic obstructive pulmonary disease, fibrosis, 
hypersensitivity pneumonitis (HP), interstitial pulmonary fibrosis (IPF), asthma, and 
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bronchiectasis. Other aspects of the invention will become apparent to the skilled artisan by 
the following description of the invention. 

In one aspect, the present invention provides a method of detecting a lung cancer- 
associated transcript in a cell from a patient, the method comprising contacting a biological 
5 sample from the patient with a polynucleotide that selectively hybridizes to a sequence at 
least 80% identical to a sequence as shown in Tables 1 A-16. Alternatively, the sample may 
be contacted with a specific binding reagent, e.g., antibody. 

In one embodiment, the polynucleotide selectively hybridizes to a sequence at least 
95% identical to a sequence as shown in Tables 1A-16. In another embodiment, the 
10 polynucleotide comprises a sequence as shown in Tables 1A-16. 

In one embodiment, the biological sample is a tissue sample, or a body fluid. In 
another embodiment, the biological sample comprises isolated nucleic acids, e.g., mRNA. 

In one embodiment, the polynucleotide is labeled, e.g., with a fluorescent label. In 
one embodiment, the polynucleotide is immobilized on a solid surface. In one embodiment, 
15 the patient is undergoing a therapeutic regimen to treat lung cancer. In another embodiment, 
the patient is suspected of having lung cancer. In one embodiment, the patient is a primate, 
e.g., a human. 

In one embodiment, the method further comprises the step of amplifying nucleic acids 
before the step of contacting the biological sample with the polynucleotide. 

20 In another aspect, the present invention provides a method of monitoring the efficacy 

of a therapeutic treatment of lung cancer, the method comprising the steps of: (i) providing a 
biological sample from a patient undergoing the therapeutic treatment; and (ii) determining 
the level of a lung cancer-associated transcript in the biological sample by contacting the 
biological sample with a polynucleotide that selectively hybridizes to a sequence at least 80% 

25 identical to a sequence as shown in Tables 1 A-16, thereby monitoring the efficacy of the 
therapy. Or the sample may be evaluated for protein, e.g., contacting the sample with an 
antibody. 

In one embodiment, the method further comprises the step of: (iii) comparing the 
level of the lung cancer-associated transcript to a level of the lung cancer-associated 
30 transcript in a biological sample from the patient prior to, or earlier in, the therapeutic 
treatment. Or the sample may be evalated for comparison of protein. 

In another aspect, the present invention provides a method of monitoring the efficacy 
of a therapeutic treatment of lung cancer, the method comprising the steps of: (i) providing a 
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biological sample from a patient undergoing the therapeutic treatment; and (ii) determining 
the level of a lung cancer-associated antibody in the biological sample by contacting the 
biological sample with a polypeptide encoded by a polynucleotide that selectively hybridizes 
to a sequence at least 80% identical to a sequence as shown in Tables 1A-16, wherein the 
5 polypeptide specifically binds to the lung cancer-associated antibody, thereby monitoring the 
efficacy of the therapy. 

In one embodiment, the method further comprises the step of: (iii) comparing the 
level of the lung cancer-associated antibody to a level of the lung cancer-associated antibody 
in a biological sample from the patient prior to, or earlier in, the therapeutic treatment. 

10 In another aspect, the present invention provides a method of monitoring the efficacy 

of a therapeutic treatment of lung cancer, the method comprising the steps of: (i) providing a 
biological sample from a patient undergoing the therapeutic treatment; and (ii) determining 
the level of a lung cancer-associated polypeptide in the biological sample by contacting the 
biological sample with an antibody, wherein the antibody specifically binds to a polypeptide 

15 encoded by a polynucleotide that selectively hybridizes to a sequence at least 80% identical 
to a sequence as shown in Tables 1 A- 16, thereby monitoring the efficacy of the therapy. 

In one embodiment, the method further comprises the step of: (iii) comparing the 
level of the lung cancer-associated polypeptide to a level of the lung cancer-associated 
polypeptide in a biological sample from the patient prior to, or earlier in, the therapeutic 

20 treatment. In one aspect, the present invention provides an isolated nucleic acid molecule 
consisting of a polynucleotide sequence as shown in Tables 1 A- 16. In one embodiment, an 
expression vector or cell comprises the isolated nucleic acid. In one aspect, the present 
invention provides an isolated polypeptide which is encoded by a nucleic acid molecule 
having polynucleotide sequence as shown in Tables 1 A- 16. 

25 In another aspect, the present invention provides an antibody that specifically binds to 

an isolated polypeptide which is encoded by a nucleic acid molecule having polynucleotide 
sequence as shown in Tables 1 A- 16. In one embodiment, the antibody is conjugated to an 
effector component, e.g., a fluorescent label, a radioisotope or a cytotoxic chemical. In one 
embodiment, the antibody is an antibody fragment. In another embodiment, the antibody is 

30 humanized. 

In one aspect, the present invention provides a method of detecting lung cancer in a a 
patient, the method comprising contacting a biological sample from the patient with an 
antibody or protein as described herein. 
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In another aspect, the present invention provides a method of detecting antibodies 
specific to a lung cancer gene in a patient, the method comprising contacting a biological 
sample from the patient with a polypeptide encoded by a nucleic acid comprises a sequence 
from Tables 1A-16. 

5 In another aspect, the present invention provides a method for identifying a compound 

that modulates a lung cancer-associated polypeptide, the method comprising the steps of: (i) 
contacting the compound with a lung cancer-associated polypeptide, the polypeptide encoded 
by a polynucleotide that selectively hybridizes to a sequence at least 80% identical to a 
sequence as shown in Tables 1 A-16; and (ii) determining the functional effect of the 

1 0 compound upon the polypeptide. 

In one embodiment, the functional effect is a physical effect, an enzymatic effect, or a 
chemical effect. In one embodiment, the polypeptide is expressed in a eukaryotic host cell or 
cell membrane. In another embodiment, the polypeptide is recombinant. In one 
embodiment, the functional effect is determined by measuring ligand binding to the 

15 polypeptide. 

In another aspect, the present invention provides a method of inhibiting proliferation 
or another critical process of a lung cancer-associated cell to treat lung cancer in a patient, the 
method comprising the step of administering to the subject a therapeutically effective amount 
of a compound identified as described herein. In one embodiment, the compound is an 
20 antibody. 

In another aspect, the present invention provides a drug screening assay comprising 
the steps of: (i) administering a test compound to a mammal having lung cancer or a cell 
isolated therefrom; (ii) comparing the level of gene expression of a polynucleotide that 
selectively hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 
25 1 A-16 in a treated cell or mammal with the level of gene expression of the polynucleotide in 
a control cell or mammal, wherein a test compound that modulates the level of expression of 
the polynucleotide is a candidate for the treatment of lung cancer. 

In one embodiment, the control is a mammal with lung cancer or a cell therefrom that 
has not been treated with the test compound. In another embodiment, the control is a normal 
30 cell or mammal, or a non-malignant lung disease. 

In another aspect, the present invention provides a method for treating a mammal 
having lung cancer comprising administering a compound identified by the assay described 
herein. 
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In another aspect, the present invention provides a pharmaceutical composition for 
treating a mammal having lung cancer, the composition comprising a compound identified by 
the assay described herein and a physiologically acceptable excipient 

5 DETAILED DESCRIPTION OF THE INVENTION 

In accordance with the objects outlined above, the present invention provides novel 
methods for diagnosis and treatment of lung disease or cancer, as well as methods for 
screening for compositions which modulate lung cancer. "Treatment, monitoring, detection 
or modulation of lung disease or cancer" includes treatment, monitoring, detection, or 

10 modulation of lung disease in those patients who have lung disease (whether malignant or 

non-malignant, e.g., emphysema, bronchitis, or fibrosis) as well as patients with lung cancers 
in which gene expression from a gene in Tables 1A-16 is increased or decreased, indicating 
that the subject is more likely to have disease. In particular,while these targets are identified 
primarily from lung cancer samples, these same targets are likely to be similarly found in 

15 analyses of other medical conditions. These other conditions may result from similar 
pathological processes which affect similar tissues, e.g., lung cancer, small cell lung 
carcinoma (oat cell carcinoma), non-small cell carcinomas (e.g., squamous cell carcinoma, 
adenocarcinoma, large cell lung carcinoma, carcinoid, granulomatous), fibrosis (idiopathic 
pulmonary fibrosis (IPF), hypersensitivity pneumonitis (HP), interstitial pneumonitis, 

20 nonspecific idiopathic pneumonitis (NSIP)), chronic obstructive pulmonary disease (COPD, 
e.g., emphysema, chronic bronchitis), asthma, bronchiectasis, and esophageal cancer. See, 
e.g., the NCI webpage and USSN 60/347,349 and USSN 60/xxx,xxx (docket LFBR-001-1P, 
filed March 29, 2002), each of which is incorporated herein by reference. The treatment may 
be of lung cancer or related condition itself, or treatment of metastasis. 

25 In particular, identification of markers selectively expressed on these cancers allows 

for use of that expression in diagnostic, prognostic, or therapeutic methods. As such, the 
invention defines various compositions, e.g., nucleic acids, polypeptides, antibodies, and 
small molecule agonists/antagonists, which will be useful to selectively identify those 
markers. For example, therapeutic methods may take the form of protein therapeutics which 

30 use the marker expression for selective localization or modulation of function (for those 
markers which have a causative disease effect), for vaccines, identification of binding 
partners, or antagonism, e.g., using antisense or RNAi. The markers may be useful for 
molecular characterization of subsets of lung diseases, which subsets may actually require 
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very different treatments. Moreover, the markers may also be important in related diseases to 
the specific cancers, e.g., which affect similar tissues in non-malignant diseases, or have 
similar mechanisms of induction/maintenance. Metastatic processes or characteristics may 
also be targeted. Diagnostic and prognostic uses are made available, e.g., to subset related 
5 but distinct diseases, or to determine treatment strategy. The detection methods may be based 
upon nucleic acid, e.g., PCR or hybridization techniques, or protein, e.g., ELISA, imaging, 
IHC, etc. The diagnosis may be qualitative or quantitative, and may detect increases or 
decreases in expression levels. 

Tables 1A-16 provide unigene cluster identification numbers for the nucleotide 

10 sequence of genes that exhibit increased or decreased expression in lung cancer samples. The 
tables also provide an exemplar accession number that provides a nucleotide sequence that is 
part of the unigene cluster. In Table 1A, genes marked as "target 1" or "target 2" are 
particularly useful as therapeutic targets. Genes marked as "target 3" are particularly useful 
as diagnostic markers. Genes marked as "chron" are upregulated in chronically diseased lung 

15 (e.g., emphysema, bronchitis, fibrosis) relative to lung tumors and normal tissue. In certain 
analyses, the ratio for the "chron" category was determined using the 70th percentile of 
chronically diseases lung samples divided by the 90th percentile of normal lung samples. 
The ratio for the targets was determined using the 70th percentile of lung tumor samples 
divided by the 90th percentile of normal lung samples. 

20 

Definitions 

The term "lung cancer protein" or "lung cancer polynucleotide" or "lung cancer- 
associated transcript" refers to nucleic acid and polypeptide polymorphic variants, alleles, 
mutants, and interspecies homologs that: (1) have a nucleotide sequence that has greater than 

25 about 60% nucleotide sequence identity, 65%, 70%, 75%, 80%, 85%, 90%, preferably 91%, 
92%>, 93%, 94%>, 95%, 96%, 97%, 98%, or 99% or greater nucleotide sequence identity, 
preferably over a region of over a region of at least about 25, 50, 100, 200, 500, 1000, or 
more nucleotides, to a nucleotide sequence of or associated with a unigene cluster of Tables 
1A-16; (2) bind to antibodies, e.g., polyclonal antibodies, raised against an immunogen 

30 comprising an amino acid sequence encoded by a nucleotide sequence of or associated with a 
unigene cluster of Tables 1A-16, and conservatively modified variants thereof; (3) 
specifically hybridize under stringent hybridization conditions to a nucleic acid sequence, or 
the complement thereof of Tables 1A-16 and conservatively modified variants thereof; or (4) 
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have an amino acid sequence that has greater than about 60% amino acid sequence identity, 
65%, 70%, 75%, 80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 
or 99% or greater amino sequence identity, preferably over a region of over a region of at 
least about 25, 50, 100, 200, 500, 1000, or more amino acid, to an amino acid sequence 
5 encoded by a nucleotide sequence of or associated with a unigene cluster of Tables 1A-16. A 
polynucleotide or polypeptide sequence is typically from a mammal including, but not 
limited to, primate, e.g., human; rodent, e.g., rat, mouse, hamster; cow, pig, horse, sheep, or 
other mammal. A "lung cancer polypeptide" and a "lung cancer polynucleotide," include 
both naturally occurring or recombinant forms. 

10 A "full length" lung cancer protein or nucleic acid refers to a lung cancer polypeptide 

or polynucleotide sequence, or a variant thereof, that contains the elements normally 
contained in one or more naturally occurring, wild type lung cancer polynucleotide or 
polypeptide sequences. The "full length" may be prior to, or after, various stages of post- 
translational processing or splicing, including alternative splicing. 

15 "Biological sample" as used herein is a sample of biological tissue or fluid that 

contains nucleic acids or polypeptides, e.g., of a lung cancer protein, polynucleotide, or 
transcript. Such samples include, but are not limited to, tissue isolated from primates, e.g., 
humans, or rodents, e.g., mice, and rats. Biological samples may also include sections of 
tissues such as biopsy and autopsy samples, frozen sections taken for histologic purposes, 

20 archival materials, blood, plasma, serum, sputum, stool, tears, mucus, hair, skin, etc. 
Biological samples also include explants and primary and/or transformed cell cultures 
derived from patient tissues. A biological sample is typically obtained from a eukaryotic 
organism, most preferably a mammal such as a primate, e.g., chimpanzee or human; cow; 
dog; cat; a rodent, e.g., guinea pig, rat, mouse; rabbit; or other mammal; or a bird; reptile; 

25 fish. Livestock and domestic animals are of interest. 

"Providing a biological sample" means to obtain a biological sample for use in 
methods described in this invention. Most often, this will be done by removing a sample of 
cells from an animal, but can also be accomplished by using previously isolated cells (e.g., 
isolated by another person, at another time, and/or for another purpose), or by performing the 

30 methods of the invention in vivo. Archival tissues or materials, having treatment or outcome 
history, will be particularly useful. 

The terms "identical" or percent "identity," in the context of two or more nucleic 
acids or polypeptide sequences, refer to two or more sequences or subsequences that are the 
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same or have a specified percentage of amino acid residues or nucleotides that are the same 
(e.g., about 60% identity, preferably 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 
95%, 96%, 97%, 98%, 99%, or higher identity over a specified region, when compared and 
aligned for maximum correspondence over a comparison window or designated region) as 
5 measured using, e.g., a BLAST or BLAST 2.0 sequence comparison algorithms with default 
parameters described below, or by manual alignment and visual inspection (see, e.g., NCBI 
web site http://www.ncbi.nlm.nih.gov/BLAST/ or the like). Such sequences are then said to 
be "substantially identical." This definition also refers to, or may be applied to, the 
complement of a test sequence. The definition also includes sequences that have deletions 
10 and/or insertions, substitutions, and naturally occurring, e.g., polymorphic or allelic variants, 
and man-made variants. As described below, the preferred algorithms can account for gaps 
and the like. Preferably, identity exists over a region that is at least about 25 amino acids or 
nucleotides in length, or more preferably over a region that is 50-100 amino acids or 
nucleotides in length. 

15 For sequence comparison, typically one sequence acts as a reference sequence, to 

which test sequences are compared. When using a sequence comparison algorithm, test and 
reference sequences are entered into a computer, subsequence coordinates are designated, if 
necessary, and sequence algorithm program parameters are designated. Preferably, default 
program parameters can be used, or alternative parameters can be designated. The sequence 

20 comparison algorithm then calculates the percent sequence identities for the test sequences 
relative to the reference sequence, based on the program parameters. 

A "comparison window", as used herein, includes reference to a segment of 
contiguous positions selected from the group consisting typically of from 20 to 600, usually 
about 50 to about 200, more usually about 100 to about 150 in which a sequence may be 

25 compared to a reference sequence of the same number of contiguous positions after the two 
sequences are optimally aligned. Methods of alignment of sequences for comparison are 
well-known in the art. Optimal alignment of sequences for comparison can be conducted, 
e.g., by the local homology algorithm of Smith and Waterman (1981) Adv. Appl. Math. 
2:482, by the homology alignment algorithm of Needleman and Wunsch (1970) J. Mol. Biol. 

30 48:443, by the search for similarity method of Pearson and Lipman (1988) Proc. Nat'l. Acad. 
Sci. USA 85:2444, by computerized implementations of these algorithms (GAP, BESTFIT, 
FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer 
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Group, 575 Science Dr., Madison, WI), or by manual alignment and visual inspection (see, 
e.g., Ausubel, et aL (eds. 1995 and supplements) Current Protocols in Molecular Biology . 

Preferred examples of algorithms that are suitable for determining percent sequence 
identity and sequence similarity include the BLAST and BLAST 2.0 algorithms, which are 
5 described in Altschul, et al. (1977) Nuc. Acids Res. 25:3389-3402 and Altschul, et al. (1990) 
J. Mol. Biol. 215:403-410. BLAST and BLAST 2.0 are used, with the parameters described 
herein, to determine percent sequence identity for the nucleic acids and proteins of the 
invention. Software for performing BLAST analyses is publicly available through the 
National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This 

10 algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short 
words of length W in the query sequence, which either match or satisfy some positive-valued 
threshold score T when aligned with a word of the same length in a database sequence. T is 
referred to as the neighborhood word score threshold (Altschul, et al., supra). These initial 
neighborhood word hits act as seeds for initiating searches to find longer HSPs containing 

15 them. The word hits are extended in both directions along each sequence for as far as the 
cumulative alignment score can be increased. Cumulative scores are calculated using, e.g., 
for nucleotide sequences, the parameters M (reward score for a pair of matching residues; 
always > 0) and N (penalty score for mismatching residues; always < 0). For amino acid 
sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word 

20 hits in each direction are halted when: the cumulative alignment score falls off by the 

quantity X from its maximum achieved value; the cumulative score goes to zero or below, 
due to the accumulation of one or more negative-scoring residue alignments; or the end of 
either sequence is reached. The BLAST algorithm parameters W, T, and X determine the 
sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) 

25 uses as defaults a wordlength (W) of 1 1, an expectation (E) of 10, M=5, N=-4 and a 
comparison of both strands. For amino acid sequences, the BLASTP program uses as 
defaults a wordlength of 3, and expectation (E) of 10, and the BLOSUM62 scoring matrix 
(see Henikoff and Henikoff (1989) Proc. Natl. Acad. Sci. USA 89:10915) alignments (B) of 
50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands. 

30 The BLAST algorithm also performs a statistical analysis of the similarity between 

two sequences (see, e.g., Karlin and Altschul (1993) Proc. NafL Acad. Sci. USA 90:5873- 
5787). One measure of similarity provided by the BLAST algorithm is the smallest sum 
probability (P(N))> which provides an indication of the probability by which a match between 

11 
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two nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid 
is considered similar to a reference sequence if the smallest sum probability in a comparison 
of the test nucleic acid to the reference nucleic acid is less than about 0.2, more preferably 
less than about 0.01, and most preferably less than about 0.001. Log values may be negative 
5 large numbers, e.g., 5, 10, 20, 30, 40, 40, 70, 90, 1 10, 150, 170, etc. 

An indication that two nucleic acid sequences are substantially identical is that the 
polypeptide encoded by the first nucleic acid is immunologically cross reactive with the 
antibodies raised against the polypeptide encoded by the second nucleic acid. Thus, a 
polypeptide is typically substantially identical to a second polypeptide, e.g., where the two 
10 peptides differ only by conservative substitutions. Another indication that two nucleic acid 
sequences are substantially identical is that the two molecules or their complements hybridize 
to each other under stringent conditions. Yet another indication that two nucleic acid 
sequences are substantially identical is that the same primers can be used to amplify the 
sequences. 

15 A "host cell" is a naturally occurring cell or a transformed cell that contains an 

expression vector and supports the replication or expression of the expression vector. Host 
cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be 
prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or 
mammalian cells such as CHO, HeLa, and the like (see, e.g., the American Type Culture 

20 Collection catalog or web site, www.atcc.org). 

The terms "isolated," "purified," or "biologically pure" refer to material that is 
substantially or essentially free from components that normally accompany it as found in its 
native state. Purity and homogeneity are typically determined using analytical chemistry 
techniques such as polyacrylamide gel electrophoresis or high performance liquid 

25 chromatography. A protein or nucleic acid that is the predominant species present in a 

preparation is substantially purified. In particular, an isolated nucleic acid is separated from 
some open reading frames that naturally flank the gene and encode proteins other than protein 
encoded by the gene. The term "purified" in some embodiments denotes that a nucleic acid 
or protein gives rise to essentially one band in an electrophoretic gel. Preferably, it means 

30 that the nucleic acid or protein is at least about 85% pure, more preferably at least 95% pure, 
and most preferably at least 99% pure. "Purify" or "purification" in other embodiments 
means removing at least one contaminant or component from the composition to be purified. 
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In this sense, purification does not require that the purified compound be homogeneous, e.g., 
100% pure. 

The terms "polypeptide/' "peptide" and "protein" are used interchangeably herein to 
refer to a polymer of amino acid residues. The terms apply to amino acid polymers in which 
5 one or more amino acid residue is an artificial chemical mimetic of a corresponding naturally 
occurring amino acid, as well as to naturally occurring amino acid polymers, those containing 
modified residues, and non-naturally occurring amino acid polymer. 

The term "amino acid" refers to naturally occurring and synthetic amino acids, as well 
as amino acid analogs and amino acid mimetics that function similarly to the naturally 

10 occurring amino acids. Naturally occurring amino acids are those encoded by the genetic 
code, as well as those amino acids that are later modified, e.g., hydroxyproline, y- 
carboxyglutamate, and O-phosphoserine. Amino acid analogs refer to compounds that have 
the same basic chemical structure as a naturally occurring amino acid, e.g., an a carbon that is 
bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine, 

15 norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs may have 
modified R groups (e.g., norleucine) or modified peptide backbones, but retain some basic 
chemical structure as a naturally occurring amino acid. Amino acid mimetics refer to 
chemical compounds that have a structure that is different from the general chemical 
structure of an amino acid, but that function similarly to another amino acid. 

20 Amino acids may be referred to herein by either their commonly known three letter 

symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical 
Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly 
accepted single-letter codes. 

"Conservatively modified variants" applies to both amino acid and nucleic acid 

25 sequences. With respect to particular nucleic acid sequences, conservatively modified 

variants refers to those nucleic acids which encode identical or essentially identical amino 
acid sequences, or where the nucleic acid does not encode an amino acid sequence, to 
essentially identical or associated, e.g., naturally contiguous, sequences. Because of the 
degeneracy of the genetic code, a large number of functionally identical nucleic acids encode 

30 most proteins. For instance, the codons GCA, GCC, GCG, and GCU each encode the amino 
acid alanine. Thus, at each position where an alanine is specified by a codon, the codon can 
be altered to another of the corresponding codons described without altering the encoded 
polypeptide. Such nucleic acid variations are "silent variations," which are one species of 

13 
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conservatively modified variations. Every nucleic acid sequence Herein which encodes a 
polypeptide also describes silent variations of the nucleic acid. In certain contexts each 
codon in a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and 
TGG, which is ordinarily the only codon for tryptophan) can be modified to yield a 
5 functionally similar molecule. Accordingly, a silent variation of a nucleic acid which 
encodes a polypeptide is implicit in a described sequence with respect to the expression 
product, but not necessarily with respect to actual probe sequences. 

As to amino acid sequences, one of skill will recognize that individual substitutions, 
deletions or additions to a nucleic acid, peptide, polypeptide, or protein sequence which 

10 alters, adds or deletes a single amino acid or a small percentage of amino acids in the encoded 
sequence is a "conservatively modified variant" where the alteration results in the substitution 
of an amino acid with a chemically similar amino acid. Conservative substitution tables 
providing functionally similar amino acids are well known in the art. Such conservatively 
modified variants are in addition to and do not exclude polymorphic variants, interspecies 

15 homologs, and alleles of the invention. Typically conservative substitutions include for one 
another: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), Glutamic acid (E); 3) Asparagine 
(N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (I), Leucine (L), Methionine 
(M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W); 7) Serine (S), 
Threonine (T); and 8) Cysteine (C), Methionine (M) (see, e.g., Creighton, Proteins (1984)). 

20 Macromolecular structures such as polypeptide structures can be described in terms of 

various levels of organization. For a general discussion of this organization, see, e.g., 
Alberts, et al. (1994) Molecular Biology of the Cell (3 rd ed.) and Cantor and Schimmel (1980) 
Biophysical Chemistry Part I: The Conformation of Biological Macromolecules . "Primary 
structure" refers to the amino acid sequence of a particular peptide. "Secondary structure" 

25 refers to locally ordered, three dimensional structures within a polypeptide. These structures 
are commonly known as domains. Domains are portions of a polypeptide that often form a 
compact unit of the polypeptide and are typically 25 to approximately 500 amino acids long. 
Typical domains are made up of sections of lesser organization such as stretches of (3-sheet 
and a-helices. "Tertiary structure" refers to the complete three dimensional structure of a 

30 polypeptide monomer. "Quaternary structure" refers to the three dimensional structure 
formed, usually by the noncovalent association of independent tertiary units. Anisotropic 
terms are also known as energy terms. 
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'"Nucleic acid" or "oligonucleotide" or "polynucleotide" or grammatical equivalents 
used herein means at least two nucleotides covalently linked together. Oligonucleotides are 
typically from about 5, 6, 7, 8, 9, 10, 12, 15, 25, 30, 40, 50 or more nucleotides in length, up 
to about 100 nucleotides in length. Nucleic acids and polynucleotides are a polymers of any 
5 length, including longer lengths, e.g., 200, 300, 500, 1000, 2000, 3000, 5000, 7000, 10,000, 
etc. A nucleic acid of the present invention will generally contain phosphodiester bonds, 
although in some cases, nucleic acid analogs are included that may have at least one different 
linkage, e.g., phosphoramidate, phosphorothioate, phosphorodithioate, or O- 
methylphophoroamidite linkages (see Eckstein (1992) Oligonucleotides and Analogues: A 

10 Practical Approach Oxford University Press); and peptide nucleic acid backbones and 
linkages. Other analog nucleic acids include those with positive backbones; non-ionic 
backbones, and non-ribose backbones, including those described in U.S. Patent Nos. 
5,235,033 and 5,034,506, and Chapters 6 and 7, in Sanghui and Cook, eds. Carbohydrate 
Modifications in Antisense Research . ASC Symposium Series 580. Nucleic acids containing 

15 one or more carbocyclic sugars are also included within one definition of nucleic acids. 

Modifications of the ribose-phosphate backbone may be done for a variety of reasons, e.g., to 
increase the stability and half-life of such molecules in physiological environments or as 
probes on a biochip. Mixtures of naturally occurring nucleic acids and analogs can be made; 
alternatively, mixtures of different nucleic acid analogs, and mixtures of naturally occurring 

20 nucleic acids and analogs may be made. 

Particularly preferred are peptide nucleic acids (PNA) which includes peptide nucleic 
acid analogs. These backbones are substantially non-ionic under neutral conditions, in 
contrast to the highly charged phosphodiester backbone of naturally occurring nucleic acids. 
This results in two advantages. First, the PNA backbone exhibits improved hybridization 

25 kinetics. PNAs have larger changes in the melting temperature (T m ) for mismatched versus 
perfectly matched basepairs. DNA and RNA typically exhibit a 2-4° C drop in T m for an 
internal mismatch. With the non-ionic PNA backbone, the drop is closer to 7-9° C. 
Similarly, due to their non-ionic nature, hybridization of the bases attached to these 
backbones is relatively insensitive to salt concentration. In addition, PNAs are not degraded 

30 by cellular enzymes, and thus can be more stable. 

The nucleic acids may be single stranded or double stranded, as specified, or contain 
portions of both double stranded or single stranded sequence. As will be appreciated by those 
in the art, the depiction of a single strand also defines the sequence of the complementary 
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strand; thus the sequences described herein also provide the complement of the sequence. 
The nucleic acid may be DNA, both genomic and cDNA, RNA, or a hybrid, where the 
nucleic acid may contain combinations of deoxyribo- and ribo-nucleotides, and combinations 
of bases, including uracil, adenine, thymine, cytosine, guanine, inosine, xanthine 
5 hypoxanthine, isocytosine, isoguanine, etc. "Transcript" typically refers to a naturally 
occurring RNA, e.g., a pre-mRNA, hnRNA, or mRNA. As used herein, the term 
"nucleoside" includes nucleotides and nucleoside and nucleotide analogs, and modified 
nucleosides such as amino modified nucleosides. In addition, "nucleoside" includes non- 
naturally occurring analog structures. Thus, e.g., the individual units of a peptide nucleic 

10 acid, each containing a base, are referred to herein as a nucleoside. 

A "label" or a "detectable moiety" is a composition detectable by spectroscopic, 
photochemical, biochemical, immunochemical, physiological, chemical, or other physical 
means. For example, useful labels include 32 P, fluorescent dyes, electron-dense reagents, 
enzymes (e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins 

15 or other entities which can be made detectable, e.g., by incorporating a radiolabel into the 

peptide or used to detect antibodies specifically reactive with the peptide. The labels may be 
incorporated into the cancer nucleic acids, proteins, and antibodies. Many methods known in 
the art for conjugating the antibody to the label may be employed, including those methods 
described by Hunter, et al. (1962) Nature 144:945; David, et al. (1974) Biochemistry 

20 13:1014-1021; Pain, etal. H98n J. Immunol. Meth. , 40:219-230; and Nygren (1982) JL 
Histochem. and Cvtochem. 30:407-412. 

An "effector" or "effector moiety" or "effector component" is a molecule that is 
bound (or linked, or conjugated), either covalently, through a linker or a chemical bond, or 
noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to an antibody. 

25 The "effector" can be a variety of molecules including, e.g., detection moieties including 

radioactive compounds, fluorescent compounds, an enzyme or substrate, tags such as epitope 
tags, a toxin; activatable moieties, a chemotherapeutic agent; a lipase; an antibiotic; or a 
radioisotope emitting "hard" e.g., beta radiation. 

A "labeled nucleic acid probe or oligonucleotide" is one that is bound, either 

30 covalently, through a linker or a chemical bond, or noncovalently, through ionic, van der 

Waals, electrostatic, or hydrogen bonds to a label such that the presence of the probe may be 
detected by detecting the presence of the label bound to the probe. Alternatively, method 
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using high affinity interactions may achieve the same results where one of a pair of binding 
partners binds to the other, e.g., biotin, streptavidin. 

As used herein a "nucleic acid probe or oligonucleotide" is a nucleic acid capable of 
binding to a target nucleic acid of complementary sequence through one or more types of 
5 chemical bonds, usually through complementary base pairing, e.g., through hydrogen bond 
formation. As used herein, a probe may include natural (i.e., A, G, C, or T) or modified bases 
(7-deazaguanosine, inosine, etc.). In addition, the bases in a probe may be joined by a 
linkage other than a phosphodiester bond, preferably one that does not functionally interfere 
with hybridization. Thus, e.g., probes may be peptide nucleic acids in which the constituent 

10 bases are joined by peptide bonds rather than phosphodiester linkages. Probes may bind 

target sequences lacking complete complementarity with the probe sequence depending upon 
the stringency of the hybridization conditions. The probes are preferably directly labeled, 
e.g., with isotopes, chromophores, lumiphores, chromogens, or indirectly labeled, e.g., with 
biotin to which a streptavidin complex may later bind. By assaying for the presence or 

15 absence of the probe, one can detect the presence or absence of the select sequence or 

subsequence. Diagnosis or prognosis may be based at the genomic level, or at the level of 
RNA or protein expression. 

The term "recombinant" when used with reference, e.g., to a cell, or nucleic acid, 
protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been modified by 

20 the introduction of a heterologous nucleic acid or protein or the alteration of a native nucleic 
acid or protein, or that the cell is derived from a cell so modified. Thus, e.g., recombinant 
cells express genes that are not found within the native (non-recombinant) form of the cell or 
express native genes that are otherwise abnormally expressed, under expressed or not 
expressed at all. By the term "recombinant nucleic acid" herein is meant nucleic acid, 

25 originally formed in vitro, in general, by the manipulation of nucleic acid, e.g., using 

polymerases and endonucleases, in a form not normally found in nature. In this manner, 
operably linkage of different sequences is achieved. Thus an isolated nucleic acid, in a linear 
form, or an expression vector formed in vitro by ligating DNA molecules that are not 
normally joined, are both considered recombinant for the purposes of this invention. It is 

30 understood that once a recombinant nucleic acid is made and reintroduced into a host cell or 
organism, it will replicate non-recombinantly, i.e., using the in vivo cellular machinery of the 
host cell rather than in vitro manipulations; however, such nucleic acids, once produced 
recombinantly, although subsequently replicated non-recombinantly, are still considered 
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recombinant for the purposes of the invention. Similarly, a "recombinant protein" is a protein 
made using recombinant techniques, i.e., through the expression of a recombinant nucleic 
acid as depicted above. 

The term "heterologous" when used with reference to portions of a nucleic acid 
5 indicates that the nucleic acid comprises two or more subsequences that are not normally 
found in the same relationship to each other in nature. For instance, the nucleic acid is 
typically recombinantly produced, having two or more sequences, e.g., from unrelated genes 
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a 
coding region from another source. Similarly, a heterologous protein will often refer to two 
10 or more subsequences that are not found in the same relationship to each other in nature (e.g., 
a fusion protein). 

A "promoter" is typically an array of nucleic acid control sequences that direct 
transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic acid 
sequences near the start site of transcription, such as, in the case of a polymerase II type 

15 promoter, a TATA element. A promoter also optionally includes distal enhancer or repressor 
elements, which can be located as much as several thousand base pairs from the start site of 
transcription. A "constitutive" promoter is a promoter that is active under most 
environmental and developmental conditions. An "inducible" promoter is a promoter that is 
active under environmental or developmental regulation. The term "operably linked" refers 

20 to a functional linkage between a nucleic acid expression control sequence (such as a 

promoter, or array of transcription factor binding sites) and a second nucleic acid sequence, 
e.g., wherein the expression control sequence directs transcription of the nucleic acid 
corresponding to the second sequence. 

An "expression vector" is a nucleic acid construct, generated recombinantly or 

25 synthetically, with a series of specified nucleic acid elements that permit transcription of a 

particular nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, or 
nucleic acid fragment. Typically, the expression vector includes a nucleic acid to be 
transcribed in operable linkage to a promoter. 

The phrase "selectively (or specifically) hybridizes to" refers to the binding, 

30 duplexing, or hybridizing of a molecule selectively to a particular nucleotide sequence under 
stringent hybridization conditions when that sequence is present in a complex mixture (e.g., 
total cellular or library DNA or RNA). 
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The phrase "stringent hybridization conditions" refers to conditions under which a 
probe will hybridize to its target subsequence, typically in a complex mixture of nucleic 
acids, but to essentially no other sequences. Stringent conditions are sequence-dependent and 
will be different in different circumstances. Longer sequences hybridize specifically at 
5 higher temperatures. An extensive guide to the hybridization of nucleic acids is found in 
"Overview of principles of hybridization and the strategy of nucleic acid assays" in Tijssen 
(1993) Techniques in Biochemistry and Molecular Biology— Hybridization with Nucleic 
Probes (vol. 24) Elsevier. Generally, stringent conditions are selected to be about 5-10° C 
lower than the thermal melting point (T m ) for the specific sequence at a defined ionic strength 

10 pH. The T m is the temperature (under defined ionic strength, pH, and nucleic concentration) 
at which 50% of the probes complementary to the target hybridize to the target sequence at 
equilibrium (as the target sequences are present in excess, at T m , 50% of the probes are 
occupied at equilibrium). Stringent conditions will be those in which the salt concentration is 
less than about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion concentration (or 

15 other salts) at pH 7.0 to 8.3 and the temperature is at least about 30° C for short probes (e.g., 
10 to 50 nucleotides) and at least about 60° C for long probes (e.g., greater than 50 
nucleotides). Stringent conditions may also be achieved with the addition of destabilizing 
agents such as formamide. For selective or specific hybridization, a positive signal is 
typically at least two times background, preferably 10 times background hybridization. 

20 Exemplary stringent hybridization conditions are often: 50% formamide, 5x SSC, and 1% 
SDS, incubating at 42° C, or, 5x SSC, 1% SDS, incubating at 65° C, with wash in 0.2x SSC, 
and 0.1% SDS at 65° C. For PCR, a temperature of about 36° C is typical for low stringency 
amplification, although annealing temperatures may vary between about 32° C and 48° C 
depending on primer length. For high stringency PCR amplification, a temperature of about 

25 62° C is typical, although high stringency annealing temperatures can range from about 50° C 
to about 65° C, depending on the primer length and specificity. Typical cycle conditions for 
both high and low stringency amplifications include a denaturation phase of 90° C - 95° C for 
0.5 - 2 min., an annealing phase lasting 0.5 - 2 min., and an extension phase of about 72° C 
for 1 - 2 min. Protocols and guidelines for low and high stringency amplification reactions 

30 are provided, e.g., in Innis, et al.(1990) PCR Protocols. A Guide to Methods and 
A pplications . 

Nucleic acids that do not hybridize to each other under stringent conditions are still 
substantially identical if the polypeptides which they encode are substantially identical. This 
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occurs, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy 
permitted by the genetic code. In such cases, the nucleic acids typically hybridize under 
moderately stringent hybridization conditions. Exemplary "moderately stringent 
hybridization conditions" include a hybridization in a buffer of 40% formamide, 1 M NaCl, 
5 1% SDS at 37° C, and a wash in IX SSC at 45° C. A positive hybridization is at least twice 
background. Alternative hybridization and wash conditions can be utilized to provide 
conditions of similar stringency. Additional guidelines for determining hybridization 
parameters are provided in numerous reference, e.g., Ausubel, et al. (ed.) Current Protocols in 
Molecular Biology Lippincott. 

10 The phrase "functional effects" in the context of assays for testing compounds that 

modulate activity of a lung cancer protein includes the determination of a parameter that is 
indirectly or directly under the influence of the lung cancer protein or nucleic acid, e.g., a 
physiological, enzymatic, functional, physical, or chemical effect, such as the ability to 
decrease lung cancer. It includes ligand binding activity; cell viability, cell growth on soft 

15 agar; anchorage dependence; contact inhibition and density limitation of growth; cellular 
proliferation; cellular transformation; growth factor or serum dependence; tumor specific 
marker levels; invasiveness into Matrigel; tumor growth and metastasis in vivo; mRNA and 
protein expression in cells undergoing metastasis, and other characteristics of lung cancer 
cells. "Functional effects" include in vitro, in vivo, and ex vivo activities. 

20 By "determining the functional effect" is meant assaying for a compound that 

increases or decreases a parameter that is indirectly or directly under the influence of a lung 
cancer protein sequence, e.g., physiological, functional, enzymatic, physical, or chemical 
effects. Such functional effects can be measured by many means known to those skilled in 
the art, e.g., changes in spectroscopic characteristics (e.g., fluorescence, absorbance, 

25 refractive index), hydrodynamic (e.g., shape), chromatographic, or solubility properties for 
the protein, measuring inducible markers or transcriptional activation of the lung cancer 
protein; measuring binding activity or binding assays, e.g., binding to antibodies or other 
ligands, and measuring cellular proliferation. Determination of the functional effect of a 
compound on lung cancer can also be performed using lung cancer assays known to those of 

30 skill in the art such as an in vitro assays, e.g., cell growth on soft agar; anchorage 

dependence; contact inhibition and density limitation of growth; cellular proliferation; 
cellular transformation; growth factor or serum dependence; tumor specific marker levels; 
invasiveness into Matrigel; tumor growth and metastasis in vivo; mRNA and protein 
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expression in cells undergoing metastasis, and other characteristics of lung cancer cells. The 

functional effects can be evaluated by many means known to those skilled in the art, e.g., 

microscopy for quantitative or qualitative measures of alterations in morphological features, 

measurement of changes in RNA or protein levels for lung cancer-associated sequences, 

5 measurement of RNA stability, identification of downstream or reporter gene expression 

(CAT, luciferase, P-gal, GFP, and the like), e.g., via chemiluminescence, fluorescence, 

colorimetric reactions, antibody binding, inducible markers, and ligand binding assays. 

"Inhibitors", "activators", and "modulators" of lung cancer polynucleotide and 

polypeptide sequences are used to refer to activating, inhibitory, or modulating molecules or 

10 compounds identified using in vitro and in vivo assays of lung cancer polynucleotide and 

polypeptide sequences. Inhibitors are compounds that, e.g., bind to, partially or totally block 
activity, decrease, prevent, delay activation, inactivate, desensitize, or down regulate the 
activity or expression of lung cancer proteins, e.g., antagonists. Antisense or inhibitory 
nucleic acids may seem to inhibit expression and subsequent function of the protein. 

15 "Activators" are compounds that increase, open, activate, facilitate, enhance activation, 
sensitize, agonize, or up regulate lung cancer protein activity. Inhibitors, activators, or 
modulators also include genetically modified versions of lung cancer proteins, e.g., versions 
with altered activity, as well as naturally occurring and synthetic ligands, antagonists, 
agonists, antibodies, small chemical molecules and the like. Such assays for inhibitors and 

20 activators include, e.g., expressing the lung cancer protein in vitro, in cells, or cell 

membranes, applying putative modulator compounds, and then determining the functional 
effects on activity, as described above. Activators and inhibitors of lung cancer can also be 
identified by incubating lung cancer cells with the test compound and determining increases 
or decreases in the expression of 1 or more lung cancer proteins, e.g., 1, 2, 3, 4, 5, 10, 15, 20, 

25 25, 30, 40, 50 or more lung cancer proteins, such as lung cancer proteins encoded by the 
sequences set out in Tables 1A-16. 

Samples or assays comprising lung cancer proteins that are treated with a potential 
activator, inhibitor, or modulator are compared to control samples without the inhibitor, 
activator, or modulator to examine the extent of inhibition. Control samples (untreated with 

30 inhibitors) are assigned a relative protein activity value of 1 00%. Inhibition of a polypeptide 
is achieved when the activity value relative to the control is about 80%, preferably 50%, more 
preferably 25-0%. Activation of a lung cancer polypeptide is achieved when the activity 
value relative to the control (untreated with activators) is 1 10%, more preferably 150%, more 
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preferably 200-500% (i.e., two to five fold higher relative to the control), more preferably 
1000-3000% higher. 

The phrase "changes in cell growth" refers to any change in cell growth and 
proliferation characteristics in vitro or in vivo, such as cell viability, formation of foci, 
5 anchorage independence, semi-solid or soft agar growth, changes in contact inhibition and 
density limitation of growth, loss of growth factor or serum requirements, changes in cell 
morphology, gaining or losing immortalization, gaining or losing tumor specific markers, 
ability to form or suppress tumors when injected into suitable animal hosts, and/or 
immortalization of the cell. See, e.g., Freshney (1994) Culture of Animal Cells a Manual of 

10 Basic Technique pp. 231-241 (3 rd ed.). 

"Tumor cell" refers to precancerous, cancerous, and normal cells in a tumor. 
"Cancer cells," "transformed" cells, or "transformation" in tissue culture, refers to 
spontaneous or induced phenotypic changes that do not necessarily involve the uptake of new 
genetic material. Although transformation can arise from infection with a transforming virus 

15 and incorporation of new genomic DNA, or uptake of exogenous DNA, it can also arise 

spontaneously or following exposure to a carcinogen, thereby mutating an endogenous gene. 
Transformation is associated with phenotypic changes, such as immortalization of cells, 
aberrant growth control, nonmorphological changes, and/or malignancy (see, Freshney 
(1994) Culture of Animal Cells a Manual of Basic Technique (3 rd ed.)). 

20 "Antibody" refers to a polypeptide comprising a framework region from an 

immunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen. 
The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, 
epsilon, and mu constant region genes, as well as the myriad immunoglobulin variable region 
genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as 

25 gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, 
IgM, IgA, IgD, and IgE, respectively. Typically, the antigen-binding region of an antibody or 
its functional equivalent will be most critical in specificity and affinity of binding. See Paul, 
Fundamental Immunology . 

An exemplary immunoglobulin (antibody) structural unit comprises a tetramer. Each 

30 tetramer is composed of two identical pairs of polypeptide chains, each pair having one 
"light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus of each 
chain defines a variable region of about 100 to 110 or more amino acids primarily responsible 
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for antigen recognition. The terms variable light chain (V L ) and variable heavy chain (V H ) 

refer to these light and heavy chains respectively. 

Antibodies exist, e.g., as intact immunoglobulins or as a number of well-characterized 

fragments produced by digestion with various peptidases. Thus, e.g., pepsin digests an 

5 antibody below the disulfide linkages in the hinge region to produce F(ab)' 2 , a dimer of Fab 

which itself is a light chain joined to V H -C H 1 by a disulfide bond. The F(ab)' 2 may be 

reduced under mild conditions to break the disulfide linkage in the hinge region, thereby 

converting the F(ab)' 2 dimer into an Fab 5 monomer. The Fab 5 monomer is essentially Fab 

with part of the hinge region (see Paul (ed. 1999) Fundamental Immunology (4th ed.). While 

10 various antibody fragments are defined in terms of the digestion of an intact antibody, one of 
skill will appreciate that such fragments may be synthesized de novo either chemically or by 
using recombinant DNA methodology. Thus, the term antibody, as used herein, also includes 
antibody fragments either produced by the modification of whole antibodies, or those 
synthesized de novo using recombinant DNA methodologies (e.g., single chain Fv) or those 

15 identified using phage display libraries (see, e.g., McCafferty, et al. (1990) Nature 348:552- 
554). 

For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal 
antibodies, many technique known in the art can be used (see, e.g., Kohler and Milstein 
(1975) Nature 256:495-497; Kozbor, et al. (1983) Immunology Today 4:72; Cole, et al. 

20 (1985), pp. 77-96 in Monoclonal Antibodies and Cancer Therapy : Coligan (1991 and 

supplements) Current Protocols in Immunology : Harlow and Lane (1988) Antibodies. A 
Laboratory Manual : and Goding (1986) Monoclonal Antibodies: Principle s and Practice (2d 
ed.)). Techniques for the production of single chain antibodies (U.S. Patent 4,946,778) can 
be adapted to produce antibodies to polypeptides of this invention. Also, transgenic mice, or 

25 other organisms such as other mammals, may be used to express humanized antibodies. 

Alternatively, phage display technology can be used to identify antibodies and heteromeric 
Fab fragments that specifically bind to selected antigens (see, e.g., McCafferty, et al. (1990) 
Nature 348:552-554; Marks, et al. (1992) Biotechnology 10:779-783). 

A "chimeric antibody" is an antibody molecule in which, e.g, (a) the constant region, 

30 or a portion thereof, is altered, replaced, or exchanged so that the antigen binding site 
(variable region) is linked to a constant region of a different or altered class, effector 
function, and/or species, or an entirely different molecule which confers new properties to the 
chimeric antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) the 
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variable region, or a portion thereof, is altered, replaced, or exchanged with a variable region 
having a different or altered antigen specificity. 



Identification of lung cancer-associated sequences 

5 In one aspect, the expression levels of genes are determined in different patient 

samples for which diagnosis information is desired, to provide expression profiles. An 
expression profile of a particular sample is essentially a "fingerprint" of the state of the 
sample; while two states may have any particular gene similarly expressed, the evaluation of 
a number of genes simultaneously allows the generation of a gene expression profile that is 

10 characteristic of the state of the cell. That is, normal tissue may be distinguished from 
cancerous or metastatic cancerous tissue, or metastatic cancerous tissue can be compared 
with tissue from surviving cancer patients. By comparing expression profiles of tissue in 
known different lung cancer states, information regarding which genes are important 
(including both up- and down-regulation of genes) in each of these states is obtained. 

1 5 Molecular profiling may distinguish subtypes of a currently collective disease designation, 
e.g., different forms of lung cancer (chronic disease, adenocarcinoma, etc.) 

The identification of sequences that are differentially expressed in lung cancer versus 
non-lung cancer tissue allows the use of this information in a number of ways. For example, 
a particular treatment regime may be evaluated: does a chemotherap eutic drug act to down- 

20 regulate lung cancer, and thus tumor growth or recurrence, in a particular patient. 

Alternatively, a treatment step may induce other markers which may be used as targets to 
destroy tumor cells. Similarly, diagnosis and treatment outcomes may be done or confirmed 
by comparing patient samples with the known expression profiles. Malignant diseasemay be 
compared to non-malignant conditions. Metastatic tissue can also be analyzed to determine 

25 the stage of lung cancer in the tissue, or origin of primary tumor, e.g., metastasis from a 

remote primary site. Furthermore, these gene expression profiles (or individual genes) allow 
screening of drug candidates with an eye to mimicking or altering a particular expression 
profile; e.g., screening can be done for drugs that suppress the lung cancer expression profile. 
This may be done by making biochips comprising sets of the important lung cancer genes, 

30 which can then be used in these screens. PCR methods may be applied with selected primer 
pairs, and analysis may be of RNA or of genomic sequences. These methods can also be 
done on the protein basis; that is, protein expression levels of the lung cancer proteins can be 
evaluated for diagnostic purposes or to screen candidate agents. In addition, the lung cancer 
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nucleic acid sequences can be administered for gene therapy purposes, including the 
administration of antisense nucleic acids, or the lung cancer proteins (including antibodies 
and other modulators thereof) administered as therapeutic drugs or as protein or DNA 
vaccines. 

5 Thus the present invention provides nucleic acid and protein sequences that are 

differentially expressed in lung cancer relative to normal tissues and/or non-malignant lung 
disease, or in different types of lung disease, herein termed "lung cancer sequences." As 
outlined below, lung cancer sequences include those that are up-regulated (i.e., expressed at a 
higher level) in lung cancer, as well as those that are down-regulated (i.e., expressed at a 

10 lower level). In a preferred embodiment, the lung cancer sequences are from humans; 
however, as will be appreciated by those in the art, lung cancer sequences from other 
organisms may be useful in animal models of disease and drug evaluation; thus, other lung 
cancer sequences are provided, from vertebrates, including mammals, including rodents (rats, 
mice, hamsters, guinea pigs, etc.), primates, farm animals (including sheep, goats, pigs, cows, 

15 horses, etc.) and pets (dogs, cats, etc.). Lung cancer sequences from other organisms may be 
obtained using the techniques outlined below. 

Lung cancer sequences can include both nucleic acid and amino acid sequences. As 
will be appreciated by those in the art and is more fully outlined below, lung cancer nucleic 
acid sequences are useful in a variety of applications, including diagnostic applications, 

20 which will detect naturally occurring nucleic acids, as well as screening applications; e.g., 
biochips comprising nucleic acid probes or PGR microtiter plates with selected probes to the 
lung cancer sequences can be generated. 

A lung cancer sequence can be initially identified by substantial nucleic acid and/or 
amino acid sequence homology to the lung cancer sequences outlined herein. Such 

25 homology can be based upon the overall nucleic acid or amino acid sequence, and is 

generally determined as outlined below, e.g., using homology programs or hybridization 
conditions. 

For identifying lung cancer-associated sequences, the lung cancer screen typically 
includes comparing genes identified in different tissues, e.g., normal and cancerous tissues, 
30 cancer and non-malignant conditions, non-malignant conditions and normal tissues, or tumor 
tissue samples from patients who have metastatic disease vs. non metastatic tissue. Other 
suitable tissue comparisons include comparing lung cancer samples with metastatic cancer 
samples from other cancers, such as, breast, other gastrointestinal cancers, prostate, ovarian, 
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etc. Samples of, non metastatic disease tissue and tissue undergoing metastasis are applied to 

biochips comprising nucleic acid probes. The samples are first microdissected, if applicable, 
and treated as is known in the art for the preparation of mRNA. Suitable biochips are 
commercially available, e.g., from Affymetrix, Santa Clara, CA. Gene expression profiles as 
5 described herein are generated and the data analyzed. 

In one embodiment, the genes showing changes in expression as between normal and 
disease states are compared to genes expressed in other normal tissues, preferably normal 
lung, but also including, and not limited to colon, heart, brain, liver, breast, kidney, muscle, 
prostate, small intestine, large intestine, spleen, bone, and/or placenta. In a preferred 
10 embodiment, those genes identified during the lung cancer screen that are expressed in 
significant amounts in other tissues (e.g., essential organs) are removed from the profile, 
although in some embodiments, this is not necessary (e.g., where organs may be dispensible 
at a later stage of life). That is, when screening for drugs, it is usually preferable that the 
target expression be disease specific, to minimize possible side effects on other organs. 
15 In a preferred embodiment, lung cancer sequences are those that are up-regulated in 

lung cancer; that is, the expression of these genes is higher in cancerous tissue than in normal 
lung or other tissue. "Up-regulation" as used herein means, when the ratio is presented as a 
number greater than one, that the ratio is greater than one, preferably 1 .5 or greater, more 
preferably 2.0 or greater. Another embodiment is directed to sequences up-regulated in non- 
20 malignant conditions relative to normal. Unigene cluster identification numbers and 

accession numbers herein are for the GenBank sequence database and the sequences of the 
accession numbers are hereby expressly incorporated by reference. GenBank is known in the 
art, see, e.g., Benson, DA, et al (1998) Nucleic Acids Research 26:1-7 and 
http://www.ncbi.nlm.nih.gov/. Sequences are also available in other databases, e.g., 
25 European Molecular Biology Laboratory (EMBL) and DNA Database of Japan (DDBJ). 
Another embodiment is directed to sequences up-regulated in non-malignant conditions 
relative to normal. In some situations, the sequences may be derived from assembly of 
available sequences or be predicted from genomic DNA using exon prediction algorithms, 
such as FGENESH (Salamov and Solovyev (2000) Genome Res. 10:516-522). In other 
30 situations, sequences have been derived from cloning and sequencing of isolated nucleic 
acids. 

In another preferred embodiment, lung cancer sequences are those that are down- 
regulated in the lung cancer; that is, the expression of these genes is lower in cancerous tissue 
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or normal lung or other tissue. "Down-regulation" as used herein means, when the ratio is 
presented as a number greater than one, that the ratio is greater than one, preferably 1.5 or 
greater, more preferably 2.0 or greater, or, when the ratio is presented as a number less than 
one, that the ratio is less than one, preferably 0.5 or less, more preferably 0.25 or less. 

5 

Informatics 

The ability to identify genes that are over or under expressed in lung cancer can 
additionally provide high-resolution, high-sensitivity datasets which can be used in the areas 
of diagnostics, therapeutics, drug development, pharmacogenetics, protein structure, 

10 biosensor development, and other related areas. For example, the expression profiles can be 
used in diagnostic or prognostic evaluation of patients with lung cancer. Or as another 
example, subcellular toxicological information can be generated to better direct drug structure 
and activity correlation (see Anderson (1998) Pharmaceutical Proteomics: Targets. 
Mechanism, and Function, paper presented at the IBC Proteomics conference, Coronado, CA 

1 5 (June 11-12, 1998)). Subcellular toxicological information can also be utilized in a biological 
sensor device to predict the likely toxicological effect of chemical exposures and likely 
tolerable exposure thresholds (see U.S. Patent No. 5,81 1,231). Similar advantages accrue 
from datasets relevant to other biomolecules and bioactive agents (e.g., nucleic acids, 
saccharides, lipids, drugs, and the like). 

20 Thus, in another embodiment, the present invention provides a database that includes 

at least one set of assay data. The data contained in the database is acquired, e.g., using array 
analysis either singly or in a library format. The database can be in a form in which data can 
be maintained and transmitted, but is preferably an electronic database. The electronic 
database of the invention can be maintained on any electronic device allowing for the storage 

25 of and access to the database, such as a personal computer, but is preferably distributed on a 
wide area network, such as the World Wide Web. 

The focus of the present section on databases that include peptide sequence data is for 
clarity of illustration only. It will be apparent to those of skill in the art that similar databases 
can be assembled for assay data acquired using an assay of the invention. 

30 The compositions and methods for identifying and/or quantitating the relative and/or 

absolute abundance of a variety of molecular and macromolecular species from a biological 
sample representing lung cancer, i.e., the identification of lung cancer-associated sequences 
described herein, provide an abundance of information, which can be correlated with 
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pathological conditions, predisposition to disease, drug testing, therapeutic monitoring, gene- 
disease causal linkages, identification of correlates of immunity and physiological status, 
among others. Although the data generated from the assays of the invention is suited for 
manual review and analysis, in a preferred embodiment, data processing using high-speed 
5 computers is utilized. 

An array of methods for indexing and retrieving biomolecular information is known 
in the art. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational database 
system for storing biomolecular sequence information in a manner that allows sequences to 
be catalogued and searched according to one or more protein function hierarchies. U.S. 

10 Patent 5,953,727 discloses a relational database having sequence records containing 
information in a format that allows a collection of partial-length DNA sequences to be 
catalogued and searched according to association with one or more sequencing projects for 
obtaining full-length sequences from the collection of partial length sequences. U.S. Patent 
5,706,498 discloses a gene database retrieval system for making a retrieval of a gene 

1 5 sequence similar to a sequence data item in a gene database based on the degree of similarity 
between a key sequence and a target sequence. U.S. Patent 5,538,897 discloses a method 
using mass spectroscopy fragmentation patterns of peptides to identify amino acid sequences 
in computer databases by comparison of predicted mass spectra with experimentally-derived 
mass spectra using a closeness-of-fit measure. U.S. Patent 5,926,818 discloses a multi- 

20 dimensional database comprising a functionality for multi-dimensional data analysis 
described as on-line analytical processing (OLAP), which entails the consolidation of 
projected and actual data according to more than one consolidation path or dimension. U.S. 
Patent 5,295,261 reports a hybrid database structure in which the fields of each database 
record are divided into two classes, navigational and informational data, with navigational 

25 fields stored in a hierarchical topological map which can be viewed as a tree structure or as 
the merger of two or more such tree structures. 

See also Mount, et al. (2001) Bioinformatics; Durbin, et al. (eds., 1999) Biological 
Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids (; Baxevanis and 
Oeullette (eds., 1998) Bioinformatics: A Practical Guide to the Analys is of Genes and 

30 Proteins^) : Rashidi and Buehler (1999) Bioinformatics: Basic Applications in Biological 

Science and Medicine : Setubal, et al. (eds 1997) Introduction to Comput ational Molecular 
Biology : Misener and Krawetz (eds, 2000) Bioinformatics: Methods and Protocols : Higgins 
and Taylor (eds., 2000) Bioinformatics: Sequence, Structure, and Databanks : A Practical 
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Approach : Brown (2001) Bioinformatics: A Biologist's Guide to Biocomputing and the 
Internet : Han and Kamber (2000) Data Mining: Concepts and Techniques (2000); and 
Waterman (1995) Introduction to Computational Biology: Maps, Sequences, and Genomes . 
The present invention provides a computer database comprising a computer and 
5 software for storing in computer-retrievable form assay data records cross-tabulated, e.g., 
with data specifying the source of the target-containing sample from which each sequence 
specificity record was obtained. 

In an exemplary embodiment, at least one of the sources of target-containing sample 
is from a control tissue sample known to be free of pathological disorders. In a variation, at 

10 least one of the sources is a known pathological tissue specimen, e.g., a neoplastic lesion or 
another tissue specimen to be analyzed for lung cancer. In another variation, the assay 
records cross-tabulate one or more of the following parameters for each target species in a 
sample: (1) a unique identification code, which can include, e.g., a target molecular structure 
and/or characteristic separation coordinate (e.g., electrophoretic coordinates); (2) sample 

15 source; and (3) absolute and/or relative quantity of the target species present in the sample. 

The invention also provides for the storage and retrieval of a collection of target data 
in a computer data storage apparatus, which can include magnetic disks, optical disks, 
magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, magnetic 
bubble memory devices, and other data storage devices, including CPU registers and on-CPU 

20 data storage arrays. Typically, the target data records are stored as a bit pattern in an array of 
magnetic domains on a magnetizable medium or as an array of charge states or transistor gate 
states, such as an array of cells in a DRAM device (e.g., each cell comprised of a transistor 
and a charge storage area, which may be on the transistor). In one embodiment, the invention 
provides such storage devices, and computer systems built therewith, comprising a bit pattern 

25 encoding a protein expression fingerprint record comprising unique identifiers for at least 10 
target data records cross-tabulated with target source. 

When the target is a peptide or nucleic acid, the invention preferably provides a 
method for identifying related peptide or nucleic acid sequences, comprising performing a 
computerized comparison between a peptide or nucleic acid sequence assay record stored in 

30 or retrieved from a computer storage device or database and at least one other sequence. The 
comparison can include a sequence analysis or comparison algorithm or computer program 
embodiment thereof (e.g., FASTA, TFASTA, GAP, BESTFIT) and/or the comparison may 
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be of the relative amount of a peptide or nucleic acid sequence in a pool of sequences 
determined from a polypeptide or nucleic acid sample of a specimen. 

The invention also preferably provides a magnetic disk, such as an IBM-compatible 
(DOS, Windows, Windows95/98/2000, Windows NT, OS/2) or other format (e.g., Linux, 
5 SunOS, Solaris, AIX, SCO Unix, VMS, MV, Macintosh, etc.) floppy diskette or hard (fixed, 
Winchester) disk drive, comprising a bit pattern encoding data from an assay of the invention 
in a file format suitable for retrieval and processing in a computerized sequence analysis, 
comparison, or relative quantitation method. 

The invention also provides a network, comprising a plurality of computing devices 

10 linked via a data link, such as an Ethernet cable (coax or lOBaseT), telephone line, ISDN 

line, wireless network, optical fiber, or other suitable signal transmission medium, whereby at 
least one network device (e.g., computer, disk array, etc.) comprises a pattern of magnetic 
domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM cells) 
composing a bit pattern encoding data acquired from an assay of the invention. 

15 The invention also provides a method for transmitting assay data that includes 

generating an electronic signal on an electronic communications device, such as a modem, 
ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal 
includes (in native or encrypted format) a bit pattern encoding data from an assay or a 
database comprising a plurality of assay results obtained by the method of the invention. 

20 In a preferred embodiment, the invention provides a computer system for comparing a 

query target to a database containing an array of data structures, such as an assay result 
obtained by the method of the invention, and ranking database targets based on the degree of 
identity and gap weight to the target data. A central processor is preferably initialized to load 
and execute the computer program for alignment and/or comparison of the assay results. 

25 Data for a query target is entered into the central processor via an I/O device. Execution of 
the computer program results in the central processor retrieving the assay data from the data 
file, which comprises a binary description of an assay result. 

The target data or record and the computer program can be transferred to secondary 
memory, which is typically random access memory (e.g., DRAM, SRAM, SGRAM, or 

30 SDRAM). Targets are ranked according to the degree of correspondence between a selected 
assay characteristic (e.g., binding to a selected affinity moiety) and the same characteristic of 
the query target and results are output via an I/O device. For example, a central processor 
can be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, PA-8000, SPARC, 
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MIPS 4400, MIPS 10000, VAX, etc.); a program can be a commercial or public domain 

molecular biology software package (e.g., UWGCG Sequence Analysis Software, Darwin); a 

data file can be an optical or magnetic disk, a data server, a memory device (e.g., DRAM, 

SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory, etc.); an I/O device can 

5 be a terminal comprising a video display and a keyboard, a modem, an ISDN terminal 

adapter, an Ethernet port, a punched card reader, a magnetic strip reader, or other suitable I/O 

device. 

The invention also preferably provides the use of a computer system, such as that 
described above, which comprises: (1) a computer; (2) a stored bit pattern encoding a 
10 collection of peptide sequence specificity records obtained by the methods of the invention, 
which may be stored in the computer; (3) a comparison target, such as a query target; and (4) 
a program for alignment and comparison, typically with rank-ordering of comparison results 
on the basis of computed similarity values. 

1 5 Characteristics of lung cancer-associated proteins 

Lung cancer proteins of the present invention may be classified as secreted proteins, 
transmembrane proteins or intracellular proteins. In one embodiment, the lung cancer protein 
is an intracellular protein. Intracellular proteins may be found in the cytoplasm and/or in the 
nucleus. Intracellular proteins are involved in all aspects of cellular function and replication 

20 (including, e.g., signaling pathways); aberrant expression of such proteins often results in 
unregulated or disregulated cellular processes (see, e.g., Alberts (ed. 1994) Molecular 
Biology of the Cell (3d ed.). For example, many intracellular proteins have enzymatic 
activity such as protein kinase activity, protein phosphatase activity, protease activity, 
nucleotide cyclase activity, polymerase activity and the like. Intracellular proteins also serve 

25 as docking proteins that are involved in organizing complexes of proteins, or targeting 

proteins to various subcellular localizations, and are involved in maintaining the structural 
integrity of organelles. 

An increasingly appreciated concept in characterizing proteins is the presence in the 
proteins of one or more structural motifs for which defined functions have been attributed. In 

30 addition to the highly conserved sequences found in the enzymatic domain of proteins, highly 
conserved sequences have been identified in proteins that are involved in protein-protein 
interaction. For example, Src-homology-2 (SH2) domains bind tyrosine-phosphorylated 
targets in a sequence dependent manner. PTB domains, which are distinct from SH2 
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domains, also bind tyrosine phosphorylated targets. SH3 domains bind to proline-rich 

targets. In addition, PH domains, tetratricopeptide repeats and WD domains to name only a 

few, have been shown to mediate protein-protein interactions. Some of these may also be 

involved in binding to phospholipids or other second messengers. As will be appreciated by 

5 one of ordinary skill in the art, these motifs can be identified on the basis of amino acid 

sequence; thus, an analysis of the sequence of proteins may provide insight into both the 

enzymatic potential of the molecule and/or molecules with which the protein may associate. 

One useful database is Pfam (protein families), which is a large collection of multiple 

sequence alignments and hidden Markov models covering many common protein domains. 

10 Versions are available via the internet from Washington University in St. Louis, the Sanger 

Center in England, and the Karolinska Institute in Sweden (see, e.g., Bateman, et al (2000) 

Nuc. Acids Res. 28:263-266; Sonnhammer, et al. (1997) Proteins 28:405-420; Bateman, et al. 

(1999) Nuc. Acids Res. 27:260-262; and Sonnhammer, et al. (1998) Nuc. Acids Res. 26:320- 

322). 

15 In another embodiment, the lung cancer sequences are transmembrane proteins. 

Transmembrane proteins are molecules that span a phospholipid bilayer of a cell. They may 
have an intracellular domain, an extracellular domain, or both. The intracellular domains of 
such proteins may have a number of functions including those already described for 
intracellular proteins. For example, the intracellular domain may have enzymatic activity 

20 and/or may serve as a binding site for additional proteins. Frequently the intracellular 

domain of transmembrane proteins serves both roles. For example certain receptor tyrosine 
kinases have both protein kinase activity and SH2 domains. In addition, autophosphorylation 
of tyrosines on the receptor molecule itself, creates binding sites for additional SH2 domain 
containing proteins. 

25 Transmembrane proteins may contain from one to many transmembrane domains. 

For example, receptor tyrosine kinases, certain cytokine receptors, receptor guanylyl cyclases 
and receptor serine/threonine protein kinases contain a single transmembrane domain. 
However, various other proteins including channels, pumps, and adenylyl cyclases contain 
numerous transmembrane domains. Many important cell surface receptors such as G protein 

30 coupled receptors (GPCRs) are classified as "seven transmembrane domain" proteins, as they 
contain 7 membrane spanning regions. Characteristics of transmembrane domains include 
approximately 17 consecutive hydrophobic amino acids that may be followed by charged 
amino acids. Therefore, upon analysis of the amino acid sequence of a particular protein, the 
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localization and number of transmembrane domains within the protein may be predicted (see, 

e.g., PSORT web site http://psort.nibb.ac.jp/). 

The extracellular domains of transmembrane proteins are diverse; however, conserved 

motifs are found repeatedly among various extracellular domains. Conserved structure 

5 and/or functions have been ascribed to different extracellular motifs. Many extracellular 

domains are involved in binding to other molecules. In one aspect, extracellular domains are 

found on receptors. Factors that bind the receptor domain include circulating ligands, which 

may be peptides, proteins, or small molecules such as adenosine and the like. For example, 

growth factors such as EGF, FGF, and PDGF are circulating growth factors that bind to their 

10 cognate receptors to initiate a variety of cellular responses. Other factors include cytokines, 
mitogenic factors, hormones, neurotrophic factors and the like. Extracellular domains also 
bind to cell-associated molecules. In this respect, they may mediate cell-cell interactions. 
Cell-associated ligands can be tethered to the cell, e.g., via a glycosylphosphatidylinositol 
(GPI) anchor, or may themselves be transmembrane proteins. Extracellular domains may 

15 also associate with the extracellular matrix and contribute to the maintenance of the cell 
structure. 

Lung cancer proteins that are transmembrane are particularly preferred in the present 
invention as they are readily accessible targets for extracellular immunotherapeutics, as are 
described herein. In addition, as outlined below, transmembrane proteins can be also useful 

20 in imaging modalities. Antibodies may be used to label such readily accessible proteins in 
situ or in histological analysis. Alternatively, antibodies can also label intracellular proteins, 
in which case analytical samples are typically permeablized to provide access to intracellular 
proteins. In addition, some membrane proteins can be processed to release a soluble protein, 
or to expose a residual fragment. Released soluble proteins may be useful diagnostic 

25 markers, processed residual protein fragments may be useful lung markers of disease. 

It will also be appreciated by those in the art that a transmembrane protein can be 
made soluble by removing transmembrane sequences, e.g., through recombinant methods. 
Furthermore, transmembrane proteins that have been made soluble can be made to be 
secreted through recombinant means by adding an appropriate signal sequence. 

30 In another embodiment, the lung cancer proteins are secreted proteins; the secretion of 

which can be either constitutive or regulated. These proteins may have a signal peptide or 
signal sequence that targets the molecule to the secretory pathway. Secreted proteins are 
involved in numerous physiological events; e.g., if circulating, they often serve to transmit 
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signals to various other cell types. The secreted protein may function in an autocrine manner 

(acting on the cell that secreted the factor), a paracrine manner (acting on cells in close 

proximity to the cell that secreted the factor), an endocrine manner (acting on cells at a 

distance, e.g., secretion into the blood stream), or exocrine (secretion, e.g., through a duct or 

5 to adjacent epithelial surface as sweat glands, sebaceous glands, pancreatic ducts, lacrimal 

glands, mammary glands, sax producing glands of the ear, etc.). Thus secreted molecules 

often find use in modulating or altering numerous aspects of physiology. Lung cancer 

proteins that are secreted proteins are particularly preferred in the present invention as they 

serve as good targets for diagnostic markers, e.g., for blood, plasma, serum, or stool tests. 

10 Those which are enzymes may be antibody or small molecule targets. Others may be useful 

as vaccine targets, e.g., via CTL mechanisms. 

Use of lung cancer nucleic acids 

As described above, lung cancer sequence is initially identified by substantial nucleic 
15 acid and/or amino acid sequence homology or linkage to the lung cancer sequences outlined 
herein. Such homology can be based upon the overall nucleic acid or amino acid sequence, 
and is generally determined as outlined below, using either homology programs or 
hybridization conditions. Typically, linked sequences on a mRNA are found on the same 
molecule. 

20 The lung cancer nucleic acid sequences of the invention, e.g., the sequences in Tables 

1A-16, can be fragments of larger genes, i.e., they are nucleic acid segments. "Genes" in this 
context includes coding regions, non-coding regions, and mixtures of coding and non-coding 
regions. Accordingly, as will be appreciated by those in the art, using the sequences provided 
herein, extended sequences, in either direction, of the lung cancer genes can be obtained, 

25 using techniques well known in the art for cloning either longer sequences or the full length 
sequences; see Ausubel, et al., supra. Much can be done by informatics and many sequences 
can be clustered to include multiple sequences corresponding to a single gene, e.g., systems 
such as UniGene (see, http://www.ncbi.nlm.iiih.gov/UniGene/). 

Once a lung cancer nucleic acid is identified, it can be cloned and, if necessary, its 

30 constituent parts recombined to form the entire lung cancer nucleic acid coding regions or the 
entire mRNA sequence. Once isolated from its natural source, e.g., contained within a 
plasmid or other vector or excised therefrom as a linear nucleic acid segment, the 
recombinant lung cancer nucleic acid can be further-used as a probe to identify and isolate 
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other lung cancer nucleic acids, e.g., extended coding regions. It can also be used as a 

"precursor" nucleic acid to make modified or variant lung cancer nucleic acids and proteins. 

The lung cancer nucleic acids of the present invention are used in several ways. In a 

first embodiment, nucleic acid probes to the lung cancer nucleic acids are made and attached 

5 to biochips to be used in screening and diagnostic methods, as outlined below, or for 

administration, e.g., for gene therapy, RNAi, vaccine, and/or antisense applications. 

Alternatively, the lung cancer nucleic acids that include coding regions of lung cancer 

proteins can be put into expression vectors for the expression of lung cancer proteins, again 

for screening purposes or for administration to a patient. 

10 In a preferred embodiment, nucleic acid probes to lung cancer nucleic acids (both the 

nucleic acid sequences outlined in the figures and/or the complements thereof) are made. 
The nucleic acid probes attached to the biochip are designed to be substantially 
complementary to the lung cancer nucleic acids, i.e., the target sequence (either the target 
sequence of the sample or to other probe sequences, e.g., in sandwich assays), such that 

1 5 hybridization of the target sequence and the probes of the present invention occurs. As 

outlined below, this complementarity need not be perfect; there may be any number of base 
pair mismatches which will interfere with hybridization between the target sequence and the 
single stranded nucleic acids of the present invention. However, if the number of mutations 
is so great that no hybridization can occur under even the least stringent of hybridization 

20 conditions, the sequence is not a complementary target sequence. Thus, by "substantially 
complementary" herein is meant that the probes are sufficiently complementary to the target 
sequences to hybridize under appropriate reaction conditions, particularly high stringency 
conditions, as outlined herein. 

A nucleic acid probe is generally single stranded but can be partially single and 

25 partially double stranded. The strandedness of the probe is dictated by the structure, 

composition, and properties of the target sequence. In general, the nucleic acid probes range 
from about 8 to about 100 bases long, with from about 10 to about 80 bases being preferred, 
and from about 30 to about 50 bases being particularly preferred. That is, generally 
complements of ORFs or whole genes are not used. In some embodiments, nucleic acids of 

30 lengths up to hundreds of bases can be used. 

In a preferred embodiment, more than one probe per sequence is used, with either 
overlapping probes or probes to different sections of the target being used. That is, two, 
three, four or more probes, with three being preferred, are used to build in a redundancy for a 

35 



WO 02/086443 PCT/US02/12476 

particular target. The probes can be overlapping (i.e., have some sequence in common), or 
separate. In some cases, PCR primers may be used to amplify signal for higher sensitivity. 

As will be appreciated by those in the art, nucleic acids can be attached or 
immobilized to a solid support in a wide variety of ways. By "immobilized" and grammatical 
5 equivalents herein is meant the association or binding between the nucleic acid probe and the 
solid support is sufficient to be stable under the conditions of binding, washing, analysis, and 
removal as outlined below. The binding can typically be covalent or non-covalent. By "non- 
covalent binding" and grammatical equivalents herein is typically meant one or more of 
electrostatic, hydrophilic, and hydrophobic interactions. Included in non-covalent binding is 

10 the covalent attachment of a molecule, such as, streptavidin to the support and the non- 
covalent binding of the biotinylated probe to the streptavidin. By "covalent binding" and 
grammatical equivalents herein is meant that the two moieties, the solid support and the 
probe, are attached by at least one bond, including sigma bonds, pi bonds and coordination 
bonds. Covalent bonds can be formed directly between the probe and the solid support or can 

15 be formed by a cross linker or by inclusion of a specific reactive group on either the solid 
support or the probe or both molecules. Immobilization may also involve a combination of 
covalent and non-covalent interactions. 

In general, the probes are attached to a biochip in a wide variety of ways, as will be 
appreciated by those in the art. As described herein, the nucleic acids can either be 

20 synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on 
the biochip. 

The biochip comprises a suitable solid substrate. By "substrate" or "solid support" or 
other grammatical equivalents herein is meant a material that can be modified for the 
attachment or association of the nucleic acid probes and is amenable to at least one detection 

25 method. Often the substrate may contain discrete individual sites appropriate for ndivitual 
partitioning and identification. As will be appreciated by those in the art, the number of 
possible substrates are very large, and include, but are not limited to, glass and modified or 
functionalized glass, plastics (including acrylics, polystyrene and copolymers of styrene and 
other materials, polypropylene, polyethylene, polybutylene, polyurethanes, Teflon, etc.), 

30 polysaccharides, nylon or nitrocellulose, resins, silica or silica-based materials including 
silicon and modified silicon, carbon, metals, inorganic glasses, plastics, etc. In general, the 
substrates allow optical detection and do not appreciably fluoresce. A preferred substrate is 
described in US application entitled Reusable Low Fluorescent Plastic Biochip, U.S. 
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Application Serial No. 09/270,214, filed March 15, 1999, herein incorporated by reference in 
its entirety. 

Generally the substrate is planar, although as will be appreciated by those in the art, 
other configurations of substrates may be used as well. For example, the probes may be 
5 placed on the inside surface of a tube, for flow-through sample analysis to minimize sample 
volume. Similarly, the substrate may be flexible, such as a flexible foam, including closed 
cell foams made of particular plastics. 

In a preferred embodiment, the surface of the biochip and the probe may be 
derivatized with chemical functional groups for subsequent attachment of the two. Thus, e.g., 

10 the biochip is derivatized with a chemical functional group including, but not limited to, 
amino groups, carboxy groups, oxo groups and thiol groups, with amino groups being 
particularly preferred. Using these functional groups, the probes can be attached using 
functional groups on the probes. For example, nucleic acids containing amino groups can be 
attached to surfaces comprising amino groups, e.g., using linkers as are known in the art; e.g., 

15 homo-or hetero-bifunctional linkers as are well known (see 1994 Pierce Chemical Company 
catalog, technical section on cross-linkers, pages 155-200). In addition, in some cases, 
additional linkers, such as alkyl groups (including substituted and heteroalkyl groups) may be 
used. 

In this embodiment, oligonucleotides are synthesized, and then attached to the surface 
20 of the solid support. Either the 5' or 3' terminus may be attached to the solid support, or 

attachment may be via linkage to an internal nucleoside. 

In another embodiment, the immobilization to the solid support may be very strong, 

yet non-covalent. For example, biotinylated oligonucleotides can be made, which bind to 

surfaces covalently coated with streptavidin, resulting in attachment. 
25 Alternatively, the oligonucleotides may be synthesized on the surface, as is known in 

the art. For example, photoactivation techniques utilizing photopolymerization compounds 

and techniques are used. In a preferred embodiment, the nucleic acids can be synthesized in 

situ, using known photolithographic techniques, such as those described in WO 95/251 16; 

WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references cited within, all of 
30 which are expressly incorporated by reference; these methods of attachment form the basis of 

the Affymetrix GeneChip™ technology. 

Often, amplification-based assays are performed to measure the expression level of 

lung cancer-associated sequences. These assays are typically performed in conjunction with 
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reverse transcription. In such assays, a lung cancer-associated nucleic acid sequence acts as a 

template in an amplification reaction (e.g., Polymerase Chain Reaction, or PGR). In a 

quantitative amplification, the amount of amplification product will be proportional to the 

amount of template in the original sample. Comparison to appropriate controls provides a 

5 measure of the amount of lung cancer-associated RNA. Methods of quantitative 

amplification are well known to those of skill in the art. Detailed protocols for quantitative 

PGR are provided, e.g., in Innis, et al. (1990) PGR Protocols. A Guide to Methods and 

Applications . 

In some embodiments, a TaqMan based assay is used to measure expression. 

10 TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent 
dye and a 3 5 quenching agent. The probe hybridizes to a PCR product, but cannot itself be 
extended due to a blocking agent at the 3 ' end. When the PCR product is amplified in 
subsequent cycles, the 5' nuclease activity of the polymerase, e.g., AmpliTaq, results in the 
cleavage of the TaqMan probe. This cleavage separates the 5' fluorescent dye and the 3' 

15 quenching agent, thereby resulting in an increase in fluorescence as a function of 

amplification (see, e.g., literature provided by Perkin-Elmer, e.g., www2.perkin-elmer.com). 

Other suitable amplification methods include, but are not limited to, ligase chain 
reaction (LCR) (see Wu and Wallace (1989) Genomics 4:560, Landegren, et al. (1988) 
Science 241:1077, and Barringer, et al. (1990) Gene 89:117), transcription amplification 

20 (Kwoh, et al. (1989) Proc. Natl. Acad. Sci. USA 86:1 173), self-sustained sequence 

replication (Guatelli, et al. (1990) Proc. Nat. Acad. Sci. USA 87:1874), dot PCR, and linker 
adapter PCR, etc. 

Expression of lung cancer proteins from nucleic acids 

25 In a preferred embodiment, lung cancer nucleic acids, e.g., encoding lung cancer 

proteins, are used to make a variety of expression vectors to express lung cancer proteins 
which can then be used in screening assays, as described below. Expression vectors and 
recombinant DNA technology are well known to those of skill in the art (see, e.g., Ausubel, 
supra, and Fernandez and Hoeffler (eds 1999) Gene Expression Systems) and are used to 

30 express proteins. The expression vectors may be either self-replicating extrachromosomal 
vectors or vectors which integrate into a host genome. Generally, these expression vectors 
include transcriptional and translational regulatory nucleic acid operably linked to the nucleic 
acid encoding the lung cancer protein. The term "control sequences" refers to DNA 
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sequences used for the expression of an operably linked coding sequence in a particular host 

organism. Control sequences that are suitable for prokaryotes, e.g., include a promoter, 

optionally an operator sequence, and a ribosome binding site. Eukaryotic cells are known to 

utilize promoters, polyadenylation signals, and enhancers. 

5 Nucleic acid is "operably linked" when it is placed into a functional relationship with 

another nucleic acid sequence. For example, DNA for a presequence or secretory leader is 

operably linked to DNA for a polypeptide if it is expressed as a preprotein that participates in 

the secretion of the polypeptide; a promoter or enhancer is operably linked to a coding 

sequence if it affects the transcription of the sequence; or a ribosome binding site is operably 

10 linked to a coding sequence if it is positioned so as to facilitate translation. Generally, 

"operably linked" means that the DNA sequences being linked are contiguous, and, in the 
case of a secretory leader, contiguous and in reading phase. However, enhancers do not have 
to be contiguous. Linking is typically accomplished by ligation at convenient restriction 
sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are used in 

1 5 accordance with conventional practice. Transcriptional and translational regulatory nucleic 
acid will generally be appropriate to the host cell used to express the lung cancer protein. 
Numerous types of appropriate expression vectors, and suitable regulatory sequences are 
known in the art for a variety of host cells. 

In general, transcriptional and translational regulatory sequences may include, but are 

20 not limited to, promoter sequences, ribosomal binding sites, transcriptional start and stop 

sequences, translational start and stop sequences, and enhancer or activator sequences. In a 
preferred embodiment, the regulatory sequences include a promoter and transcriptional start 
and stop sequences. 

Promoter sequences may be either constitutive or inducible promoters. The promoters 
25 may be either naturally occurring promoters or hybrid promoters. Hybrid promoters, which 
combine elements of more than one promoter, are also known in the art, and are useful in the 
present invention. 

In addition, an expression vector may comprise additional elements. For example, the 
expression vector may have two replication systems, thus allowing it to be maintained in two 
30 organisms, e.g., in mammalian or insect cells for expression and in a prokaryotic host for 
cloning and amplification. Furthermore, for integrating expression vectors, the expression 
vector often contains at least one sequence homologous to the host cell genome, and 
preferably two homologous sequences which flank the expression construct. The integrating 
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vector may be directed to a specific locus in the host cell by selecting the appropriate 
homologous sequence for inclusion in the vector. Constructs for integrating vectors are well 
known in the art (e.g., Fernandez and Hoeffler, supra). 

In addition, in a preferred embodiment, the expression vector contains a selectable 
5 marker gene to allow the selection of transformed host cells. Selection genes are well known 
in the art and will vary with the host cell used. 

The lung cancer proteins of the present invention are usually produced by culturing a 
host cell transformed with an expression vector containing nucleic acid encoding a lung 
cancer protein, under the appropriate conditions to induce or cause expression of the lung 

1 0 cancer protein. Conditions appropriate for lung cancer protein expression will vary with the 
choice of the expression vector and the host cell, and will be easily ascertained by one skilled 
in the art through routine experimentation or optimization. For example, the use of 
constitutive promoters in the expression vector will require optimizing the growth and 
proliferation of the host cell, while the use of an inducible promoter requires the appropriate 

15 growth conditions for induction. In addition, in some embodiments, the timing of the harvest 
is important. For example, the baculoviral systems used in insect cell expression are lytic 
viruses, and thus harvest time selection can be crucial for product yield. 

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect and 
animal cells, including mammalian cells. Of particular interest are Saccharomyces cerevisiae 

20 and other yeasts, E. colU Bacillus subtilis, Sf9 cells, CI 29 cells, 293 cells, Neurospora, BHK, 
CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial cells), THP1 cells (a 
macrophage cell line) and various other human cells and cell lines. 

In a preferred embodiment, the lung cancer proteins are expressed in mammalian 
cells. Mammalian expression systems are also known in the art, and include retroviral and 

25 adenoviral systems. Of particular use as mammalian promoters are the promoters from 
mammalian viral genes, since the viral genes are often highly expressed and have a broad 
host range. Examples include the SV40 early promoter, mouse mammary tumor virus LTR 
promoter, adenovirus major late promoter, herpes simplex virus promoter, and the CMV 
promoter (see, e.g., Fernandez and Hoeffler, supra). Typically, transcription termination and 

30 polyadenylation sequences recognized by mammalian cells are regulatory regions located 3' 
to the translation stop codon and thus, together with the promoter elements, flank the coding 
sequence. Examples of transcription terminator and polyadenylation signals include those 
derived form SV40. 
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The methods of introducing exogenous nucleic acid into mammalian hosts, as well as 
other hosts, is well known in the art, and will vary with the host cell used. Techniques 
include dextran-mediated transfection, calcium phosphate precipitation, polybrene mediated 
transfection, protoplast fusion, electroporation, viral infection, encapsulation of the 
5 polynucleotide(s) in liposomes, and direct microinjection of the DNA into nuclei. 

In a preferred embodiment, lung cancer proteins are expressed in bacterial systems. 
Promoters from bacteriophage may also be used and are known in the art. In addition, 
synthetic promoters and hybrid promoters are also useful; e.g., the tac promoter is a hybrid of 
the trp and lac promoter sequences. Furthermore, a bacterial promoter can include naturally 

10 occurring promoters of non-bacterial origin that have the ability to bind bacterial KNA 
polymerase and initiate transcription. In addition to a functioning promoter sequence, an 
efficient ribosome binding site is desirable. The expression vector may also include a signal 
peptide sequence that provides for secretion of the lung cancer protein in bacteria. The 
protein is either secreted into the growth media (gram-positive bacteria) or into the 

15 periplasmic space, located between the inner and outer membrane of the cell (gram-negative 
bacteria). The bacterial expression vector may also include a selectable marker gene to allow 
for the selection of bacterial strains that have been transformed. Suitable selection genes 
include genes which render the bacteria resistant to drugs such as ampicillin, 
chloramphenicol, erythromycin, kanamycin, neomycin and tetracycline. Selectable markers 

10 also include biosynthetic genes, such as those in the histidine, tryptophan and leucine 

biosynthetic pathways. These components are assembled into expression vectors. Expression 
vectors for bacteria are well known in the art, and include vectors for Bacillus subtilis, E. 
coli, Streptococcus cremoris, and Streptococcus lividans, among others (e.g., Fernandez and 
Hoeffler, supra). The bacterial expression vectors are transformed into bacterial host cells 

15 using techniques well known in the art, such as calcium chloride treatment, electroporation, 
and others. 

In one embodiment, lung cancer proteins are produced in insect cells. Expression 
vectors for the transformation of insect cells, and in particular, baculovirus-based expression 
vectors, are well known in the art. 
50 In a preferred embodiment, lung cancer protein is produced in yeast cells. Yeast 

expression systems are well known in the art, and include expression vectors for 
Saccharomyces cerevisiae, Candida albicans and C. maltosa, Hansenula potymorpha, 
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Kluyveromyces fragilis and K. lactis, Pichia guillerimondii, and P. pastoris, 

Schizosaccharomyces pornbe, and Yarrowia lipolytica. 

The lung cancer protein may also be made as a fusion protein, using techniques well 

known in the art. Thus, e.g., for the creation of monoclonal antibodies, if the desired epitope 

5 is small, the lung cancer protein may be fused to a carrier protein to form an immunogen. 

Alternatively, the lung cancer protein may be made as a fusion protein to increase expression 

for affinity purification purposes, or for other reasons. For example, when the lung cancer 

protein is a lung cancer peptide, the nucleic acid encoding the peptide may be linked to other 

nucleic acid for expression purposes. 

10 In a preferred embodiment, the lung cancer protein is purified or isolated after 

expression. Lung cancer proteins may be isolated or purified in a variety of appropriate 
ways. Standard purification methods include electrophoretic, molecular, immunological and 
chromatographic techniques, including ion exchange, hydrophobic, affinity, and reverse- 
phase HPLC chromatography, and chromatofocusing. For example, the lung cancer protein 

15 may be purified using a standard anti-lung cancer protein antibody column. Ultrafiltration 
and diafiltration techniques, in conjunction with protein concentration, are also useful. For 
general guidance in suitable purification techniques, see Scopes (1982) Protein Purification . 
The degree of purification necessary will vary depending on the use of the lung cancer 
protein. In some instances no purification will be necessary. 

20 Once expressed and purified if necessary, the lung cancer proteins and nucleic acids 

are useful in a number of applications. They may be used as immunoselection reagents, as 
vaccine reagents, as screening agents, therapeutic entities, for production of antibodies, as 
transcription or translation inhibitors, etc. 

25 Variants of lung cancer proteins 

In one embodiment, the lung cancer proteins are derivative or variant lung cancer 
proteins as compared to the wild-type sequence. That is, as outlined more fully below, the 
derivative lung cancer peptide will often contain at least one amino acid substitution, deletion 
or insertion, with amino acid substitutions being particularly preferred. The amino acid 
30 substitution, insertion or deletion may occur at a particular residue within the lung cancer 
peptide. 

Also included within one embodiment of lung cancer proteins of the present invention 
are amino acid sequence variants. These variants typically fall into one or more of three 

42 



WO 02/086443 PCT/US02/12476 

classes: substitutional, insertional or deletional variants. These variants ordinarily are 
prepared by site specific mutagenesis of nucleotides in the DNA encoding the lung cancer 
protein, using cassette or PCR mutagenesis or other techniques, to produce DNA encoding 
the variant, and thereafter expressing the DNA in recombinant cell culture as outlined above. 
5 However, variant lung cancer protein fragments having up to about 100-150 residues may be 
prepared by in vitro synthesis. Amino acid sequence variants are characterized by the 
predetermined nature of the variation, a feature that sets them apart from naturally occurring 
allelic or interspecies variation of the lung cancer protein amino acid sequence. The variants 
typically exhibit a similar qualitative biological activity as the naturally occurring analogue, 

10 although variants can also be selected which have modified characteristics as will be more 
fully outlined below. 

While the site or region for introducing an amino acid sequence variation is often 
predetermined, the mutation per se need not be predetermined. For example, in order to 
optimize the performance of a mutation at a given site, random mutagenesis may be 

15 conducted at the target codon or region and the expressed lung cancer variants screened for 
the optimal combination of desired activity. Techniques exist for making substitution 
mutations at predetermined sites in DNA having a known sequence, e.g., Ml 3 primer 
mutagenesis and PCR mutagenesis. Screening of mutants is often done using assays of lung 
cancer protein activities. 

20 Amino acid substitutions are typically of single residues; insertions usually will be on 

the order of from about 1 to 20 amino acids, although considerably larger insertions may be 
occasionally tolerated. Deletions generally range from about 1 to about 20 residues, although 
in some cases deletions may be much larger. 

Substitutions, deletions, insertions or any combination thereof may be used to arrive 

25 at a final derivative. Generally these changes are done on a few amino acids to minimize the 
alteration of the molecule. Larger changes may be tolerated in certain circumstances. When 
small alterations in the characteristics of a lung cancer protein are desired, substitutions are 
generally made in accordance with the amino acid substitution chart provided in the 
definition section. 

30 Variants typically exhibit essentially the same qualitative biological activity and will 

elicit the same immune response as a naturally-occurring analog, although variants also are 
selected to modify the characteristics of lung cancer proteins as needed. Alternatively, the 
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variant may be designed or reorganized such that a biological activity of the lung cancer 

protein is altered. For example, glycosylation sites may be added, altered, or removed. 

Covalent modifications of lung cancer polypeptides are included within the scope of 

this invention. One type of covalent modification includes reacting targeted amino acid 

5 residues of a lung cancer polypeptide with an organic derivatizing agent that is capable of 

reacting with selected side chains or the N-or C-terminal residues of a lung cancer 

polypeptide. Derivatization with bifunctional agents is useful, for instance, for crosslinking 

lung cancer polypeptides to a water-insoluble support matrix or surface for use in a method 

for purifying anti-lung cancer polypeptide antibodies or screening assays, as is more fully 

10 described below. Commonly used crosslinking agents include, e.g., l,l-bis(diazoacetyl)-2- 
phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, e.g., esters with 4-azidosalicylic 
acid, homobifunctional imidoesters, including disuccinimidyl esters such as 3,3'- 
dithiobis(succinimidylpropionate), bifunctional maleimides such as bis-N-maleimido-1,8- 
octane and agents such as methyl-3-((p-azidophenyl)dithio)propipimidate. 

15 Other modifications include deamidation of glutaminyl and asparaginyl residues to 

the corresponding glutamyl and aspartyl residues, respectively, hydroxylation of proline and 
lysine, phosphorylation of hydroxyl groups of serinyl, threonyl or tyrosyl residues, 
methylation of the y-amino groups of lysine, arginine, and histidine side chains (Creighton 
(1983) Proteins: Structure and Molecular Properties , pp. 79-86), acetylation of the N-terminal 

20 amine, and amidation of any C-terminal carboxyl group. 

Another type of covalent modification of the lung cancer polypeptide encompassed by 
this invention is an altered native glycosylation pattern of the polypeptide. "Altering the 
native glycosylation pattern" is intended herein to mean adding to or deleting one or more 
carbohydrate moieties of a native sequence lung cancer polypeptide. Glycosylation patterns 

25 can be altered in many ways. For example the use of different cell types to express lung 
cancer-associated sequences can result in different glycosylation patterns. 

Addition of glycosylation sites to lung cancer polypeptides may also be accomplished 
by altering the amino acid sequence thereof. The alteration may be made, e.g., by the 
addition of, or substitution by, one or more serine or threonine residues to the native sequence 

30 lung cancer polypeptide (for O-linked glycosylation sites). The lung cancer amino acid 
sequence may optionally be altered through changes at the DNA level, particularly by 
mutating the DNA encoding the lung cancer polypeptide at preselected bases such that 
codons are generated that will translate into the desired amino acids. 
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Another means of increasing the number of carbohydrate moieties on the lung cancer 
polypeptide is by chemical or enzymatic coupling of glycosides to the polypeptide. Such 
methods are described in the art, e.g., in WO 87/05330, and in Aplin and Wriston (1981) 
CRC Crit. Rev. Biochem.. pp. 259-306. 
5 Removal of carbohydrate moieties present on the lung cancer polypeptide may be 

accomplished chemically or enzymatically or by mutational substitution of codons encoding 
for amino acid residues that serve as targets for glycosylation. Chemical deglycosylation 
techniques are known in the art and described, for instance, by Hakimuddin, et al. (1987) 
Arch. Biochem. Biophvs.. 259:52 and by Edge, et al. (1981) Anal. Biochem. , 118:131. 
10 Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the use of a 
variety of endo-and exo-glycosidases as described by Thotakura, et al. (1987) Meth. 
Enzvmol. . 138:350. 

Another type of covalent modification of lung cancer comprises linking the lung 
cancer polypeptide to one of a variety of nonproteinaceous polymers, e.g., polyethylene 

15 glycol, polypropylene glycol, or polyoxyalkylenes, in the manner set forth in U.S. Patent 
Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192, or 4,179,337. 

Lung cancer polypeptides of the present invention may also be modified in a way to 
form chimeric molecules comprising a lung cancer polypeptide fused to another, 
heterologous polypeptide or amino acid sequence. In one embodiment, such a chimeric 

20 molecule comprises a fusion of a lung cancer polypeptide with a tag polypeptide which 
provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is 
generally placed at the amino-or carboxyl-terminus of the lung cancer polypeptide. The 
presence of such epitope-tagged forms of a lung cancer polypeptide can be detected using an 
antibody against the tag polypeptide. Also, provision of the epitope tag enables the lung 

25 cancer polypeptide to be readily purified by affinity purification using an anti-tag antibody or 
another type of affinity matrix that binds to the epitope tag. In an alternative embodiment, 
the chimeric molecule may comprise a fusion of a lung cancer polypeptide with an 
immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the 
chimeric molecule, such a fusion could be to the Fc region of an IgG molecule. 

30 Various tag polypeptides and their respective antibodies are well known and examples 

include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; HIS6 and metal 
chelation tags, the flu HA tag polypeptide and its antibody 12CA5 (Field, et al. (1988) Mol. 
Cell. Biol. 8:2159-2165); the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 9E10 antibodies 
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thereto (Evan, et al. (1985) Molecular and Cellular Biology 5:3610-3616); and the Herpes 

Simplex virus glycoprotein D (gD) tag and its antibody (Paborsky, et al. (1990) Protein 

Engineering 3(6):547-553). Other tag polypeptides include the Flag-peptide (Hopp, et al. 

(1988) BioTechnologv 6:1204-1210); the KT3 epitope peptide (Martin, et al. (1992) Science 

5 255:192-194); tubulin epitope peptide (Skinner, et al. (1991) J. Biol. Chem. 266:15163- 

15166); and the T7 gene 10 protein peptide tag (Lutz-Freyermuth, et al. (1990) Proc. Nat'l 

Acad. Sci. USA 87:6393-6397). 

Also included are other lung cancer proteins of the lung cancer family, and lung 

cancer proteins from other organisms, which are cloned and expressed as outlined below. 

10 Thus, probe or degenerate polymerase chain reaction (PCR) primer sequences may be used to 
find other related lung cancer proteins from primates or other organisms. As will be 
appreciated by those in the art, particularly useful probe and/or PCR primer sequences 
include unique areas of the lung cancer nucleic acid sequence. As is generally known in the 
art, preferred PCR primers are from about 15 to about 35 nucleotides in length, with from 

15 about 20 to about 30 being preferred, and may contain inosine as needed. PCR reaction 
conditions are well known in the art (e.g., Innis, PCR Protocols, supra). 

Antibodies to lung cancer proteins 

In a preferred embodiment, when a lung cancer protein is to be used to generate 
20 antibodies, e.g., for immunotherapy or immunodiagnosis, the lung cancer protein should 
share at least one epitope or determinant with the full length protein. By "epitope" or 
"determinant" herein is typically meant a portion of a protein which will generate and/or bind 
an antibody or T-cell receptor in the context of MHC. Thus, in most instances, antibodies 
made to a smaller lung cancer protein will be able to bind to the full-length protein, 
25 particularly linear epitopes. In a preferred embodiment, the epitope is unique; that is, 
antibodies generated to a unique epitope show little or no cross-reactivity. 

Methods of preparing polyclonal antibodies are well known (e.g., Coligan, supra; and 
Harlow and Lane, supra). Polyclonal antibodies can be raised in a mammal, e.g., by one or 
more injections of an immunizing agent and, if desired, an adjuvant. Typically, the 
30 immunizing agent and/or adjuvant will be injected in the mammal by multiple subcutaneous 
or intraperitoneal injections. The immunizing agent may include a protein encoded by a 
nucleic acid of Tables 1 A- 16 or fragment thereof or a fusion protein thereof. It may be useful 
to conjugate the immunizing agent to a protein known to be immunogenic in the mammal 
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being immunized. Immunogenic proteins include, e.g., keyhole limpet hemocyanin, serum 
albumin, bovine thyroglobulin, and soybean trypsin inhibitor. Adjuvants include, e.g., 
Freund's complete adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic 
trehalose dicorynomycolate). The immunization protocol may be selected by one skilled in 
5 the art. 

The antibodies may, alternatively, be monoclonal antibodies. Monoclonal antibodies 
may be prepared using hybridoma methods, such as those described by Kohler and Milstein 
(1975) Nature 256:495. In a hybridoma method, a mouse, hamster, or other appropriate host 
animal, is typically immunized with an immunizing agent to elicit lymphocytes that produce 
10 or are capable of producing antibodies that will specifically bind to the immunizing agent. 
Alternatively, the lymphocytes may be immunized in vitro. The immunizing agent will 
typically include a polypeptide encoded by a nucleic acid of the tables, or fragment thereof, 
or a fusion protein thereof. Generally, either peripheral blood lymphocytes ("PBLs") are 
used if cells of human origin are desired, or spleen cells or lymph node cells are used if non- 
15 human mammalian sources are desired. The lymphocytes are then fused with an 

immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a 
hybridoma cell (Goding (1986) Monoclonal Antibodies: Principles and Practice , pp. 59-103 ). 
Immortalized cell lines are usually transformed mammalian cells, particularly myeloma cells 
of rodent, bovin, or primate origin. Usually, rat or mouse myeloma cell lines are employed. 
20 The hybridoma cells may be cultured in a suitable culture medium that preferably contains 
one or more substances that inhibit the growth or survival of the unfused, immortalized cells. 
For example, if the parental cells lack the enzyme hypoxanthine guanine phosphoribosyl 
transferase (HGPRT or HPRT), the culture medium for the hybridomas typically will include 
hypoxanthine, aminopterin, and thymidine ("HAT medium"), which substances prevent the 
25 growth of HGPRT-deficient cells. 

In one embodiment, the antibodies are bispecific antibodies. Bispecific antibodies are 
typically monoclonal, preferably human or humanized, antibodies that have binding 
specificities for at least two different antigens or that have binding specificities for two 
epitopes on the same antigen. In one embodiment, one of the binding specificities is for a 
30 protein encoded by a nucleic acid of the tables or a fragment thereof, the other one is for any 
other antigen, and preferably for a cell-surface protein or receptor or receptor subunit, 
preferably one that is tumor specific. Alternatively, tetramer-type technology may create 
multivalent reagents. 
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In a preferred embodiment, the antibodies to lung cancer protein are capable of 
reducing or eliminating a biological function of a lung cancer protein, in a naked form or 
conjugated to an effector moiety. That is, the addition of anti-lung cancer protein antibodies 
(either polyclonal or preferably monoclonal) to lung cancer tissue (or cells containing lung 
5 cancer) may reduce or eliminate the lung cancer. Generally, at least a 25% decrease in 
activity, growth, size or the like is preferred, with at least about 50% being particularly 
preferred and about a 95-100% decrease being especially preferred. 

In a preferred embodiment the antibodies to the lung cancer proteins are humanized 
antibodies (e.g., Xenerex Biosciences, Medarex, Inc., Abgenix, Inc., Protein Design Labs, 

10 Inc.) Humanized forms of non-human (e.g., murine) antibodies are chimeric molecules of 
immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab 5 , 
F(ab')2 or other antigen-binding subsequences of antibodies) which contain minimal 
sequence derived from non-human immunoglobulin. Humanized antibodies include human 
immunoglobulins (recipient antibody) in which residues from a complementary determining 

1 5 region (CDR) of the recipient are replaced by residues from a CDR of a non-human species 
(donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and 
capacity. In some instances, Fv framework residues of a human immunoglobulin are 
replaced by corresponding non-human residues. Humanized antibodies may also comprise 
residues which are found neither in the recipient antibody nor in the imported CDR or 

20 framework sequences. In general, a humanized antibody will comprise substantially all of at 
least one, and typically two, variable domains, in which all or substantially all of the CDR 
regions correspond to those of a non-human immunoglobulin and all or substantially all of 
the framework (FR) regions are those of a human immunoglobulin consensus sequence. A 
humanized antibody optimally also will typically comprise at least a portion of an 

25 immunoglobulin constant region (Fc), typically that of a human immunoglobulin (Jones, et 
al. (1986) Nature 321:522-525; Riechmann, etal. (1988) Nature 332:323-329; andPresta 
(1992) Curr. Op. Struct. Biol. 2:593-596). Humanization can be performed following the 
method of Winter and co-workers (Jones, et al. (1986) Nature 321:522-525; Riechmann, et al. 
(1988) Nature 332:323-327; Verhoeyen, et al. (1988) Science 239:1534-1536), by 

30 substituting rodent CDRs or CDR sequences for corresponding sequences of a human 

antibody. Accordingly, such humanized antibodies are chimeric antibodies (U.S. Patent No. 
4,816,567), wherein substantially less than an intact human variable domain has been 
substituted by corresponding sequence from a non-human species. 
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Human-like antibodies can also be produced using various techniques known in the 
art, including phage display libraries (Hoogenboom and Winter (1991) J. Mol. Bio l. 227:381; 
Marks, et al. (1991) J. Mol. Biol. 222:581). The techniques of Cole, et al. and Boerner, et al. 
are also available for the preparation of human monoclonal antibodies (Cole, et al. (1985) 
5 Monoclonal Antibodies and Cancer Therapy , p. 77 and Boerner, et al. (1991) J. Immunol. 
147(l):86-95). Similarly, human antibodies can be made by introducing human 
immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous 
immunoglobulin genes have been partially or completely inactivated. Upon challenge, 
human antibody production is observed, which closely resembles that seen in humans in 

10 nearly all respects, including gene rearrangement, assembly, and antibody repertoire. This 
approach is described, e.g., in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 
5,633,425; 5,661,016, and in the following scientific publications: Marks, et al. (1992) 
Bio/Technology 10:779-783; Lonberg, et al. (1994) Nature 368:856-859; Morrison (1994) 
Nature 368:812-13; Fishwild, et al. (1996) Nature Biotechnology 14:845-51; Neuberger 

15 (1996) Nature Biotechnology 14:826; and Lonberg and Huszar (1995) Intern. Rev. Immunol. 
13:65-93. 

By immunotherapy is meant treatment of lung cancer with an antibody raised against 
a lung cancer proteins. As used herein, immunotherapy can be passive or active. Passive 
immunotherapy as defined herein is the passive transfer of antibody to a recipient (patient). 

20 Active immunization is the induction of antibody and/or T-cell responses in a recipient 

(patient). Induction of an immune response is the result of providing the recipient with an 
antigen to which antibodies are raised. The antigen may be provided by injecting a 
polypeptide against which antibodies are desired to be raised into a recipient, or contacting 
the recipient with a nucleic acid capable of expressing the antigen and under conditions for 

25 expression of the antigen, leading to an immune response. 

In a preferred embodiment the lung cancer proteins against which antibodies are 
raised are secreted proteins as described above. Without being bound by theory, antibodies 
used for treatment, may bind and prevent the secreted protein from binding to its receptor, 
thereby inactivating the secreted lung cancer protein. 

30 In another preferred embodiment, the lung cancer protein to which antibodies are 

raised is a transmembrane protein. Without being bound by theory, antibodies used for 
treatment may bind the extracellular domain of the lung cancer protein and prevent it from 
binding to other proteins, such as circulating ligands or cell-associated molecules. The 
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antibody may cause down-regulation of the transmembrane lung cancer protein. The 

antibody may be a competitive, non-competitive or uncompetitive inhibitor of protein binding 

to the extracellular domain of the lung cancer protein. The antibody may be an antagonist of 

the lung cancer protein or may prevent activation of a transmembrane lung cancer protein, or 

5 may induce or suppress a particular cellular pathway. In some embodiments, when the 

antibody prevents the binding of other molecules to the lung cancer protein, the antibody 

prevents growth of the cell. The antibody may also be used to target or sensitize the cell to 

cytotoxic agents, including, but not limited to TNF-oc, TNF-p, IL-1, INF-y, and IL-2, or 

chemotherapeutic agents including 5FU, vinblastine, actinomycin D, cisplatin, methotrexate, 

10 and the like. In some instances the antibody may belong to a sub-type that activates serum 

complement when complexed with the transmembrane protein thereby mediating cytotoxicity 
or antigen-dependent cytotoxicity (ADCC). Thus, lung cancer may be treated by 
administering to a patient antibodies directed against the transmembrane lung cancer protein. 
Antibody-labeling may activate a co-toxin, localize a toxin payload, or otherwise provide 

15 means to locally ablate cells. 

In another preferred embodiment, the antibody is conjugated to an effector moiety. 
The effector moiety can be various molecules, including labeling moieties such as radioactive 
labels or fluorescent labels, or can be a therapeutic moiety. In one aspect the therapeutic 
moiety is a small molecule that modulates the activity of a lung cancer protein. In another 

20 aspect the therapeutic moiety may modulate an activity of molecules associated with or in 
close proximity to a lung cancer protein. The therapeutic moiety may inhibit enzymatic or 
signaling activity such as protease or collagenase activity associated with lung cancer. 

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic agent. In 
this method, targeting the cytotoxic agent to lung cancer tissue or cells results in a reduction 

25 in the number of afflicted cells, thereby reducing symptoms associated with lung cancer. 

Cytotoxic agents are numerous and varied and include, but are not limited to, cytotoxic drugs 
or toxins or active fragments of such toxins. Suitable toxins and their corresponding 
fragments include diphtheria A chain, exotoxin A chain, ricin A chain, abrin A chain, curcin, 
crotin, phenomycin, enomycin, saporin, auristatin, and the like. Cytotoxic agents also include 

30 radiochemicals made by conjugating radioisotopes to antibodies raised against lung cancer 
proteins, or binding of a radionuclide to a chelating agent that has been covalently attached to 
the antibody. Targeting the therapeutic moiety to transmembrane lung cancer proteins not 
only serves to increase the local concentration of therapeutic moiety in the lung cancer 
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afflicted area, but also serves to reduce deleterious side effects that may be associated with 
the untargeted therapeutic moiety. 

In another preferred embodiment, the lung cancer protein against which the antibodies 
are raised is an intracellular protein. In this case, the antibody may be conjugated to a protein 
5 or other entity which facilitates entry into the cell. In one case, the antibody enters the cell by 
endocytosis. In another embodiment, a nucleic acid encoding the antibody is administered to 
the individual or cell. Moreover, wherein the lung cancer protein can be targeted within a 
cell, i.e., the nucleus, an antibody theretomay contain a signal for that target localization, i.e., 
a nuclear localization signal. 
10 The lung cancer antibodies of the invention specifically bind to lung cancer proteins. 

By "specifically bind" herein is meant that the antibodies bind to the protein with a of at 
least about 0.1 mM, more usually at least about 1 |iM, preferably at least about 0.1 |iM or 
better, and most preferably, 0.01 |iM or better. Selectivity of binding to the specific target 
and not to related other sequences is also important. 

15 

Detection of lung cancer sequence for diagnostic and therapeutic applications 

In one aspect, the RNA expression levels of genes are determined for different 
cellular states in the lung cancer phenotype. Expression levels of genes in normal tissue (e.g., 
not undergoing lung cancer), in lung cancer tissue (and in some cases, for varying severities 

20 of lung cancer that relate to prognosis, as outlined below), or in non-malignant disease are 
evaluated to provide expression profiles. A gene expression profile of a particular cell state 
or point of development is essentially a "fingerprint" of the state of the cell. While two states 
may have a particular gene similarly expressed, the evaluation of a number of genes 
simultaneously allows the generation of a gene expression profile that is reflective of the state 

25 of the cell. By comparing expression profiles of cells in different states, information 

regarding which genes are important (including both up- and down-regulation of genes) in 
each of these states is obtained. Then, diagnosis may be performed or confirmed to 
determine whether a tissue sample has the gene expression profile of normal or cancerous 
tissue. This will provide for molecular diagnosis of related conditions. 

30 "Differential expression," or grammatical equivalents as used herein, refers to 

qualitative or quantitative differences in the temporal and/or cellular gene expression 
patterns within and among cells and tissue. Thus, a differentially expressed gene can 
qualitatively have its expression altered, including an activation or inactivation, in, e.g., 
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normal versus lung cancer tissue. Genes may be turned on or turned off in a particular state, 
relative to another state thus permitting comparison of two or more states. A qualitatively 
regulated gene will exhibit an expression pattern within a state or cell type which is 
detectable by standard techniques. Some genes will be expressed in one state or cell type, but 
5 not in both. Alternatively, the difference in expression may be quantitative, e.g., in that 
expression is increased or decreased; i.e., gene expression is either upregulated, resulting in 
an increased amount of transcript, or downregulated, resulting in a decreased amount of 
transcript. The degree to which expression differs need only be large enough to quantify via 
standard characterization techniques as outlined below, such as by use of Affymetrix 

10 GeneChip™ expression arrays, Lockhart (1996) Nature Biotechnology 14: 1675-1680, hereby 
expressly incorporated by reference. Other techniques include, but are not limited to, 
quantitative reverse transcriptase PCR, northern analysis and RNase protection. As outlined 
above, preferably the change in expression (i.e., upregulation or downregulation) is typically 
at least about 50%, more preferably at least about 100%, more preferably at least about 

15 150%, more preferably at least about 200%, with from 300 to at least 1000% being especially 
preferred. 

Evaluation may be at the gene transcript or the protein level. The amount of gene 
expression may be monitored using nucleic acid probes to the RNA or DNA equivalent of the 
gene transcript, and the quantification of gene expression levels, or, alternatively, the final 

20 gene product itself (protein) can be monitored, e.g., with antibodies to the lung cancer protein 
and standard immunoassays (ELISAs, etc.) or other techniques, including mass spectroscopy 
assays, 2D gel electrophoresis assays, etc. Proteins corresponding to lung cancer genes, e.g., 
those identified as being important in a lung cancer or disease phenotype, can be evaluated in 
a lung cancer diagnostic test. In a preferred embodiment, gene expression monitoring is 

25 performed simultaneously on a number of genes. 

The lung cancer nucleic acid probes may be attached to biochips as outlined herein for 
the detection and quantification of lung cancer sequences in a particular cell. The assays are 
further described below in the example. PCR techniques can be used to provide greater 
sensitivity. Multiple protein expression monitoring can be performed as well. Similarly, 

30 these assays may be performed on an individual basis as well. 

In a preferred embodiment nucleic acids encoding the lung cancer protein are 
detected. Although DNA or RNA encoding the lung cancer protein may be detected, of 
particular interest are methods wherein an mRNA encoding a lung cancer protein is detected. 
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Probes to detect mRNA can be a nucleotide/deoxynucleotide probe that is complementary to 
and hybridizes with the mRNA and includes, but is not limited to, oligonucleotides, cDNA or 
RNA. Probes also should contain a detectable label, as defined herein. In one method the 
mRNA is detected after immobilizing the nucleic acid to be examined on a solid support such 
5 as nylon membranes and hybridizing the probe with the sample. Following washing to 

remove the non-specifically bound probe, the label is detected. In another method detection 
of the mRNA is performed in situ. In this method permeabilized cells or tissue samples are 
contacted with a detectably labeled nucleic acid probe for sufficient time to allow the probe 
to hybridize with the target mRNA. Following washing to remove the non-specifically bound 

10 probe, the label is detected. For example a digoxygenin labeled riboprobe (RNA probe) that 
is complementary to the mRNA encoding a lung cancer protein is detected by binding the 
digoxygenin with an anti-digoxygenin secondary antibody and developed with nitro blue 
tetrazolium and 5 -bromo-4-chloro -3 -indoyl phosphate. 

In a preferred embodiment, various proteins from the three classes of proteins as 

15 described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic 
assays. The lung cancer proteins, antibodies, nucleic acids, modified proteins and cells 
containing lung cancer sequences are used in diagnostic assays. This can be performed on an 
individual gene or corresponding polypeptide level. In a preferred embodiment, the 
expression profiles are used, preferably in conjunction with high throughput screening 

20 techniques to allow monitoring for expression profile genes and/or corresponding 
polypeptides. 

As described and defined herein, lung cancer proteins, including intracellular, 
transmembrane, or secreted proteins, find use as markers of lung cancer, e.g., for prognostic 
or diagnostic purposes. Detection of these proteins in putative lung cancer tissue allows for 

25 detection, prognosis, or diagnosis of lung cancer or similar disease, and perhaps for selection 
of therapeutic strategy. In one embodiment, antibodies are used to detect lung cancer 
proteins. A preferred method separates proteins from a sample by electrophoresis on a gel 
(typically a denaturing and reducing protein gel, but may be another type of gel, including 
isoelectric focusing gels and the like). Following separation of proteins, the lung cancer 

30 protein is detected, e.g., by immunoblotting with antibodies raised against the lung cancer 
protein. Methods of immunoblotting are well known to those of ordinary skill in the art. 

In another preferred method, antibodies to the lung cancer protein find use in in situ 
imaging techniques, e.g., in histology (e.g., Asai (ed. 1993) Methods in Cell Biology: 
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Antibodies in Cell Biology , volume 37. In this method cells are contacted with from one to 

many antibodies to the lung cancer protein(s). Following washing to remove non-specific 

antibody binding, the presence of the antibody or antibodies is detected. In one embodiment 

the antibody is detected by incubating with a secondary antibody that contains a detectable 

5 label, e.g., multicolor fluorescence or confocal imaging. In another method the primary 

antibody to the lung cancer protein(s) contains a detectable label, e.g., an enzyme marker that 

can act on a substrate. In another preferred embodiment each one of multiple primary 

antibodies contains a distinct and detectable label. This method finds particular use in 

simultaneous screening for a plurality of lung cancer proteins. Many other histological 

10 imaging techniques are also provided by the invention. 

In a preferred embodiment the label is detected in a fluorometer which has the ability 
to detect and distinguish emissions of different wavelengths. In addition, a fluorescence 
activated cell sorter (FACS) can be used in the method. 

In another preferred embodiment, antibodies find use in diagnosing lung cancer from 

15 blood, serum, plasma, stool, and other samples. Such samples, therefore, are useful as 

samples to be probed or tested for the presence of lung cancer proteins. Antibodies can be 
used to detect a lung cancer protein by previously described immunoassay techniques 
including ELIS A, immunoblotting (western blotting), immunoprecipitation, BIACORE 
technology and the like. Conversely, the presence of antibodies may indicate an immune 

20 response against an endogenous lung cancer protein or vaccine. 

In a preferred embodiment, in situ hybridization of labeled lung cancer nucleic acid 
probes to tissue arrays is done. For example, arrays of tissue samples, including lung cancer 
tissue and/or normal tissue, are made. In situ hybridization (see, e.g., Ausubel, supra) is then 
performed. When comparing the fingerprints between an individual and a standard, the 

25 skilled artisan can make a diagnosis, a prognosis, or a prediction based on the findings. It is 
further understood that the genes which indicate the diagnosis may differ from those which 
indicate the prognosis and molecular profiling of the condition of the cells may lead to 
distinctions between responsive or refractory conditions or may be predictive of outcomes. 
In a preferred embodiment, the lung cancer proteins, antibodies, nucleic acids, 

30 modified proteins and cells containing lung cancer sequences are used in prognosis assays. 
As above, gene expression profiles can be generated that correlate to lung cancer, clinical, 
pathological, or other information, in terms of long term prognosis. Again, this may be done 
on either a protein or gene level, with the use of genes being preferred. Single or multiple 
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genes may be useful in various combinations. As above, lung cancer probes may be attached 

to biochips for the detection and quantification of lung cancer sequences in a tissue or patient. 

The assays proceed as outlined above for diagnosis. PCR method may provide more 

sensitive and accurate quantification. 

5 

Assays for therapeutic compounds 

In a preferred embodiment, the proteins, nucleic acids, and antibodies as described 
herein are used in drug screening assays. The lung cancer proteins, antibodies, nucleic acids, 
modified proteins and cells containing lung cancer sequences are used in drug screening 

10 assays or by evaluating the effect of drug candidates on a "gene expression profile" or 

expression profile of polypeptides. In a preferred embodiment, the expression profiles are 
used, preferably in conjunction with high throughput screening techniques to allow 
monitoring for expression profile genes after treatment with a candidate agent (e.g., 
Zlokarnik, et al. (1998) Science 279:84-8; Heid (1996) Genome Res. 6:986-94. 

15 In a preferred embodiment, the lung cancer proteins, antibodies, nucleic acids, 

modified proteins and cells containing the native or modified lung cancer proteins are used in 
screening assays. That is, the present invention provides novel methods for screening for 
compositions which modulate the lung cancer phenotype or an identified physiological 
function of a lung cancer protein. As above, this can be done on an individual gene level or 

20 by evaluating the effect of drug candidates on a "gene expression profile". In a preferred 

embodiment, the expression profiles are used, preferably in conjunction with high throughput 
screening techniques to allow monitoring for expression profile genes after treatment with a 
candidate agent, see Zlokarnik, supra. 

Having identified differentially expressed genes herein, a variety of assays may be 

25 performed. In a preferred embodiment, assays may be run on an individual gene or protein 
level. That is, having identified a particular gene with altered regulation in lung cancer, test 
compounds can be screened for the ability to modulate gene expression or for binding to the 
lung cancer protein. "Modulation" thus includes an increase or a decrease in gene 
expression. The preferred amount of modulation will depend on the original change of the 

30 gene expression in normal versus tissue undergoing lung cancer, with changes of at least 

10%, preferably 50%, more preferably 100-300%, and in some embodiments 300-1000%) or 
greater. Thus, if a gene exhibits a 4-fold increase in lung cancer tissue compared to normal 
tissue, a decrease of about four-fold is often desired; similarly, a 10-fold decrease in lung 



55 



WO 02/086443 PCT/US02/12476 
cancer tissue compared to normal tissue often provides a target value of a 10-fold increase in 

expression to be induced by the test compound. 

The amount of gene expression may be monitored using nucleic acid probes and the 

quantification of gene expression levels, or, alternatively, the gene product itself can be 

5 monitored, e.g., through the use of antibodies to the lung cancer protein and standard 

immunoassays. Proteomics and separation techniques may also allow quantification of 

expression. 

In a preferred embodiment, gene or protein expression monitoring of a number of 
entities, i.e., an expression profile, is monitored simultaneously. Such profiles will typically 
1 0 involve a plurality of those entities described herein. 

In this embodiment, the lung cancer nucleic acid probes are attached to biochips as 
outlined herein for the detection and quantification of lung cancer sequences in a particular 
cell. Alternatively, PGR may be used. Thus, a series, e.g., of microtiter plate, may be used 
with dispensed primers in desired wells. A PGR reaction can then be performed and analyzed 
15 for each well. 

Expression monitoring can be performed to identify compounds that modify the 
expression of one or more lung cancer-associated sequences, e.g., a polynucleotide sequence 
set out in the tables. Generally, in a preferred embodiment, a test compound is added to the 
cells prior to analysis. Moreover, screens are also provided to identify agents that modulate 

20 lung cancer, modulate lung cancer proteins, bind to a lung cancer protein, or interfere with 
the binding of a lung cancer protein and an antibody, substrate, or other binding partner. 

The term "test compound" or "drug candidate" or "modulator" or grammatical 
equivalents as used herein describes a molecule, e.g., protein, oligopeptide, small organic 
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or. 

25 indirectly alter the lung cancer phenotype or the expression of a lung cancer sequence, e.g., a 
nucleic acid or protein sequence. In preferred embodiments, modulators alter expression 
profiles of nucleic acids or proteins provided herein. In one embodiment, the modulator 
suppresses a lung cancer phenotype, e.g., to a normal or non-malignant tissue fingerprint. In 
another embodiment, a modulator induces a lung cancer phenotype. Generally, a plurality of 

30 assay mixtures are run in parallel with different agent concentrations to obtain a differential 
response to the various concentrations. Typically, one of these concentrations serves as a 
negative control, i.e., at zero concentration or below the level of detection. 
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In one aspect, a modulator will neutralize the effect of a lung cancer protein. By 
"neutralize" is meant that activity of a protein and the consequent effect on the cell is 
inhibited or blocked. 

In certain embodiments, combinatorial libraries of potential modulators will be 
screened for an ability to bind to a lung cancer polypeptide or to modulate activity. 
Conventionally, new chemical entities with useful properties are generated by identifying a 
chemical compound (called a "lead compound") with some desirable property or activity, 
e.g., inhibiting activity, creating variants of the lead compound, and evaluating the property 
and activity of those variant compounds. Often, high throughput screening (HTS) methods 
are employed for such an analysis. 

In one preferred embodiment, high throughput screening methods involve providing a 
library containing a large number of potential therapeutic compounds (candidate 
compounds). Such "combinatorial chemical libraries" are then screened in one or more 
assays to identify those library members (particular chemical species or subclasses) that 
display a desired characteristic activity. The compounds thus identified can serve as 
conventional "lead compounds" or can themselves be used as potential or actual therapeutics. 

A combinatorial chemical library is a collection of diverse chemical compounds 
generated by either chemical synthesis or biological synthesis by combining a number of 
chemical "building blocks" such as reagents. For example, a linear combinatorial chemical 
library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of chemical 
building blocks called amino acids in every possible way for a given compound length (i.e., 
the number of amino acids in a polypeptide compound). Millions of chemical compounds 
can be synthesized through such combinatorial mixing of chemical building blocks (Gallop, 
et al. (1994) J. Med. Chem. 37(9):1233-1251). 

Preparation and screening of combinatorial chemical libraries is well known to those 
of skill in the art. Such combinatorial chemical libraries include, but are not limited to, 
peptide libraries (see, e.g., U.S. Patent No. 5,010,175, Furka (1991) Pept. Prot. Res. 37:487- 
493, Houghton, et al. (1991) Nature, 354:84-88), peptoids (PCT Publication No WO 
91/19735), encoded peptides (PCT Publication WO 93/20242), random bio-oligomers (PCT 
Publication WO 92/00091), benzodiazepines (U.S. Pat. No. 5,288,514), diversomers such as 
hydantoins, benzodiazepines and dipeptides (Hobbs, et al. (1993) Proc. Nat. Acad. Sci. USA 
90:6909-6913), vinylogous polypeptides (Hagihara, et al. (1992) J. Amer. Chem. Soc. 
1 14:6568), nonpeptidal peptidomimetics with a Beta-D-Glucose scaffolding (Hirschmann, et 
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al. (1992) J. Amer. Chem. Soc. 114:9217-9218), analogous organic syntheses of small 

compound libraries (Chen, et al. (1994) J. Amer. Chem. Soc. 116:2661), oligocarbamates 

(Cho, et al. (1993) Science 261:1303), and/or peptidyl phosphonates (Campbell, et al. (1994) 

J. Org. Chem. 59:658). See, generally, Gordon, et al. (1994) J. Med. Chem. 37:1385, nucleic 

5 acid libraries (see, e.g., Stratagene, Corp.), peptide nucleic acid libraries (see, e.g., U.S. 

Patent 5,539,083), antibody libraries (see, e.g., Vaughn, et al. (1996) Nature Biotechnology 

14(3):309-314, and PCT/US96/10287), carbohydrate libraries (see, e.g., Liang, et al. (1996) 

Science 274:1520-1522, and U.S. Patent No. 5,593,853), and small organic molecule libraries 

(see, e.g., benzodiazepines, Baum (1993) C&EN, Jan 18, page 33; isoprenoids, U.S. Patent 

10 No. 5,569,588; thiazolidinones and metathiazanones, U.S. Patent No. 5,549,974; pyrrolidines, 
U.S. Patent Nos. 5,525,735 and 5,519,134; morpholino compounds, U.S. Patent No. 
5,506,337; benzodiazepines, U.S. Patent No. 5,288,514; and the like). 

Devices for the preparation of combinatorial libraries are commercially available (see, 
e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony, Rainin, 

15 Woburn, MA, 433A Applied Biosystems, Foster City, CA, 9050 Plus, Millipore, Bedford, 
MA). 

A number of well known robotic systems have also been developed for solution phase 
chemistries. These systems include automated workstations like the automated synthesis 
apparatus developed by Takeda Chemical Industries, LTD. (Osaka, Japan) and many robotic 

20 systems utilizing robotic arms (Zymate II, Zymark Corporation, Hopkinton, Mass.; Orca, 

Hewlett-Packard, Palo Alto, Calif.), which mimic the manual synthetic operations performed 
by a chemist. The above devices, with appropriate modification, are suitable for use with the 
present invention. In addition, numerous combinatorial libraries are themselves 
commercially available (see, e.g., ComGenex, Princeton, N.J., Asinex, Moscow, Ru, Tripos, 

25 Inc., St. Louis, MO, ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, PA, Martek 
Biosciences, Columbia, MD, etc.). 

The assays to identify modulators are amenable to high throughput screening. 
Preferred assays thus detect modulation of lung cancer gene transcription, polypeptide 
expression, and polypeptide activity. 

30 High throughput assays for evaluating the presence, absence, quantification, or other 

properties of particular nucleic acids or protein products are well known to those of skill in 
the art. Similarly, binding assays and reporter gene assays are similarly well known. Thus, 
e.g., U.S. Patent No. 5,559,410 discloses high throughput screening methods for proteins, 
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U.S. Patent No. 5,585,639 discloses high throughput screening methods for nucleic acid 

binding (i.e., in arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high 

throughput methods of screening for ligand/antibody binding. 

In addition, high throughput screening systems are commercially available (see, e.g., 

5 Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman 

Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems 

typically automate procedures, including sample and reagent pipetting, liquid dispensing, 

timed incubations, and final readings of the microplate in detector(s) appropriate for the 

assay. These configurable systems provide high throughput and rapid start up as well as a 

10 high degree of flexibility and customization. The manufacturers of such systems provide 
detailed protocols for various high throughput systems. Thus, e.g., Zymark Corp. provides 
technical bulletins describing screening systems for detecting the modulation of gene 
transcription, ligand binding, and the like. 

In one embodiment, modulators are proteins, often naturally occurring proteins or 

15 fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing proteins, or 
random or directed digests of proteinaceous cellular extracts, may be used. In this way 
libraries of proteins may be made for screening in the methods of the invention. Particularly 
preferred in this embodiment are libraries of bacterial, fungal, viral, and mammalian proteins, 
with the latter being preferred, and human proteins being especially preferred. Particularly 

20 useful test compound will be directed to the class of proteins to which the target belongs, e.g., 
substrates for enzymes or ligands and receptors. 

In a preferred embodiment, modulators are peptides of from about 5 to about 30 
amino acids, with from about 5 to about 20 amino acids being preferred, and from about 7 to 
about 15 being particularly preferred. The peptides may be digests of naturally occurring 

25 proteins, random peptides, or "biased" random peptides. By "randomized" or grammatical 
equivalents herein is meant that the nucleic acid or peptide consists of essentially random 
sequences of nucleotides and amino acids, respectively. Since these random peptides (or 
nucleic acids, discussed below) are often chemically synthesized, they may incorporate a 
nucleotide or amino acid at any position. The synthetic process can be designed to generate 

30 randomized proteins or nucleic acids, to allow the formation of all or most of the possible 

combinations over the length of the sequence, thus forming a library of randomized candidate 
bioactive proteinaceous agents. 
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In one embodiment, the library is fully randomized, with no sequence preferences or 
constants at any position. In a preferred embodiment, the library is biased. That is, some 
positions within the sequence are either held constant, or are selected from a limited number 
of possibilities. In a preferred embodiment, the nucleotides or amino acid residues are 
5 randomized within a defined class, e.g., of hydrophobic amino acids, hydrophilic residues, 
sterically biased (either small or large) residues, towards the creation of nucleic acid binding 
domains, the creation of cysteines, for cross-linking, prolines for SH-3 domains, serines, 
threonines, tyrosines or histidines for phosphorylation sites, etc. 

Modulators of lung cancer can also be nucleic acids, as defined above. 

10 As described above generally for proteins, nucleic acid modulating agents may be 

naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. 
Digests of procaryotic or eucaryotic genomes may be used as is outlined above for proteins. 

In a preferred embodiment, the candidate compounds are organic chemical moieties, a 
wide variety of which are available in the literature. 

15 After a candidate agent has been added and the cells allowed to incubate for some 

period of time, the sample containing a target sequence is analyzed. If required, the target 
sequence is prepared using known techniques. For example, the sample may be treated to 
lyse the cells, using known lysis buffers, electroporation, etc., with purification and/or 
amplification such as PGR performed as appropriate. For example, an in vitro transcription 

20 with labels covalently attached to the nucleotides is performed. Generally, the nucleic acids 
are labeled with biotin-FITC or PE, or with cy3 or cy5. 

In a preferred embodiment, the target sequence is labeled with, e.g., a fluorescent, a 
chemiluminescent, a chemical, or a radioactive signal, to provide a means of detecting the 
target sequence's specific binding to a probe. The label also can be an enzyme, such as, 

25 alkaline phosphatase or horseradish peroxidase, which when provided with an appropriate 
substrate produces a product that can be detected. Alternatively, the label can be a labeled 
compound or small molecule, such as an enzyme inhibitor, that binds but is not catalyzed or 
altered by the enzyme. The label also can be a moiety or compound, such as, an epitope tag 
or biotin which specifically binds to streptavidin. For the example of biotin, the streptavidin 

30 is labeled as described above, thereby, providing a detectable signal for the bound target 
sequence. Unbound labeled streptavidin is typically removed prior to analysis. 

Nucleic acid assays can be direct hybridization assays or can comprise "sandwich 
assays", which include the use of multiple probes, as is generally outlined in U.S. Patent Nos. 



60 



WO 02/086443 PCT/US02/12476 
5,681,702, 5,597,909, 5,545,730, 5,594,117, 5,591,584, 5,571,670, 5,580,731, 5,571,670, 

5,591,584, 5,624,802, 5,635,352, 5,594,118, 5,359,100, 5,124,246 and 5,681,697, all of 

which are hereby incorporated by reference. In this embodiment, in general, the target nucleic 

acid is prepared as outlined above, and then added to the biochip comprising a plurality of 

5 nucleic acid probes, under conditions that allow the formation of a hybridization complex. 

A variety of hybridization conditions may be used in the present invention, including 

high, moderate and low stringency conditions as outlined above. The assays are generally 

run under stringency conditions which allow formation of the label probe hybridization 

complex only in the presence of target. Stringency can be controlled by altering a step 

10 parameter that is a thermodynamic variable, including, but not limited to, temperature, 
formamide concentration, salt concentration, chaotropic salt concentration, pH, organic 
solvent concentration, etc. 

These parameters may also be used to control non-specific binding, as is generally 
outlined in U.S. Patent No. 5,681,697: Thus it may be desirable to perform certain steps at 

1 5 higher stringency conditions to reduce non-specific binding. 

The reactions outlined herein may be accomplished in a variety of ways. Components 
of the reaction may be added simultaneously, or sequentially, in different orders, with 
preferred embodiments outlined below. In addition, the reaction may include a variety of 
other reagents. These include salts, buffers, neutral proteins, e.g., albumin, detergents, etc. 

20 which may be used to facilitate optimal hybridization and detection, and/or reduce non- 
specific or background interactions. Reagents that otherwise improve the efficiency of the 
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may also be 
used as appropriate, depending on the sample preparation methods and purity of the target. 
The assay data are analyzed to determine the expression levels, and changes in 

25 expression levels as between states, of individual genes, forming a gene expression profile. 

Screens are performed to identify modulators of the lung cancer phenotype. In one 
embodiment, screening is performed to identify modulators that can induce or suppress a 
particular expression profile, thus preferably generating the associated phenotype. In another 
embodiment, e.g., for diagnostic applications, having identified differentially expressed genes 

30 important in a particular state, screens can be performed to identify modulators that alter 

expression of individual genes. In an another embodiment, screening is performed to identify 
modulators that alter a biological function of the expression product of a differentially 
expressed gene. Again, having identified the importance of a gene in a particular state, 



61 



WO 02/086443 PCT/US02/12476 

screens are performed to identify agents that bind and/or modulate the biological activity of 
the gene product, or evaluate genetic polymorphisms. 

Genes can be screened for those that are induced in response to a candidate agent. 
After identifying a modulator based upon its ability to suppress a lung cancer expression 
5 pattern leading to a normal expression pattern, or to modulate a single lung cancer gene 
expression profile so as to mimic the expression of the gene from normal tissue, a screen as 
described above can be performed to identify genes that are specifically modulated in 
response to the agent. Comparing expression profiles between normal tissue and agent 
treated lung cancer tissue reveals genes that are not expressed in normal tissue or lung cancer 

10 tissue, but are expressed in agent treated tissue. These agent-specific sequences can be 
identified and used by methods described herein for lung cancer genes or proteins. In 
particular these sequences and the proteins they encode find use in marking or identifying 
agent treated cells. In addition, antibodies can be raised against the agent induced proteins 
and used to target novel therapeutics to the treated lung cancer tissue sample. 

15 Thus, in one embodiment, a test compound is administered to a population of lung 

cancer cells, that have an associated lung cancer expression profile. By "administration" or 
"contacting" herein is meant that the candidate agent is added to the cells in such a manner as 
to allow the agent to act upon the cell, whether by uptake and intracellular action, or by 
action at the cell surface. In some embodiments, nucleic acid encoding a proteinaceous 

20 candidate agent (i.e., a peptide) may be put into a viral construct such as an adenoviral or 
retroviral construct, and added to the cell, such that expression of the peptide agent is 
accomplished, e.g., PCT US97/01019: Regulatable gene therapy systems can also be used. 

Once a test compound has been administered to the cells, the cells can be washed if 
desired and are allowed to incubate under preferably physiological conditions for some 

25 period of time. The cells are then harvested and a new gene expression profile is generated, 
as outlined herein. 

Thus, e.g., lung cancer or non-malignant tissue may be screened for agents that 
modulate, e.g., induce or suppress a lung cancer phenotype. A change in at least one gene, 
preferably many, of the expression profile indicates that the agent has an effect on lung 
30 cancer activity. By defining such a signature for the lung cancer phenotype, screens for new 
drugs that alter the phenotype can be devised. With this approach, the drug target need not be 
known and need not be represented in the original expression screening platform, nor does 
the level of transcript for the target protein need to change. 
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Measure of lung cancer polypeptide activity, or of lung cancer or the lung cancer 
phenotype can be performed using a variety of assays. For example, the effects of the test 
compounds upon the function of the metastatic polypeptides can be measured by examining 
parameters described above. A suitable physiological change that affects activity can be used 
5 to assess the influence of a test compound on the polypeptides of this invention. When the 
functional consequences are determined using intact cells or animals, one can also measure a 
variety of effects such as, in the case of lung cancer associated with tumors, tumor growth, 
tumor metastasis, neovascularization, hormone release, transcriptional changes to both known 
and uncharacterized genetic markers (e.g., northern blots), changes in cell metabolism such as 

10 cell growth or pH changes, and changes in intracellular second messengers such as cGMP. In 
the assays of the invention, mammalian lung cancer polypeptide is typically used, e.g., 
mouse, preferably human. 

Assays to identify compounds with modulating activity can be performed in vitro. 
For example, a lung cancer polypeptide is first contacted with a potential modulator and 

15 incubated for a suitable amount of time, e.g., from 0.5 to 48 hours. In one embodiment, the 
lung cancer polypeptide levels are determined in vitro by measuring the level of protein or 
mRNA. The level of protein is typically measured using immunoassays such as western 
blotting, ELIS A and the like with an antibody that selectively binds to the lung cancer 
polypeptide or a fragment thereof. For measurement of mRNA, amplification, e.g., using 

20 PGR, LCR, or hybridization assays, e.g., northern hybridization, RNAse protection, dot 

blotting, are preferred. The level of protein or mRNA is typically detected using directly or 
indirectly labeled detection agents, e.g., fluorescently or radioactively labeled nucleic acids, 
radioactively or enzymatically labeled antibodies, and the like, as described herein. 

Alternatively, a reporter gene system can be devised using a lung cancer protein 

25 promoter operably linked to a reporter gene such as luciferase, green fluorescent protein, 
CAT, or p-gal. The reporter construct is typically transfected into a cell. After treatment 
with a potential modulator, the amount of reporter gene transcription, translation, or activity 
is measured according to standard techniques known to those of skill in the art. 

In a preferred embodiment, as outlined above, screens may be done on individual 

30 genes and gene products (proteins). That is, having identified a particular differentially 

expressed gene as important in a particular state, screening of modulators of the expression of 
the gene or the gene product itself can be done. The gene products of differentially expressed 
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genes are sometimes referred to herein as "lung cancer proteins." The lung cancer protein 

may be a fragment, or alternatively, be the full length protein to a fragment shown herein. 

In one embodiment, screening for modulators of expression of specific genes is 
performed. Typically, the expression of only one or a few genes are evaluated. In another 
5 embodiment, screens are designed to first find compounds that bind to differentially 
expressed proteins. These compounds are then evaluated for the ability to modulate 
differentially expressed activity. Moreover, once initial candidate compounds are identified, 
variants can be further screened to better evaluate structure activity relationships. 

In a preferred embodiment, binding assays are done. In general, purified or isolated 
10 gene product is used; that is, the gene products of one or more differentially expressed 

nucleic acids are made. For example, antibodies are generated to the protein gene products, 
and standard immunoassays are run to determine the amount of protein present. Alternatively, 
cells comprising the lung cancer proteins can be used in the assays. 

Thus, in a preferred embodiment, the methods comprise combining a lung cancer 
15 protein and a candidate compound, and determining the binding of the compound to the lung 
cancer protein. Preferred embodiments utilize the human lung cancer protein, although other 
mammalian proteins may also be used, e.g., for the development of animal models of human 
disease. In some embodiments, as outlined herein, variant or derivative lung cancer proteins 
may be used. 

20 Generally, in a preferred embodiment of the methods herein, the lung cancer protein 

or the candidate agent is non-diffusably bound to an insoluble support, preferably having 
isolated sample receiving areas (e.g., a microtiter plate, an array, etc.). The insoluble 
supports may be made of a composition to which the compositions can be bound, is readily 
separated from soluble material, and is otherwise compatible with the overall method of 

25 screening. The surface of such supports may be solid or porous and of a convenient shape. 
Examples of suitable insoluble supports include microtiter plates, arrays, membranes and 
beads. These are typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon 
or nitrocellulose, teflon™, etc. Microtiter plates and arrays are especially convenient because 
a large number of assays can be carried out simultaneously, using small amounts of reagents 

30 and samples. The particular manner of binding of the composition is typically not crucial so 
long as it is compatible with the reagents and overall methods of the invention, maintains the 
activity of the composition, and is nondiffusable. Preferred methods of binding include the 
use of antibodies (which do not sterically block either the ligand binding site or activation 
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sequence when the protein is bound to the support), direct binding to "sticky" or ionic 
supports, chemical crosslinking, the synthesis of the protein or agent on the surface, etc. 
Following binding of the protein or agent, excess unbound material is removed by washing. 
The sample receiving areas may then be blocked through incubation with bovine serum 
5 albumin (BSA), casein or other innocuous protein or other moiety. 

In a preferred embodiment, the lung cancer protein is bound to the support, and a test 
compound is added to the assay. Alternatively, the candidate agent is bound to the support 
and the lung cancer protein is added. Novel binding agents include specific antibodies, non- 
natural binding agents identified in screens of chemical libraries, peptide analogs, etc. Of 

10 particular interest are screening assays for agents that have a low toxicity for human cells. A 
wide variety of assays may be used for this purpose, including labeled in vitro protein-protein 
binding assays, electrophoretic mobility shift assays, immunoassays for protein binding, 
functional assays (phosphorylation assays, etc.) and the like. 

The determination of the binding of the test modulating compound to the lung cancer 

1 5 protein may be done in a number of ways. In a preferred embodiment, the compound is 

labeled, and binding determined directly, e.g., by attaching all or a portion of the lung cancer 
protein to a solid support, adding a labeled candidate agent (e.g., a fluorescent label), washing 
off excess reagent, and determining whether the label is present on the solid support. Various 
blocking and washing steps may be utilized as appropriate. 

20 In some embodiments, only one of the components is labeled, e.g., the proteins (or 

proteinaceous candidate compounds) can be labeled. Alternatively, more than one 
component can be labeled with different labels, e.g., 125 I for the proteins and a fluorophor for 
the compound. Proximity reagents, e.g., quenching or energy transfer reagents are also 
useful. 

25 In one embodiment, the binding of the test compound is determined by competitive 

binding assay. The competitor may be a binding moiety known to bind to the target molecule 
(i.e., a lung cancer protein), such as an antibody, peptide, binding partner, ligand, etc. Under 
certain circumstances, there may be competitive binding between the compound and the 
binding moiety, with the binding moiety displacing the compound. In one embodiment, the 

30 test compound is labeled. Either the compound, or the competitor, or both, is added first to 
the protein for a time sufficient to allow binding, if present. Incubations may be performed at 
a temperature which facilitates optimal activity, typically between 4 and 40° C. Incubation 
periods are typically optimized, e.g., to facilitate rapid high throughput screening. Typically 
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between 0. 1 and 1 hour will be sufficient. Excess reagent is generally removed or washed 
away. The second component is then added, and the presence or absence of the labeled 
component is followed, to indicate binding. 

In a preferred embodiment, the competitor is added first, followed by a test 
5 compound. Displacement of the competitor is an indication that the test compound is binding 
to the lung cancer protein and thus is capable of binding to, and potentially modulating, the 
activity of the lung cancer protein. In this embodiment, either component can be labeled. 
Thus, e.g., if the competitor is labeled, the presence of label in the wash solution indicates 
displacement by the agent. Alternatively, if the test compound is labeled, the presence of the 

10 label on the support indicates displacement. 

In an alternative embodiment, the test compound is added first, with incubation and 
washing, followed by the competitor. The absence of binding by the competitor may indicate 
that the test compound is bound to the lung cancer protein with a higher affinity. Thus, if the 
test compound is labeled, the presence of the label on the support, coupled with a lack of - 

15 competitor binding, may indicate that the test compound is capable of binding to the lung 
cancer protein. 

In a preferred embodiment, the methods comprise differential screening to identity 
agents that are capable of modulating the activity of the lung cancer proteins. In one 
embodiment, the methods comprise combining a lung cancer protein and a competitor in a 

20 first sample. A second sample comprises a test compound, a lung cancer protein, and a 

competitor. The binding of the competitor is determined for both samples, and a change, or 
difference in binding between the two samples indicates the presence of an agent capable of 
binding to the lung cancer protein and potentially modulating its activity. That is, if the 
binding of the competitor is different in the second sample relative to the first sample, the 

25 agent is capable of binding to the lung cancer protein. 

Alternatively, differential screening is used to identify drug candidates that bind to the 
native lung cancer protein, but cannot bind to modified lung cancer proteins. The structure of 
the lung cancer protein may be modeled, and used in rational drug design to synthesize agents 
that interact with that site. Drug candidates that affect the activity of a lung cancer protein 

30 are also identified by screening drugs for the ability to either enhance or reduce the activity of 
the protein. 

Positive controls and negative controls may be used in the assays. Preferably control 
and test samples are performed in at least triplicate to obtain statistically significant results. 
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Incubation of all samples is for a time sufficient for the binding of the agent to the protein. 
Following incubation, samples are washed free of non-specifically bound material and the 
amount of bound, generally labeled agent determined. For example, where a radiolabel is 
employed, the samples may be counted in a scintillation counter to determine the amount of 
5 bound compound. 

A variety of other reagents may be included in the screening assays. These include 
reagents like salts, neutral proteins, e.g., albumin, detergents, etc. which may be used to 
facilitate optimal protein-protein binding and/or reduce non-specific or background 
interactions. Also reagents that otherwise improve the efficiency of the assay, such as 
10 protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture 
of components may be added in an order that provides for the requisite binding. 

In a preferred embodiment, the invention provides methods for screening for a 
compound capable of modulating the activity of a lung cancer protein. The methods 
comprise adding a test compound, as defined above, to a cell comprising lung cancer 
15 proteins. Preferred cell types include almost any cell. The cells contain a recombinant 
nucleic acid that encodes a lung cancer protein. In a preferred embodiment, a library of 
candidate agents are tested on a plurality of cells. 

In one aspect, the assays are evaluated in the presence or absence or previous or 
subsequent exposure of physiological signals, e.g., hormones, antibodies, peptides, antigens, 
20 cytokines, growth factors, action potentials, pharmacological agents including 

chemotherapeutics, radiation, carcinogenics, or other cells (e.g., cell-cell contacts). In another 
example, the determinations are determined at different stages of the cell cycle process. 

In this way, compounds that modulate lung cancer agents are identified. Compounds 
with pharmacological activity are able to enhance or interfere with the activity of the lung 
25 cancer protein. Once identified, similar structures are evaluated to identify critical structural 
feature of the compound. 

In one embodiment, a method of inhibiting lung cancer cell division is provided. The 
method comprises administration of a lung cancer inhibitor. In another embodiment, a 
method of inhibiting lung cancer is provided. The method may comprise administration of a 
30 lung cancer inhibitor. In a further embodiment, methods of treating cells or individuals with 
lung cancer are provided, e.g., comprising administration of a lung cancer inhibitor. 

In one embodiment, a lung cancer inhibitor is an antibody as discussed above. In 
another embodiment, the lung cancer inhibitor is an antisense molecule. 
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A variety of cell growth, proliferation, viability, and metastasis assays are known to 
those of skill in the art, as described below. 

Soft agar growth or colony formation in suspension 
5 Normal cells require a solid substrate to attach and grow. When the cells are 

transformed, they lose this phenotype and grow detached from the substrate. For example, 
transformed cells can grow in stirred suspension culture or suspended in semi-solid media, 
such as semi-solid or soft agar. The transformed cells, when transfected with tumor 
suppressor genes, regenerate normal phenotype and require a solid substrate to attach and 

10 grow. Soft agar growth or colony formation in suspension assays can be used to identify 
modulators of lung cancer sequences, which when expressed in host cells, inhibit abnormal 
cellular proliferation and transformation. A therapeutic compound would reduce or eliminate 
the host cells' ability to grow in stirred suspension culture or suspended in semi-solid media, 
such as semi-solid or soft. 

15 Techniques for soft agar growth or colony formation in suspension assays are 

described in Freshney (1994) Culture of Animal Cells a Manual of Basic Technique (3 rd ed.), 
herein incorporated by reference. See also, the methods section of Garkavtsev, et al. (1996), 
supra, herein incorporated by reference. 

20 Contact inhibition and density limitation of growth 

Normal cells typically grow in a flat and organized pattern in a petri dish until they 
touch other cells. When the cells touch one another, they are contact inhibited and stop 
growing. When cells are transformed, however, the cells are not contact inhibited and 
continue to grow to high densities in disorganized foci. Thus, the transformed cells grow to a 

25 higher saturation density than normal cells. This can be detected morphologically by the 
formation of a disoriented monolayer of cells or rounded cells in foci within the regular 
pattern of normal surrounding cells. Alternatively, labeling index with ( 3 H)-thymidine at 
saturation density can be used to measure density limitation of growth. See Freshney (1994), 
supra. The transformed cells, when transfected with tumor suppressor genes, regenerate a 

30 normal phenotype and become contact inhibited and would grow to a lower density. 

In this assay, labeling index with ( 3 H)-thymidine at saturation density is a preferred 
method of measuring density limitation of growth. Transformed host cells are transfected 
with a lung cancer-associated sequence and are grown for 24 hours at saturation density in 
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non-limiting medium conditions. The percentage of cells labeling with ( 3 H)-thymidine is 
determined autoradiographically. See, Freshney (1994), supra. 



Growth factor or serum dependence 
5 Transformed cells typically have a lower serum dependence than their normal 

counterparts (see, e.g., Temin (1966) J. Natl. Cancer Insti. 37:167-175; Eagle, et al. (1970) L 
Exp. Med. 131:836-879); Freshney, supra. This is in part due to release of various growth 
factors by the transformed cells. Growth factor or serum dependence of transformed host 
cells can be compared with that of control. 

10 

Tumor specific markers levels 

Tumor cells release an increased amount of certain factors (hereinafter "tumor 
specific markers") than their normal counterparts. For example, plasminogen activator (PA) 
is released from human glioma at a higher level than from normal brain cells (see, e.g., 

15 Gullino, "Angiogenesis, tumor vascularization, and potential interference with tumor growth" 
in Mihich (ed. 1985) Biological Responses in Cancer , pp. 178-184). Similarly, Tumor 
angiogenesis factor (TAF) is released at a higher level in tumor cells than their normal 
counterparts. See, e.g., Folkman (1992) "Angiogenesis and Cancer" in Sem Cancer Biol. ). 
Various techniques which measure the release of these factors are described in 

20 Freshney (1994), supra. Also, see, Unkeless, et al. (1974) J. Biol. Chem. 249:4295-4305; 

Strickland and Beers (1976) J. Biol. Chem . 251:5694-5702; Whur, et al. (1980) Br. J. Cancer 
42:305-312; Gullino, "Angiogenesis, tumor vascularization, and potential interference with 
tumor growth" in Mihich (ed. 1985) Biological Responses in Cancer , pp. 178-184; Freshney 
Anticancer Res. 5:11 1-130 (1985). 

25 

Invasiveness into Matrigel 

The degree of invasiveness into Matrigel or some other extracellular matrix 
constituent can be used as an assay to identify compounds that modulate lung cancer- 
associated sequences. Tumor cells exhibit a good correlation between malignancy and 
30 invasiveness of cells into Matrigel or some other extracellular matrix constituent. In this 
assay, tumorigenic cells are typically used as host cells. Expression of a tumor suppressor 
gene in these host cells would decrease invasiveness of the host cells. 
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Techniques described in Freshney (1994), supra, can be used. Briefly, the level of 

invasion of host cells can be measured by using filters coated with Matrigel or some other 

extracellular matrix constituent. Penetration into the gel, or through to the distal side of the 

filter, is rated as invasiveness, and rated histologically by number of cells and distance 

5 moved, or by prelabeling the cells with 125 I and counting the radioactivity on the distal side of 

the filter or bottom of the dish. See, e.g., Freshney (1984), supra. 



Tumor growth in vivo 

Effects of lung cancer-associated sequences on cell growth can be tested in transgenic 

10 or immune-suppressed mice. Knock-out transgenic mice can be made, in which the lung 
cancer gene is disrupted or in which a lung cancer gene is inserted. Knock-out transgenic 
mice can be made by insertion of a marker gene or other heterologous gene into the 
endogenous lung cancer gene site in the mouse genome via homologous recombination. 
Such mice can also be made by substituting the endogenous lung cancer gene with a mutated 

15 version of the lung cancer gene, or by mutating the endogenous lung cancer gene, e.g., by 
exposure to carcinogens. 

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells 
containing the newly engineered genetic lesion are injected into a host mouse embryo, which 
is re-implanted into a recipient female. Some of these embryos develop into chimeric mice 

20 that possess germ cells partially derived from the mutant cell line. Therefore, by breeding the 
chimeric mice it is possible to obtain a new line of mice containing the introduced genetic 
lesion (see, e.g., Capecchi, et al. (1989) Science 244:1288). Chimeric targeted mice can be 
derived according to Hogan, et al. (1988) Manipulating the Mouse Embrvo: A Laboratory 
Manual , Cold Spring Harbor Laboratory and Robertson (ed. 1987) Teratocarcinomas and 

25 Embryonic Stem Cells: A Practical Approach, , IRL Press, Washington, D.C. 

Alternatively, various immune-suppressed or immune-deficient host animals can be 
used. For example, genetically athymic "nude" mouse (see, e.g., Giovanella, et al. (1974) JL 
Natl. Cancer Inst. 52:921), a SCID mouse, a thymectomized mouse, or an irradiated mouse 
(see, e.g., Bradley, et al. (1978) Br. J. Cancer 38:263; Selby, et al. (1980) Br. J. Cancer 41:52) 

30 can be used as a host. Transplantable tumor cells (typically about 10 6 cells) injected into 

isogenic hosts will produce invasive tumors in a high proportions of cases, while normal cells 
of similar origin will not. In hosts which developed invasive tumors, cells expressing a lung 
cancer- associated sequences are injected subcutaneously. After a suitable length of time, 
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preferably 4-8 weeks, tumor growth is measured (e.g., by volume or by its two largest 

dimensions) and compared to the control. Tumors that have statistically significant reduction 

(using, e.g., Student's T test) are said to have inhibited growth. 



5 Polynucleotide modulators of lung cancer 

Antisense and RNAi Polynucleotides 

In certain embodiments, the activity of a lung cancer-associated protein is 
downregulated, or entirely inhibited, by the use of antisense or an inhibitory polynucleotide, 
i.e., a nucleic acid complementary to, and which can preferably hybridize specifically to, a 

10 coding mRNA nucleic acid sequence, e.g., a lung cancer protein mRNA, or a subsequence 
thereof. Binding of the antisense polynucleotide to the mRNA reduces the translation and/or 
stability of the mRNA. 

In the context of this invention, antisense polynucleotides can comprise naturally- 
occurring nucleotides, or synthetic species formed from naturally-occurring subunits or their 

1 5 close homologs. Antisense polynucleotides may also have altered sugar moieties or inter- 
sugar linkages. Exemplary among these are the phosphorothioate and other sulfur containing 
species which are known for use in the art. Analogs are comprehended by this invention so 
long as they function effectively to hybridize with the lung cancer protein mRNA. See, e.g., 
Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA. 

20 Such antisense polynucleotides can readily be synthesized using recombinant means, 

or can be synthesized in vitro. Equipment for such synthesis is sold by several vendors, 
including Applied Biosystems. The preparation of other oligonucleotides such as 
phosphorothioates and alkylated derivatives is also well known to those of skill in the art. 
Antisense molecules as used herein include antisense or sense oligonucleotides. 

25 Sense oligonucleotides can, e.g., be employed to block transcription by binding to the anti- 
sense strand. The antisense and sense oligonucleotide comprise a single-stranded nucleic 
acid sequence (either RNA or DNA) capable of binding to target mRNA (sense) or DNA 
(antisense) sequences for lung cancer molecules. A preferred antisense molecule is for a lung 
cancer sequence in the tables, or for a ligand or activator thereof. Antisense or sense 

30 oligonucleotides, according to the present invention, comprise a fragment generally at least 
about 14 nucleotides, preferably from about 14 to 30 nucleotides. The ability to derive an 
antisense or a sense oligonucleotide, based upon a cDNA sequence encoding a given protein 
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is described in, e.g., Stein and Cohen (1988) Cancer Res. 48:2659 and van der Krol, et al. 

(1988) BioTechniques 6:958). 

RNA interference is a mechanism to suppress gene expression in a sequence specific 

manner. See, e.g., Brumelkamp, et al. (2002) Sciencexpress (21March2002); Sharp (1999) 

5 Genes Dev. 13:139-141; and Cathew (2001) Curr. Op. Cell Biol. 13:244-248. In mammalian 

cells, short, e.g., 21 nt, double stranded small interfering RNAs (siRNA) have been shown to 

be effective at inducing an RNAi response. See, e.g., Elbashir, et al. (2001) Nature 41 1 :494- 

498. The mechanism may be used to downregulate expression levels of identified genes, e.g., 

treatment of or validation of relevance to disease. 

10 

Ribozymes 

In addition to antisense polynucleotides, ribozymes can be used to target and inhibit 
transcription of lung cancer-associated nucleotide sequences. A ribozyme is an RNA 
molecule that catalytically cleaves other RNA molecules. Different kinds of ribozymes have 

15 been described, including group I ribozymes, hammerhead ribozymes, hairpin ribozymes, 
RNase P, and axhead ribozymes (see, e.g., Castanotto, et al. (1994) Adv. in Pha rmacology 
25: 289-317 for a general review of the properties of different ribozymes). 

The general features of hairpin ribozymes are described, e.g., in Hampel, et al. (1990) 
Nucl. Acids Res. 1 8:299-304; European Patent Publication No. 0 360 257; U.S. Patent No. 

20 5,254,678. Methods of preparing are well known to those of skill in the art (see, e.g., WO 
94/26877; Ojwang, et al. (1993) Proc. Natl. Acad. Sci. USA 90:6340-6344; Yamada, et al. 
(1994) Human Gene Therapy 1:39-45; Leavitt, et al. (1995) Proc. Natl. Acad. Sci. USA 
92:699-703; Leavitt, et al. (19994) Human Gene Therapy 5:1 151-120; and Yamada, et al. 
(1994) Virology 205: 121-126). 

25 Polynucleotide modulators of lung cancer may be introduced into a cell containing the 

target nucleotide sequence by formation of a conjugate with a ligand binding molecule, as 
described in WO 91/04753. Suitable ligand binding molecules include, but are not limited to, 
cell surface receptors, growth factors, other cytokines, or other ligands that bind to cell 
surface receptors. Preferably, conjugation of the ligand binding molecule does not 

30 substantially interfere with the ability of the ligand binding molecule to bind to its 

corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide 
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of lung 
cancer may be introduced into a cell containing the target nucleic acid sequence, e.g., by 
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formation of an polynucleotide-lipid complex, as described in WO 90/10448. It is 
understood that the use of antisense molecules or knock out and knock in models may also be 
used in screening assays as discussed above, in addition to methods of treatment. 

Thus, in one embodiment, methods of modulating lung cancer in cells or organisms 
5 are provided. In one embodiment, the methods comprise administering to a cell an anti-lung 
cancer antibody that reduces or eliminates the biological activity of an endogenous lung 
cancer protein. Alternatively, the methods comprise administering to a cell or organism a 
recombinant nucleic acid encoding a lung cancer protein. This may be accomplished in any 
number of ways. In a preferred embodiment, e.g., when the lung cancer sequence is down- 

10 regulated in lung cancer, such state may be reversed by increasing the amount of lung cancer 
gene product in the cell. This can be accomplished, e.g., by overexpressing the endogenous 
lung cancer gene or administering a gene encoding the lung cancer sequence, using known 
gene-therapy techniques. In a preferred embodiment, the gene therapy techniques include the 
incorporation of the exogenous gene using enhanced homologous recombination (EHR), e.g., 

15 as described in PCT/US93/03868, hereby incorporated by reference in its entirety. 

Alternatively, e.g., when the lung cancer sequence is up-regulated in lung cancer, the activity 
of the endogenous lung cancer gene is decreased, e.g., by the administration of a lung cancer 
antisense or RNAi nucleic acid. 

In one embodiment, the lung cancer proteins of the present invention may be used to 

20 generate polyclonal and monoclonal antibodies to lung cancer proteins. Similarly, the lung 
cancer proteins can be coupled, using standard technology, to affinity chromatography 
columns. These columns may then be used to purify lung cancer antibodies useful for 
production, diagnostic, or therapeutic purposes. In a preferred embodiment, the antibodies 
are generated to epitopes unique to a lung cancer protein; that is, the antibodies show little or 

25 no cross-reactivity to other proteins. The lung cancer antibodies may be coupled to standard 
affinity chromatography columns and used to purify lung cancer proteins. The antibodies 
may also be used as blocking polypeptides, as outlined above, since they will specifically 
bind to the lung cancer protein. 

30 Methods of identifying variant lung cancer-associated sequences 

Without being bound by theory, expression of various lung cancer sequences is 
correlated with lung cancer. Accordingly, disorders based on mutant or variant lung cancer 
genes may be determined. In one embodiment, the invention provides methods for 
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identifying cells containing variant lung cancer genes, e.g., determining all or part of the 
sequence of at least one endogenous lung cancer genes in a cell. In a preferred embodiment, 
the invention provides methods of identifying the lung cancer genotype of an individual, e.g., 
determining all or part of the sequence of at least one lung cancer gene of the individual. 
This is generally done in at least one tissue of the individual, and may include the evaluation 
of a number of tissues or different samples of the same tissue. The method may include 
comparing the sequence of the sequenced lung cancer gene to a known lung cancer gene, i.e., 
a wild-type gene. 

The sequence of all or part of the lung cancer gene can then be compared to the 
sequence of a known lung cancer gene to determine if any differences exist. This can be 
done using known homology programs, such as Bestfit, etc. In a preferred embodiment, the 
presence of a difference in the sequence between the lung cancer gene of the patient and the 
known lung cancer gene correlates with a disease state or a propensity for a disease state, as 
outlined herein. 

In a preferred embodiment, the lung cancer genes are used as probes to determine the 
number of copies of the lung cancer gene in the genome. 

In another preferred embodiment, the lung cancer genes are used as probes to 
determine the chromosomal localization of the lung cancer genes. Information such as 
chromosomal localization finds use in providing a diagnosis or prognosis in particular when 
chromosomal abnormalities such as translocations, and the like are identified in the lung 
cancer gene locus. 

Administration of pharmaceutical and vaccine compositions 

In one embodiment, a therapeutically effective dose of a lung cancer protein or 
modulator thereof, is administered to a patient. By "therapeutically effective dose" herein is 
meant a dose that produces effects for which it is administered. The exact dose will depend 
on the purpose of the treatment, and will be ascertainable by one skilled in the art using 
known techniques (e.g., Ansel, et al. (1992) Pharmaceutical Dosage Forms and Drug 
Delivery : Lieberman, Pharmaceutical Dosage Forms (vols. 1-3), Dekker, ISBN 0824770846, 
082476918X, 0824712692, 0824716981; Lloyd (1999) The Art. Science and Technology of 
Pharmaceutical Compounding ; and Pickar (1999) Dosage Calculations^ Adjustments for 
lung cancer degradation, systemic versus localized delivery, and rate of new protease 
synthesis, as well as the age, body weight, general health, sex, diet, time of administration, 
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drug interaction and the severity of the condition may be necessary, and will be ascertainable 
with routine experimentation by those skilled in the art. 

A "patient" for the purposes of the present invention includes both humans and other 
animals, particularly mammals. Thus the methods are applicable to both human therapy and 
5 veterinary applications. In the preferred embodiment the patient is a mammal, preferably a 
primate, and in the most preferred embodiment the patient is human. 

The administration of the lung cancer proteins and modulators thereof of the present 
invention can be done in a variety of ways, including, but not limited to, orally, 
subcutaneously, intravenously, intranasally, transdermally, intraperitoneally, intramuscularly, 

10 intrapulmonary, vaginally, rectally, or intraocularly. In some instances, e.g., in the treatment 
of wounds and inflammation, the lung cancer proteins and modulators may be directly 
applied as a solution or spray. 

The pharmaceutical compositions of the present invention comprise a lung cancer 
protein in a form suitable for administration to a patient. In the preferred embodiment, the 

15 pharmaceutical compositions are in a water soluble form, such as being present as 

pharmaceutically acceptable salts, which is meant to include both acid and base addition 
salts. "Pharmaceutically acceptable acid addition salt" refers to those salts that retain the 
biological effectiveness of the free bases and that are not biologically or otherwise 
undesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid, 

20 sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid, 
propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic 
acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, 
methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid and the like. 
"Pharmaceutically acceptable base addition salts" include those derived from inorganic bases 

25 such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper, 
manganese, aluminum salts and the like. Particularly preferred are the ammonium, 
potassium, sodium, calcium, and magnesium salts. Salts derived from pharmaceutically 
acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amines, 
substituted amines including naturally occurring substituted amines, cyclic amines and basic 

30 ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, triethylamine, 
tripropylamine, and ethanolamine. 

The pharmaceutical compositions may also include one or more of the following: 
carrier proteins such as serum albumin; buffers; fillers such as microcrystalline cellulose, 
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lactose, corn and other starches; binding agents; sweeteners and other flavoring agents; 
coloring agents; and polyethylene glycol. 

The pharmaceutical compositions can be administered in a variety of unit dosage 
forms depending upon the method of administration. For example, unit dosage forms 
5 suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules 
and lozenges. It is recognized that lung cancer protein modulators (e.g., antibodies, antisense 
constructs, ribozymes, small organic molecules, etc.) when administered orally, should be 
protected from digestion. This is typically accomplished either by complexing the 
molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, or by 

10 packaging the molecule(s) in an appropriately resistant carrier, such as a liposome or a 
protection barrier. Means of protecting agents from digestion are well known in the art. 

The compositions for administration will commonly comprise a lung cancer protein 
modulator dissolved in a pharmaceutically acceptable carrier, preferably an aqueous carrier. 
A variety of aqueous carriers can be used, e.g., buffered saline and the like. These solutions 

15 are sterile and generally free of undesirable matter. These compositions may be sterilized by 
conventional, well known sterilization techniques. The compositions may contain 
pharmaceutically acceptable auxiliary substances as required to approximate physiological 
conditions such as pH adjusting and buffering agents, toxicity adjusting agents and the like, 
e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride, sodium lactate 

20 and the like. The concentration of active agent in these formulations can vary widely, and 
will be selected primarily based on fluid volumes, viscosities, body weight and the like in 
accordance with the particular mode of administration selected and the patient's needs (e.g., 
Remington's Pharmaceutical Science (15th ed., 1980) and Hardman, et al. (eds. 1996) 
Goodman and Gilman: The Pharmacologial Basis of Therapeutics") . 

25 Thus, a typical pharmaceutical composition for intravenous administration would be 

about 0.1 to 10 mg per patient per day. Dosages from 0.1 up to about 100 mg per patient per 
day may be used, particularly when the drug is administered to a secluded site and not into 
the blood stream, such as into a body cavity or into a lumen of an organ. Substantially higher 
dosages are possible in topical administration. Actual methods for preparing parenterally 

30 administrate compositions will be known or apparent to those skilled in the art, e.g., 

Remington's Pharmaceutical Science and Goodman and Gilman, The Pharmacologial Basis 
of Therapeutics , supra. 
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The compositions containing modulators of lung cancer proteins can be administered 
for therapeutic or prophylactic treatments. In therapeutic applications, compositions are 
administered to a patient suffering from a disease (e.g., a cancer) in an amount sufficient to 
cure or at least partially arrest the disease and its complications. An amount adequate to 
5 accomplish this is defined as a "therapeutically effective dose." Amounts effective for this 
use will depend upon the severity of the disease and the general state of the patient's health. 
Single or multiple administrations of the compositions may be administered depending on the 
dosage and frequency as required and tolerated by the patient. In any event, the composition 
should provide a sufficient quantity of the agents of this invention to effectively treat the 

10 patient. An amount of modulator that is capable of preventing or slowing the development of 
cancer in a mammal is referred to as a "prophylactically effective dose." The particular dose 
required for a prophylactic treatment will depend upon the medical condition and history of 
the mammal, the particular cancer being prevented, as well as other factors such as age, 
weight, gender, administration route, efficiency, etc. Such prophylactic treatments may be 

15 used, e.g., in a mammal who has previously had cancer to prevent a recurrence of the cancer, 
or in a mammal who is suspected of having a significant likelihood of developing cancer 
based, at least in part, upon gene expression profiles. Vaccine strategies may be used, in 
either a DNA vaccine form, or protein vaccine. 

It will be appreciated that the present lung cancer protein-modulating compounds can 

20 be administered alone or in combination with additional lung cancer modulating compounds 
or with other therapeutic agent, e.g., other anti-cancer agents or treatments. 

In numerous embodiments, one or more nucleic acids, e.g., polynucleotides 
comprising nucleic acid sequences set forth in the tables, such as antisense or RNAi 
polynucleotides or ribozymes, will be introduced into cells, in vitro or in vivo. The present 

25 invention provides methods, reagents, vectors, and cells useful for expression of lung cancer- 
associated polypeptides and nucleic acids using in vitro (cell-free), ex vivo, or in vivo (cell or 
organism-based) recombinant expression systems. 

The particular procedure used to introduce the nucleic acids into a host cell for 
expression of a protein or nucleic acid is application specific. Many procedures for 

30 introducing foreign nucleotide sequences into host cells may be used. These include the use 
of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection, 
plasma vectors, viral vectors and other well known methods for introducing cloned genomic 
DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell (see, e.g., 
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Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzvmology 

volume 152 (Berger), Ausubel, et al. (eds. 1999) Current Protocols (supplemented through 

1999), and Sambrook, et al. (1989) Molecular Cloning - A Laboratory Manual (2nd ed., Vol. 

1-3). 

5 In a preferred embodiment, lung cancer proteins and modulators are administered as 

therapeutic agents, and can be formulated as outlined above. Similarly, lung cancer genes 
(including both the full-length sequence, partial sequences, or regulatory sequences of the 
lung cancer coding regions) can be administered in a gene therapy application. These lung 
cancer genes can include antisense or inhibitory applications, e.g., as inhibitory RNA or gene 

10 therapy (e.g., for incorporation into the genome) or as antisense compositions. 

Lung cancer polypeptides and polynucleotides can also be administered as vaccine 
compositions to stimulate HTL, CTL, and antibody responses.. Such vaccine compositions 
can include, e.g., lipidated peptides (see, e.g.,Vitiello, et al. (1995) J. Clin. Invest. 95:341), 
peptide compositions encapsulated in poly(DL-lactide-co-glycolide) ("PLG") microspheres 

15 (see, e.g., Eldridge, et al. (1991) Molec. Immunol. 28:287-294; Alonso, et al. (1994) Vaccine 
12:299-306; Jones, et al. (1995) Vaccine 13:675-681), peptide compositions contained in 
immune stimulating complexes (ISCOMS) (see, e.g., Takahashi, et al. (1990) Nature 
344:873-875; Hu, et al. (1998) Clin Exp Immunol. 113:235-243), multiple antigen peptide 
systems (MAPs) (see, e.g., Tarn (1988) Proc. Natl. Acad. Sci. U.S.A. 85:5409-5413; Tarn 

20 (1996) J. Immunol. Methods 196:17-32), peptides formulated as multivalent peptides; 

peptides for use in ballistic delivery systems, typically crystallized peptides, viral delivery 
vectors (Perkus, et al., p. 379 In: Kaufmann (ed. 1996) Concepts in vaccine development : 
Chakrabarti, et al. (1986) Nature 320:535; Hu, et al. (1986) Nature 320:537; Kieny, et al. 
(1986) AIDS Bio/Technology 4:790; Top, et al. (1971) J. Infect. Pis. 124:148; Chanda, et al. 

25 (1990) Virology 175:535), particles of viral or synthetic origin (see, e.g., Kofler, et al. (1996) 
J. Immunol. Methods 192:25; Eldridge, et al. (1993) Sem. Hematol. 30:16; Falo, et al. (1995) 
Nature Med. 7:649), adjuvants (Warren, et al. (1986) Annu. Rev. Immunol. 4:369; Gupta, et 
al. (1993) Vaccine 11:293), liposomes (Reddy, et al. (1992) J. Immunol. 148:1585; Rock 
(1996) Immunol. Today 17:131), or, naked or particle absorbed cDNA (Ulmer, et al. (1993) 

30 Science 259:1745; Robinson, et al. (1993) Vaccine 11:957; Shiver, et al., p. 423 In: 

Kaufmann (ed. 1996) Concepts in vaccine development : Cease and Berzofsky (1994) Annu. 
Rev. Immunol. 12:923 and Eldridge, et al. (1993) Sem. Hematol. 30:16). Toxin-targeted 
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delivery technologies, also known as receptor mediated targeting, such as those of Avant 

Immunotherapeutics, Inc. (Needham, Massachusetts) may also be used. 

Vaccine compositions often include adjuvants. Many adjuvants contain a substance 

designed to protect the antigen from rapid catabolism, such as aluminum hydroxide or 

5 mineral oil, and a stimulator of immune responses, such as lipid A, Bortadella pertussis or 

Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available 

as, e.g., Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, 

MI); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline 

Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel (alum) or 

10 aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated 

tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides; 

polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A. 

Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be 

used as adjuvants. 

1 5 Vaccines can be administered as nucleic acid compositions wherein DNA or RNA 

encoding one or more of the polypeptides, or a fragment thereof, is administered to a patient. 
This approach is described, for instance, in Wolff, et. al. (1990) Science 247:1465 as well as 
U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 5,679,647; WO 
98/04720; and in more detail below. Examples of DNA-based delivery technologies include 

20 "naked DNA", facilitated (bupivicaine, polymers, peptide-mediated) delivery, cationic lipid 
complexes, and particle-mediated ("gene gun") or pressure-mediated delivery (see, e.g., U.S. 
Patent No. 5,922,687). 

For therapeutic or prophylactic immunization purposes, the peptides of the invention 
can be expressed by viral or bacterial vectors. Examples of expression vectors include 

25 attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of 
vaccinia virus, e.g., as a vector to express nucleotide sequences that encode lung cancer 
polypeptides or polypeptide fragments. Upon introduction into a host, the recombinant 
vaccinia virus expresses the immunogenic peptide, and thereby elicits an immune response. 
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S. 

30 Patent No. 4,722,848. Another vector is BCG (Bacille Calmette Guerin). BCG vectors are 
described in Stover, et al. (1991) Nature 351 :456-460. A wide variety of other vectors useful 
for therapeutic administration or immunization e.g., adeno and adeno-associated virus 
vectors, retroviral vectors, Salmonella typhi vectors, detoxified anthrax toxin vectors, and the 
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like, will be apparent to those skilled in the art from the description herein (see, e.g., Shata, et 
al. (2000) Mol Med Today 6:66-71: Shedlock, et al. (2000) J. Leukoc. Biol. 68:793-806; 
Hipp, et al. (2000) In Vivo 14:571-85). 

Methods for the use of genes as DNA vaccines are well known, and include placing a 
5 lung cancer gene or portion of a lung cancer gene under the control of a regulatable promoter 
or a tissue-specific promoter for expression in a lung cancer patient. The lung cancer gene 
used for DNA vaccines can encode full-length lung cancer proteins, but more preferably 
encodes portions of the lung cancer proteins including peptides derived from the lung cancer 
protein. In one embodiment, a patient is immunized with a DNA vaccine comprising a 

10 plurality of nucleotide sequences derived from a lung cancer gene. For example, lung cancer- 
associated genes or sequence encoding subfragments of a lung cancer protein are introduced 
into expression vectors and tested for their immunogenicity in the context of Class I MHC 
and an ability to generate cytotoxic T cell responses. This procedure provides for production 
of cytotoxic T cell responses against cells which present antigen, including intracellular 

15 epitopes. 

In a preferred embodiment, DNA vaccines include a gene encoding an adjuvant 
molecule with the DNA vaccine. Such adjuvant molecules include cytokines that increase 
the immunogenic response to the lung cancer polypeptide encoded by the DNA vaccine. 
Additional or alternative adjuvants are available. 

20 In another preferred embodiment lung cancer genes find use in generating animal 

models of lung cancer. When the lung cancer gene identified is repressed or diminished in 
metastatic tissue, gene therapy technology, e.g., wherein antisense or inhibitory RNA directed 
to the lung cancer gene will also dimmish or repress expression of the gene. Animal models 
of lung cancer find use in screening for modulators of a lung cancer-associated sequence or 

25 modulators of lung cancer. Similarly, transgenic animal technology including gene knockout 
technology, e.g., as a result of homologous recombination with an appropriate gene targeting 
vector, will result in the absence or increased expression of the lung cancer protein. When 
desired, tissue- specific expression or knockout of the lung cancer protein may be necessary. 
It is also possible that the lung cancer protein is overexpressed in lung cancer. As 

30 such, transgenic animals can be generated that overexpress the lung cancer protein. 

Depending on the desired expression level, promoters of various strengths can be employed 
to express the transgene. Also, the number of copies of the integrated transgene can be 
determined and compared for a determination of the expression level of the transgene. 
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Animals generated by such methods will find use as animal models of lung cancer and are 
additionally useful in screening for modulators to treat lung cancer. 



Kits for Use in Diagnostic and/or Prognostic Applications 

For use in diagnostic, research, and therapeutic applications suggested above, kits are 
also provided by the invention. In diagnostic and research applications such kits may include 
at least one of the following: assay reagents, buffers, lung cancer-specific nucleic acids or 
antibodies, hybridization probes and/or primers, antisense polynucleotides, ribozymes, RNAi, 
dominant negative lung cancer polypeptides or polynucleotides, small molecule inhibitors of 
lung cancer-associated sequences, etc. A therapeutic product may include sterile saline or 
another pharmaceutically acceptable emulsion and suspension base. 

In addition, the kits may include instructional materials containing instructions (e.g., 
protocols) for the practice of the methods of this invention. While the instructional materials 
typically comprise written or printed materials they are not limited to such. A medium 
capable of storing such instructions and communicating them to an end user is contemplated 
by this invention. Such media include, but are not limited to electronic storage media (e.g., 
magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the like. Such 
media may include addresses to internet sites that provide such instructional materials. 

The present invention also provides for kits for screening for modulators of lung 
cancer-associated sequences. Such kits can be prepared from readily available materials and 
reagents. For example, such kits can comprise one or more of the following materials: a lung 
cancer-associated polypeptide or polynucleotide, reaction tubes, and instructions for testing 
lung cancer-associated activity. Optionally, the kit contains biologically active lung cancer 
protein. A wide variety of kits and components can be prepared according to the present 
invention, depending upon the intended user of the kit and the particular needs of the user. 
Diagnosis would typically involve evaluation of a plurality of genes or products. The genes 
typically will be selected based on correlations with important parameters in disease which 
may be identified in historical or outcome data. 
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Example 1 : Gene Chip Analysis 

Molecular profiles of various normal and cancerous tissues were determined and 
5 analyzed using gene chips. RNA was isolated and gene chip analysis was performed as 

described (Glynne, et al. (2000) Nature 403:672-676; Zhao, et al. (2000) Genes Dev. 14:981- 
993). 
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Tables 1A and 1B were previously filed on April 18, 2001 in USSN 60/284,770 (18501-001500US) and on November 29, 2001 in USSN 60/334,370 
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S-phase kinase-associated protein 2 (p45 


1.32 


3.01 


105719 


AA291644 


Hs.36793 


Hypothetical protein FLJ23188 


1.28 


2.31 


105743 


AA293300 


Hs.9598 


ESTs 


1 


1 


106012 ■ 


AA411621 


Hs.8895 


ESTs; same as BFH6? 


0.94 


2.04 


106231 


AA429571 


Hs 38002 


K1AA1355 protein 


1.04 


1.5 


10654Q 


AA454607 


Hs 38114 


Hypothetical protein FLJ1 1 100 


1.26 


2.26 


10R575 


AA45R039 

r\rvt<j uu w w 


Hs 105421 


ESTs 


1 


2 


I U0O04 


AA45Q897 


Hs 11950 

I IO. 1 1 3Jv 


GPl-anchored metastasis-associated prote 


0.87 


1.32 


106727 


AA4R5349 


Hs 34045 


HvDothetical Drotein FLJ20764 


0.87 


1.59 


106906 


AA490937 


Hs 222024 


Transcription factor BMAL2 (cycle-like f 


0.61 


1.6 


1 07059 


AA608545 


Hs.23044 


RAD51 (S. cerevislae) homolog (E coli Re 


0.48 


2.67 


107104 


AA609786 


Hs. 15243 


Nucleolar protein 1 (120kD) 


1.01 


1.44 


107151 


AA621169 


Hs 8687 

J IO1UUU1 


ESTs; procollagen l-N proteinase 


0.97 


2.89 


107284 


074039 


Hs 291904 


Accessory prote/ns BAP31/BAP29 


1.15 


3.65 


107901 


AA02R41R 


Hs 91539 
no.w 1 vjoj 


ESTs 


0.72 


3.44 


107922 


M028028 


Hs.61460 


Ig superfamily receptor LNIR precursor 


1 


2.48 


107932 


AA029317 


Hs.18878 


Hypothetical protein FLJ21620 


1 


1 


108695 


AA121315 


Hs.70823 


KIAA1077 protein 


0.91 


3.53 


108857 


AA1 33250 


Hs.62180 


ESTs 


1 


1 


108860 


AA133334 


Hs.1 29911 


ESTs 


0.73 


7.3 


108990 


M1 52296 


Hs.72045 


ESTs 


1 


1 


109166 


AA179845 


Hs.73625 


"RAB6 interacting, kinesin-like (rabkine 


1 


4.55 


109424 


AA227919 


Hs.85962 


Hyaluronan synthase 3 


1 


1.28 


109665 


F05012 


Hs.27027 


Hypothetical protein DKFZp762H1311 


1.42 


2 


109970 


H09281 


Hs.13234 


ESTs 


1.13 


2.16 
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110015 


H10998 


Hs.7164 


A disintegrin and metalloproteinase doma 


0.84 


1.95 


110156 


H18957 


Hs.4213 


ESTs 


0.94 


1.41 


110561 


H59617 


Hs.5199 


HSPC150 protein similar to ubiquitin-con 


0.91 


3.18 


111223 


N68921 


Hs.34806 


ESTs; Weakly similar to neogenin [H.sapi 


0.91 


3.13 


111345 


N89820 


Hs. 14559 


Hypothetical protein FLJ 10540 


1 


1.25 


111876 


R38239 


Hs.293246 


"ESTs, Weakly similar to putative p150 [ 


0.83 


1.27 


111902 


R39191 


Hs.1 09445 


KIAA1 020 protein 


0.91 


0.91 


112244 


R51309 


Hs.70823 


KIAA1077 protein 


0.77 


3.01 


112973 


T17271 




"cDNA FLJ 13308 fis, clone OVARC1001436, 


1 


1 


112989 


T23482 


Hs.89981 


"Diacylglycerol kinase, zeta (104kD)" 


0.55 


1.03 


113047 


T25867 


Hs.7549 


ESTs 


0.87 


2 


113095 


T40920 


Hs. 126733 


ESTs 


1 


1 


113531 


T90345 


Hs.16740 


Hypothetical protein FU11036 


0.42 


1.44 


113970 


W86748 


Hs.8109 


ESTs 


1.17 


1.73 


114346 


Z41450 


Hs. 130489 


"ATPase, aminophospholipid transporter-l 


0.86 


0.82 


114407 


AA010188 


Hs. 103305 


ESTs 


0.8 


1.88 


114471 


AA028074 


Hs.1 04613 


RP42 homolog 


1.06 


1.34 


114509 


AA043551 


Hs.101799 


KIAA1350 protein 


1.82 


2.32 


115060 


AA253214 


Hs.1 98249 


"Gap junction protein, beta 5 (connexin 


0.79 


1.49 


115091 


AA255900 


Hs.1 84523 


KIAA0965 protein 


0.72 


1.92 


115123 


AA256642 


Hs.236894 


"ESTs, High sim to LRP1 hu low density I 


0.59 


1.97 


115291 


AA279943 


Hs.1 22579 


ESTs 


1 


1.25 


115506 


AA292537 


Hs.45207 


Hypothetical protein KIAA1335 


1.15 


1.48 


115522 


AA331393 


Hs.47378 


ESTs 


0.5 


3.29 


115536 


AA347193 


Hs.62180 


ESTs 


1 


1 


115697 


AA411502 


Hs.63325 


Homo sapiens type II membrane serine pro 


1 


6.53 


115909 


AA436666 


Hs.59761 


ESTs 


1 


6.98 


115978 


AA447522 


Hs.69517 


Differentially expressed in Fanconi anem 


1 


2.31 


116028 


AA452112 


Hs.42644 


thioredoxin-like 


0.99 


1.68 


116107 


AA456968 


Hs.92030 


ESTs 


1.14 


1.8 


116134 


AA460246 


Hs.50441 


CGI-04 protein 


1.11 


1.86 


116157 


M461063 


Hs.44298 


Hypothetical protein 


0.99 


1.9 


116158 


AA461187 


Hs.61762 


Hypoxia-inducible protein 2 


0.44 


0.86 


116335 


AA495830 


Hs.87013 


"Homo sapiens cDNA FLJ 10238 fis, clone H 


0.62 


3.89 


116483 


C14092 


Hs.76118 


Ubiquitin carboxyl-termina! esterase L1 


1.04 


2.36 


117320 


N23239 


Hs.211092 


LUNX protein; PLUNC(palate lung & nasal 


0.51 


0.64 


117557 


N33920 


Hs.44532 


Diubiquitin 


1.11 


2.63 


117693 


N40939 


Hs.112110 


PTD007 protein 


0.98 


1.79 


117881 


N50073 


Hs.260622 


Butyrate-induced transcript 1 


1 


1.43 


118368 


N64339 


Hs.48956 


ESTs 


0.67 


2.86 


118566 


N68558 


Hs.42824 


Hypothetical protein FLJ10718 


1.21 


0.83 


118695 


N71781 


Hs.50081 


KIAA1199 seeCVA7.doc 


0.88 


1.63 


119780 


W72967 


Hs.191381 


ESTs; Weakly similar to hypothetical pro 


1 


1 


119845 


W79920 


Hs.58561 


G protein-coupled receptor 87 


1 


1 


120102 


W95428 


Hs.1 32927 


"ESTs, Moderately similar to p53 reguiat 


1 


1 


120104 


W95477 


Hs.1 80479 


ESTs 


0.69 


3.07 


120486 


AA253400 


Hs.1 37569 


Tumor protein 63 kDa with strong homolog 


1.08 


12.05 


120859 


AA350158 


Hs.1 619 


Achaete-scute complex (Drosophila) homol 


1 


1 


120880 


AA360240 


Hs.97019 


EST 


1 


1 


120948 


AA397822 


Hs.1 04650 


Hypothetical protein FLJ10292 


1.04 


2.15 


120983 


AA398209 


Hs.97587 


EST 


1 


1 


121362 


AA405500 


Hs.97932 


Chondromodulin I precursor 


1 


1 


121369 


AA405657 


Hs.1 28791 


CGI-09 protein 


1 


1.8 


121791 


AA423978 


Hs.293317 


"ESTs, Weakly similar to JM27 [H.sapiens 


1 


1 


123005 


AA479726 


Hs.1 05577 


ESTs 


1 


1 


123044 


AA481549 


Hs.1 30881 


B-cell CLUIymphoma 1 1 A (zinc finger pro 


0.95 


1.88 


123160 


AA488687 


Hs.284235 


ESTs 


1.59 


4.98 


123479 


AA599469 


Hs.1 35056 


clone RP5-850E9 on chromosome 20 


1.19 


1.64 


123571 


AA608956 


Hs.1 12619 


"ESTs, Weakly similar to PQ0109 Purkinje 


1.03 


1.14 


123829 


AA620697 


Hs.1 12208 


XAGE-1 protein 


1.39 


2.2 


124006 


D60302 


Hs.1 08977 


ESTs 


1 


4.85 


124059 


F13673 


Hs.99769 


ESTs 


1.49 


8.62 


124960 


T15386 


Hs.1 94766 


Seizure related gene 6 (mouse)-like 


0.76 


0.77 


125218 


W73561 


Hs.1 10024 


NADH:ubiquinone oxidoreductase MLRQ subu 


1.33 


1.77 


125453 


R06041 


Hs.1 8048 


"Melanoma antigen, family A, 10" 


0.8 


1.42 


125759 


AA425587 


Hs.82226 


Glycoprotein (transmembrane) nmb 
"ESTs, Highly similar to unnamed protein 


1.52 


2.26 


125972 


AA434562 


Hs.35406 


1.05 


2.48 


125994 


H55782 


Hs. 270799 


EST 


1 


1.95 


126395 


N70.192 


Hs.278956 


Hypothetical protein FLJ 1 2929 


1 


1.35 


126645 


AH 67942 


Hs.61635 


STEAP1 (Homo sapiens BAG clone RG041 D1 1 


1 


2.23 


127221 


AI354332 


Hs.72365 


ESTs 


0.73 


3.27 


127479 


M513722 


Hs.1 79729 


collagen; type X; alpha 1 (Schmid metaph 


0.51 


1.94 


128192 


AI204246 




KIAA1 085 protein 


1.8 


3.16 


128610 


L38608 


Hs.1 0247 


activated leucocyte cell adhesion molecu 


0.89 


0.97 


128777 


U46006 


Hs.1 0526 


Cysteine and glycine-rich protein 2 


1 


1 


128924 


AA234962 


Hs.26557 


Plakophilin 3 


1.3 


2.97 


129041 


H58873 


Hs 169902 

MO. 1 UdvUiu 


"Solute carrier family 2 (facilitated g! 


0.84 


2.04 


129099 


H50398 


Hs.1 08660 


"ATP-binding cassette, sub-family C (CFT 


0.87 


1.04 


129404 


M172056 


Hs.1 11 128 


ESTs 


1 


1 


129466 


L42583 




"Genbank Homo sapiens keratin 6 isoform 


0.72 


12.67 


129605 


S72493 


Hs.1 15947 


Keratin 16 (focal non-epidermolytic palm 


0.92 


1.5 


129628 


U26727 


Hs.1 174 


"Cyclin-dependent kinase inhibitor 2A (m 


0.85 


1.93 


130023 


X13461 


Hs.239600 


Calmodulin-like 3 


0.84 


1.22 


130080 


X14850 


Hs.147097 


"H2A histone family, member X" 


0.98 


1.96 


130385 


AA126474 


Hs.155223 


stanniocalcin 2 


1 


1 
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130410 


V01514 


Hs. 155421 


Alpha-fetoprotein 


0.63 


0.63 


130441 


U35835 


Hs.301387 


"Human DNA-PK mRNA, partial cds" 


1.15 


3.65 


130482 


L32866 


Hs.1 578 


Baculoviral IAP repeat-containing 5 (sur 


1 


1.88 


130553 


AA430032 


Hs.252587 


Pituitary tumor-transforming 1 


0.92 


1.96 


130577 


M35410 


Hs.162 


Insulin-like growth factor binding prote 


1.17 


4.7 


130627 


L23808 


Hs.1695 


Matrix metalloproteinase 12 (macrophage 


0.69 


4.05 


130800 


AA223386 


Hs. 19574 


ESTs; Weakly similar to katanin p80 subu 


1.13 


2.41 


130939 


AA598689 


Hs.21400 


ESTs 


0.8 


0.89 


131046 


X02530 


Hs.2248 


INTERFERON-GAMMA INDUCED PROTEIN PRECURS 0.8 


1.15 


131244 


D38076 


Hs.24763 


RAN hindinn nrotpin 1 


1.13 


1.85 


131877 


J04088 


Hs. 156346 


Tonntenmprasp fDNA^ II alnha M7flkrfl 


1 


1 


131927 


AA461549 


Hs.34780 


"nnnhlprnrtpy li^pnrpnhali/ X-linkpri ( 

WUUUIGwVJI ICA, Hood lUCf-M latj , /\ 1IJII\GU \ 


0.81 


0.62 


131965 


W90146 


Hs.35962 


ESTs 


0.74 


3.27 


131978 


D80008 


Hs.36232 


KIAA0186 nenp nrodurt 


1 


1 


132354 


L05187 


Hs.211913 


Small proline-rich protein 1A 


0.69 


1.43 


132543 


AA417152 


Hs.5101 


ESTsj Highly similar to protein regulati 


0.79 


4.27 


132632 


N59764 


Hs.5398 


guanine-monophosphate synthetase 


1* 


1.08 


132653 


U31201 


Hs.54451 


"laminin gamma2 chain gene (LAMC2), exon 


1 


1 


132659 


Z75190 


Hs.54481 


"I nw ripnsifv linnnrotein rpppntor-rplatp 

1— UVV UWIICilJf llJJVpi Ulwlll ICuwpiUI Ivlulv 


0.89 


0.89 


132710 


W93726 


Hs.55279 


"Serine (or cysteine) proteinase inhibit 


0.64 


4.41 


132758 


W52432 


Hs.56105 


"ESTs Weaklv similar to WDNM RAT WDNM1 

l— w IO| » V C ai\l J Oil HI Jul VU UMIVI 1 \n 1 liul ^IVI 1 


1.55 


2.08 


132767 


L05188 


Hs.231622 


Small proline-rich protein 2B 


o!83 


1.66 


132816 


M74542 


Hs.575 


Aldehyde dehydrogenase 3 


0.55 


0.55 


132990 


AA458761 


Hs. 18387 


transcription factor AP-2 alpha (activat 


1 


3.53 


133070 


U69611 


Hs.64311 


"A disintegrin and metalloproteinase dom 


1.16 


2 


133282 


U52960 


Hs.286145 


"SRB7 {suppressor of RNA polymerase B, y 


1 


2.7 


133317 


AA215299 


Hs.70830 


U6 snRNA-associated Sm-like protein LSm7 


0.95 


1.42 


133370 


AA1 56897 


Hs.72157 


Homo sapiens mRNA; cDNA DKFZp564H922 


1.12 


2.55 


133391 


X57579 


Hs.727 


H.sapiens activin beta-A subu nit (exon 2 


1.65 


1.76 


133832 


H03387 


Hs.241305 


pslrnnpn-rpsnnnsit/p. R box nroteln /EBBP) 


1.02 


1.39 


134032 


Z81326 


Hs.78589 


"Sprinp fnr rv'itpinp^ nrotpinasp. inhibit 


1 


1 


134168 


AA398908 


Hs.181634 


"Homo sapiens cDNA: FLJ23602 fis, clone 


0.95 


1.53 


134218 


AA227480 


Hs.80205 


pim. 9 nnrnnpnp 


1.36 


2.48 


134405 


R67275 


Hs.82772 


"""collaaen tvoe XI aloha 1""" 

ovJiiaywii, typw /a 1 1 alalia i 


0.76 


2.86 


134453 


X70683 


Hs.83484 


SRY f^py Hpfprmininn rpninn Y^-hnv A 

ur> I IOwA UClwl 1 1 III III iy IwyiUII 1 j UUA *t 


1.89 


3.78 


134470 


X54942 


Hs.83758 


CPiC.OR nrnteln kinase* 9 

piUlCIIJ rUllCJOC £ 


1.82 


4.11 


134645 


U87459 


Hs.1 67379 


T.anrpr/lp«5tk antinpn fNY-FSO-1 HTAR1 


0.82 


0.83 


134781 


M17183 


Hs.89626 


ParafhvrnlH hnrmnnp-.lfkp hnrmonp 
i aictuiyiuiu iiuiiihjiio~iii\c iiuiiuuiic 


{ 


1 


135002 


U19147 


Hs.272484 


antinpn fi 
diiuyen o 


1 


1 


100040 


M97935 




AFPX rnnfrnl* STAT1 

r\rrA UUIIUU1. O 1 r\ 1 1 


0.92 


1.25 


101201 


L22524 


Hs.2256 


mnlriv mplallAnrntpinasp 7 fmatrilvQirr 
ll lall ia iiioialiupi ULCiiiaoC i ^iiiaiiiiyoiii, 


2.92 


8.5 


101664 


M60752 


Hs.121017 


W9A hicfnnp ■famflu' momfior A 
Y\eJ\ llloLUIlfcJ Idlllliy, MlclUUcI A 


1 


1 


102025 


U0391 1 


Hs.78934 


miitQ /P rrAW hrnYmlnn 9 frnlrin mnrpr 1 
iiiuio \\-, (sUiij iiutiiuiuy /. \^aj\\j\\ oaiiuci , 


0.8 


1.61 


102031 


U04898 


Hs.2156 


rvr\r\ 1 Clalcu Ul pi Id II IGOBpiUI r\ 


\ 


1 


102221 


U24576 




1 IM domain nnlv A 

1—1 IV 1 UUIIlQlll \JlHj ' 


1 


1 


102270 


U30255 


Hs. 75888 


yii luopi luy iuuui laic uoi lyuiuyui iujv, 


1.08 


1.43 


102339 


U37022 


Hs.95577 




0.88 


1.32 


102391 


U41668 


Hs.77494 


UCUArU UCII iUOll lu rvll ICiaC 


1.07 


1.58 


103000 


X51956 


Hs. 146580 


pnnlacp 9* /namma* npnmnal\ 
eiiuiaoc £| yyaiiniia, ueuiunaiy 


0.91 


1.49 


103395 


X94754 


Hs.1 19503 


methionine-tRNA synthetase 


0.89 


1.32 


105638 


AA281599 


Hs.20418 


Homo sapiens mRNA for for histone H2B| c 


0.91 


1.25 


105726 


AA292328 


Hs.9754 


aulivcaiiiiy uciiiOuiipuuu lawiui <J 


0.94 


1.48 


114841 


AA234722 


Hs.55408 


FST«?- MnHpratelv similar to CALCIUM-DEPE 


0.78 


1.56 


115206 


AA262491 


Hs.1 86572 


ESTs 


1 


1 


115906 


AA436616 


Hs.82302 


ESTs 


0.74 


2.52 


119132 


R49046 


Hs.1 07911 


ATP-binding cassette; sub-family B {MDR/ 


1.1 


1.51 


124163 


H30539 


Hs.1 89838 


ESTs 


1 


1 


126487 


M482505 


Hs.1 84601 


<?nliifp oarripr familv 7 /rationio amino 

wUIUlw vQJ 1 Iwl IQlltllj t IvGIUUI llw Ol J III IV/ 


1.01 


1.46 


127141 


M307960 


Hs.75478 


KIAA0956 nrotein 


0.85 


1.4 


128034 


AA905754 


Hs.75103 


tv/m^inp .^-mnnnnYvnpnQ^p/trvntnnhan fi-mo 


1 


1.18 


128609 


AA234365 


Hs.1 02456 


survival of motor neuron protein interac 


1 


1.5 


128895 


R37753 


Hs.1 06985 


ESTs 


1.7 


2 


130199 


Z48579 


Hs.1 72028 


a disintegrin and metal loprotease domain 


1 


1 


130524 


U89995 


Hs.1 59234 


fnrkhpari hnx F1 




1 


133000 


U24152 


Hs.62402 


n91/f!Hrd9/Rar1-aotivatpH kinase 1 fveast 




1 


133658 


M25756 


Hs.75426 


eprrplnnranin II ^phromnnranin C\ 


1 


1 


135047 


AA460466 


Hs.93597 


ESTs 


\ 


1 


100053 


M27830 




APPy rnnlml' 9RS rihn«?nmal RNA 
r\r rA oui in ui. £Qw iiuuouiticii i\iir\ 


0.88 


1.53 


100114 


D00596 


Hs.82962 


thumiriulatp i\/n{hpta<ip 
uiyiiiiuyiaic oy i in iciaoc 


0.68 


1.86 


100128 


D11094 


Hs.61153 


proteasome (prosome; macropain) 26S subu 


{29 


2.03 


100154 


D14657 


Hs.81892 


KIAA0101 gene product 


0.71 


4.26 


100161 


D14694 


Hs.77329 


phosphatidylserine synthase 1 


1.02 


1.56 


100168 


D14874 


Hs.394 


adrenomedultin 


0.46 


1.17 


100187 


D17793 


Hs.78183 


aldo-keto reductase family 1; member C3 


1 


1 


100188 


D21063 


Hs.57101 


minichromosome maintenance deficient (S. 


0.97 


1.4 


100217 


D26600 


Hs.89545 


proteasome {prosome; macropain) subunit; 


1.13 


1.9 


100220 


D28364 




"""Human mRNA for annexin II, 5'UTR (seq 


1.11 


1.53 


100287 


D43950 


Hs.1 600 


chaperonin containing TCP1; subunit 5 (e 


1.13 


2.09 


100297 


D49489 


Hs!l82429 


protein disulfide tsomerase-related prat 


0.92 


1.78 


100330 


D55716 


Hs.77152 


minichromosome maintenance deficient (S. 


1.07 


1.61 


100355 


D78129 




"""Homo sapiens mRNA for squalene epoxid 


0.96 


1.87 


100364 


D78586 


Hs.1 54868 


carbamoyl-phosphate synthetase 2; aspart 


1.49 


2.46 


100368 


D79987 


Hs.153479 


extra spmdle poles; S. cerevisiae; homo 


0.59 


1.32 


100398 


D84557 


Hs.1 55462 


minichromosome maintenance deficient (mi 


1.08 


1.9 


100438 


D87448 


Hs.91417 


topoisomerase (DNA) II binding protein 


1 


2.15 
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100455 


D87953 


Hs.75789 


N-mvr Hnuun^trpflm rpnulatpri 

1 1 1 1 Ijf U UU W lluU Cwl 1 1 IG^UlulvU 


0.91 


1.48 


100491 


HG1153-HT1153 


NucleosidG Diphosphate Kinase Nm23-H2s 


0.99 


1.41 


mo5ift 

1 UUO 1 o 


HG174-HT174 




npc.mnnlflVin 1 
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Table 1B shows the accession numbers for those pkeys in Table 1A Ixking unigenelD's. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on' sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
Accession column. 



Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT Accessions 

100661 23182J BE623001 L05096 M383604 AW966416 N53295 AA460213 AW571519 AA603655 

100667 26401.3 L05424 X56794 S66400 X55150 W60071 AW351820 X55938 M83326 BE005289 BE070059 M83324 BE005248 BE069717 BE181648 BE069700 

AW606203 BE069721 AW382138 AW803776 BE463954 BE005334 8E005274 T27386 AA932714 AA972695 AW377728 AI632506 T29066 
AI783934 AW377727 BE163715 AL047291 AA279047 M523003 BE008048 BE440141 W23614 BE090519 BE092193 N29181 N20358 N44153 
BE546944 T69231 AW377441 AA907406 H50799 AW051416 AI420712 BE620922 AI279161 AA992549 W47198 BE005241 AI342696 H50700 
AI969974 A1863855 AA374490 AW1 30675 Al 950633 AA146687 H99482 X55150 BE005414 BE005339 N28294 AI673068 AI887890 AW804171 
AI675961 AW804172 AA778841 AL048050 A1127757 AI095568 AW204965 AW46S978 W31898 AI052595 AI278771 BE464018 AI081503 AI824196 
AA513211 AA411062AW084376 N48752 AA703209 N35580AW059918 AA054563 A1280942 T27619 BE621435 N66010 AW589527 AI160414 
AA283090 M962536 H82726 W521 1 5 W45432 W60433 AA577548 AA146714 BE150994 M054615 AW796025 AW382768 BE565671 C00444 
AA054555 

100668 26401_3 L05424X56794 S66400 X55150 W60071 AW351820 X55938 M83326 BE005289 BE070059 M83324 BE005248 BE069717 BE181648 BE069700 

AW606203 BE069721 AW3821 38 AW803776 BE463954 BE005334 BE005274 T27386 M932714 AA972695 AW377728 A1632506 T29066 
A1783934 AW377727 BE163715 AL047291 AA279047 M523003 BE008048 BE440141 W23614 BE090519 BE092193 N29181 N20358 N44153 
BE546944T69231 AW377441 AA907406 H50799 AW051416 AI420712 BE620922 AI279161 AA992549 W47198 BE005241 AI342696 H50700 
AI969974 AI863855 AA374490 AW130675 A1950633 AA146687 H99482 X55150 BE005414 BE005339 N28294 A1673068 AI887890 AW804171 
AI675961 AW804172AA778841 AL048050A1 127757 AI095568 AW204965 AW468978 W31898 AI052595 AI278771 BE464018 AI081503 AI824196 
AA513211 AA411062 AW084376 N48752 AA703209 N35580 AW059918 AA054563 AI280942 T27619 BE621435 N66010 AW589527 A1160414 
AA283090 AA962536 H82726 W521 15 W45432 W60433 AA577548 AA14671 4 BE1 50994 M054615 AW796025 AW382768 BE565671 C00444 
AA054555 

101332 25130.1 J04088 NM_001067AF071747 AJ011741 N85424 AL042407 M218572 BE296748 BE083981 AL040877 AW499918 AW675045 H17813 
BE081283 M670403 AW504327 BE094229 AA1 04024 AI471482 AI970337 AA73761 6 AI827444 AW003286 A1742333 AI344044 AI765634 
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Table 2A shows 504 genes down-regulated in lung tumors relative to normal lung and chronically diseased lung. Chronically diseased lung samples represent chronic non- 
malignant lung diseases such as fibrosis, emphysema, and bronchitis. These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03 Genechip array. Gene 
expression data for each probeset obtained from this analysis was expressed as average intensity (Al), a normalized value reflecting the relative level of mRNA expression. 
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90th percentile of Al for normal lung samples divided by the 80th percentile of Al for adenocarcinoma and squamous cell carcinoma lung tumor 
samples. 

median of Al for norma! lung samples divided by 90th percentile of Al for adenocarcinoma and squamous cell carcinoma lung tumor samples, 
median of Al for norma! lung samples minus the 15th percentile of Al for all normal lung, chronically diseased lung and tumor samples divided by 
the 90th percentile of Al for adenocarcinoma and squamous cell carcinoma lung tumor samples minus the 15th percentile of Al for all normal 
lung, chronically diseased lung and tumor samples. 

average of Al for normal lung samples divided by average Al for squamous cell carcinoma and adenocarcinoma lung tumors, 
median of Al for normal lung samples divided by the 90th percentile of Al for adenocarcinomas. 

median of Al for normal lung samples minus the 15th percentile of Al for all normal lung, chronically diseased lung and tumor samples divided by the 90th 
percentile of Al for adenocarcinomas minus the 15th percentile of Al for all normal lung, chronically diseased lung and tumor samples, 
average of Al for normal lung samples divided by the 90th percentile of Al for squamous cell carcinomas. 
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percentile of Al for squamous cell carcinomas minus the 15th percentile of Al for all norma) lung, chronically diseased lung and tumor samples. 
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3.58 



2.65 



17.00 



2.63 



2.35 

3.07 
2.48 

2.50 



59.20 



29.00 
32.20 



35.80 

39.60 
28.80 



8.00 



12.00 



9.00 



7.33 



10.00 



2.36 



32.20 
41.80 



82.80 

32.40 
70.60 

29.40 

30.80 

37.60 

34.20 

33.60 
29.80 

34.40 



29.40 
37.20 

38.40 



34.40 



41.60 
87.00 



19.00 



3.60 
3.14 



3.37 



3.95 



2.58 



13.00 



4.74 



3.14 



6.03 



10.00 



3.12 



2.79 



4.42 
2.58 

4.31 



2.66 



9.00 



11.00 



3.10 



3.17 



3.21 



4.02 
3.76 

3.68 
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129210 
129240 
129262 
129301 
129331 
129381 
129565 
129595 
129613 
129782 
129950 
129958 
129959 
130160 
130259 
130273 
130312 
130436 
130523 
130799 
130885 
131002 
131012 
131031 
131061 
131066 
131082 
131087 
131161 
131179 
131182 
131205 
131277 
131281 
131282 
131285 
131355 
131391 
131461 
131487 
131517 
131545 
131583 
131647 
131675 
131676 
131708 
131717 
131756 
131762 
131821 
131839 
131861 
132015 
132070 
132242 
132334 
132476 
132490 
132533 
132598 
132619 
132652 
132726 
133028 
133071 
133120 
133129 
133147 
133151 
133213 
133276 
133377 
133407 
133535 
133537 
133656 
133689 
133779 
133978 
133985 
134000 
134111 
134185 
134204 



AL039940 

M361258 

BE222198 

AF182277 

AW167668 

AW245805 

X77777 

U09550 

AW978517 

AW016932 

F07783 

R27496 

AL036554 

AA305688 

NM_000328 

AW972422 

AF056195 

NM 001928 

AA999702 

AB028945 

NMJ)05883 

AL050295 

AL039940 

NMJJ01650 

N64328 

AW169287 

AI091121 

AF147709 

AF033382 

AA171388 

AI824144 

NMJJ03102 

AA131466 

AA251716 

X03350 

AI567943 

R52804 

AW085781 

AA992841 

F13036 

AB037789 

AL1 37432 

AK000383 

AA359615 

H15205 

A1126821 

S60415 

X94630 

AA443966 

AA744902 

AA017247 

AB014533 

AL096858 

AI418006 

BE622641 

AA332697 

AW080704 

AL1 19844 

NM.001290 

AI922988 

X80031 

H28855 

N41739 

N52298 

R51604 

BE384932 

NM_003278 

AA428580 

AA026533 

NM_014051 

AA903424 

AW978439 

AJ131245 

AF017987 

AL1 34030 

U41518 

BE149455 

NM_001872 

T58486 

AF035718 

L34657 

AW175787 

AI372588 

AA285136 

AI873257 



Hs.202949 

Hs.237868 

Hs.109843 

Hs.330780 

Hs.279772 

Hs.110903 

Hs.1 98726 

Hs.1154 

Hs.172847 

Hs.1 04105 

Hs.1369 

Hs.1 378 

Hs.274463 

Hs.267695 

Hs.153614 

Hs.153863 

Hs.1 5430 

Hs.1 55597 

Hs.214507 

Hs.12696 

Hs.20912 

Hs.22039 

Hs.202949 

Hs.288650 

Hs.268744 

Hs.22588 

Hs.246218 

Hs.22824 

Hs.23735 

Hs.1 84482 

Hs.23912 

Hs.2420 

Hs.23767 

Hs.25227 

Hs.4 

Hs.25274 

Hs.25956 

Hs.26270 

Hs.27263 

Hs.27373 

Hs.263395 

Hs.28564 

Hs.323092 

Hs.30089 

Hs.30509 

Hs.30514 

Hs.30941 

Hs.3107 

Hs.31595 

Hs.107767 

Hs.164577 

Hs.33010 

Hs.184245 

Hs.3731 

Hs.38489 

Hs.42721 

Hs.45033 

Hs.49476 

Hs.4980 

Hs.172510 

Hs.530 

Hs.53447 

Hs.61260 

Hs.55608 

Hs.300842 

Hs.64313 

Hs.65424 

Hs.65551 

Hs.66 

Hs.94896 

Hs.6786 

Hs.69504 

Hs.7239 

Hs.7306 

Hs.284180 

Hs.74602 

Hs.75415 

Hs.75572 

Hs.222566 

Hs.78061 

Hs.78146 

Hs.334841 

Hs.8022 

Hs,301914 

Hs.7994 



KJAA1102 protein 

interleukin 7 receptor 2.29 
ESTs 

Human cytochrome P450-IIB {h)!B3) mRNA; 
ESTs; Highly similar to CGI-38 protein [ 

claudin 5 (transmembrane protein deleted 2.93 
vasoactive intestinal peptide receptor 1 1 60. 80 

oviductal glycoprotein 1 ; 1 20kD 1 0.00 

ESTs; Weakly similar to collagen alpha 1 

EST 9.00 

decay accelerating factor for complement 87.80 

annexin A3 44.60 

defensin; alpha 1 ; myeloid-related seque 2.72 

UDP-GakbetaGlcNAc beta 1 ;3-galactosy)tr 42.20 

retinitis pigmentosa GTPase regulator 2.54 

MAD {mothers against decapentaplegic; Dr 51 .60 

DKFZP586G1 21 9 protein 

D component of complement (adipsin) 

ESTs 

ESTs 6.00 
adenomatous polyposis colt like 
KIAA0758 protein 

KIAA1102 protein 20.00 
aquaporin4 41.20 

ESTs; Moderately similar to KIAA0273 [H. 31 .40 

ESTs 29.60 

ESTs; Weakly similar to zinc finger prot 9.00 

ESTs; Weakly similar to p160 myb-binding 

potassium voltage-gated channel; subfami 

DKFZP586D0624 protein 

ESTs 

superoxide dismutase 3; extracellular 2.98 

ESTs 3.15 

ESTs 32.20 

alcohol dehydrogenase 3 {class )); gamma 

ESTs; Moderately similar to putative sev 

DKFZP564D206 protein 8.00 

ESTs 10.00 

butyrate response factor 2 (EGF-response 28.80 

Homo sapiens mRNA; cDNA DKFZp56401763(f 

ESTs; Highly similar to semaphorin Via [ 39.00 

ESTs 11.00 
ESTs; Weakly similar to dual specificity . 10.00 

ESTs 2.47 
ESTs 

ESTs 45.80 
calcium channel; voltage-dependent; beta 2.28 
CD97 antigen 
ESTs 

ESTs; Moderately similar to CaM-KII inhi 

ESTs 2.87 
KIAA0633 protein 

KIAA0929 protein Msx2 interacting nuclea 54.00 
ESTs 
ESTs 

ESTs 2.68 
lacrimal proline rich protein 4.66 
Homo sapiens clone TUA8 Cri-du-chat regi 34.20 
LIM binding domain 2 2.66 
ESTs 13.00 
collagen; type IV; alpha 3 (Goodpasture 
ESTs; Moderately similar to kinesin ligh 
ESTs 

ESTs; Weakly similar to cDNA EST yk484g1 1 1 .43 

ESTs 2.37 

ESTs 2.27 

tetranectin (plasminogen-binding protein 2.63 

ESTs 

interleukin 1 receptor-like 1 6.20 
ESTs 

ESTs 31.40 
ESTs 9.00 
SEC24 (S. cerevisiae) related gene famil 41 .20 

secreted frizzled-related protein 1 50.20 
protocadherin 2 (cadherin-like 2) 
aquaporin 1 (channel-forming integral pr 

Accession not ilsted in Genbank 2.65 
carboxypeptidase B2 (plasma) 90.80 
ESTs 

transcription factor 21 2.92 
platelet/endothelial cell adhesion molec 
selenium binding protein 1 

TU3A protein 4.49 
Homo sapiens mRNA; cDNA DKFZp586K1 220 (f 

ESTs; Weakly similar to CGl-69 protein [ 40.80 



PCT/US02/12476 

4.09 



3.30 



3.40 



4.05 
4.09 



3.16 
4.77 
3.54 



3.14 
3.80 



6.40 



4.11 
3.50 

3.86 
3.67 

3.44 



4.03 



3.06 



40.60 



49.20 
34.80 



30.60 



3.78 
3.67 



3.48 



4.02 
3.18 



5.49 



3.72 



3.05 



3.35 



3.45 
4.05 



3.27 
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134641 
134677 
134745 
134749 
134786 
134825 
134978 
135010 
135053 
135081 
135091 
135135 
135203 
135236 
135266 
135346 
135378 
135387 
135388 
135402 



AI092634 

M251363 

NM_000685 

T28499 

T29618 

U33749 

AI829008 

N50465 

AW796190 

AF069517 

AA493650 

AA775910 

C15737 

AI636208 

R41179 

NM.000928 

AW961818 

NM.001972 

W27965 

L12398 



Hs.156114 

Hs.177711 

Hs.89472 

Hs.89485 

Hs.89640 

Hs.1 97764 

Hs.333383 

Hs.92927 

Hs.93678 

Hs.173993 

Hs.94367 

Hs.95011 

Hs.269386 

Hs.96901 

Hs.97393 

Hs.992 

Hs.24379 

Hs.99863 

Hs.99865 

Hs.99922 



protein tyrosine phosphatase; non-recept 
ESTs 

angiotensin receptor 1B 

carbonic anhydrase IV 

angiopoietin 1 receptor; TEK tyrosine ki 

thyroid transcription factor 1 

ficolin (collagen/fibrinogen domain-cont 

ESTs 

ESTs 

RNA binding motif protein 6 
ESTs 

syntrophin; beta 1 (dystrophin-associate 

ESTs 

ESTs 

Human mRNA for KIAA0328 gene; partial cd 
phospholfpase A2; group IB (pancreas) 
potassium voltage-gated channel; shaker- 
elastase 2; neutrophil 
EST 

dopamine receptor D4 



15.00 



3.05 
2.52 



32.20 
57.80 
31.60 
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3.76 



3.73 



28.80 



43.00 



37.20 
38.80 



8.00 



3.82 
4.15 



3.21 



4.31 



4.21 



4.24 



6.42 



TABLE 2B shows the accession numbers for those primekeys lacking unlgenelD's for Table 2A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
"Accession" column. 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey 
108447 



108655 
102397 
126303 
125810 
103627 
121366 
114609 
115272 
108338 
108434 
123802 
102310 
102636 
104776 
120504 
113502 
108499 
101308 
108629 
103098 
103241 
103508 
103575 
119514 
121082 
128634 
105817 
121518 
114449 
114648 
121950 
107723 



CAT number Accessions 

43452.-7 AA079126 

1 20073.1 M084867 AA084996 

127522.1 AA099960 M113013 

44371.-1 U41898 

1525933.1 D78841 D78880 

1554054.1 H00083R81062 

2615.2 Z48513Z48512 

280401.1 AI74351 5 AA40561 7 AW276706 

1 1 6777.1 AA079505 AA079537 

1 721 1 3.1 AW015947 AA21 1890 M279425 

1 1 21 86.1 AA070773 AA070774 

1 1401 2 J M078899 AA078782 AA075788 

genbank w _AA620448 AA620448 

NOT_FOUND_entrez_U33839 U33839 

entrez„U67092 U67092 

genbanl^AA026349 AAQ26349 

genbank AA256837 M256837 

genbank T89130T89130 

genbank_AA083103 AA083103 

entrez.141390 L41390 

genbank_AA102425 AA102425 

221.215 M86361 Z26593 X02850 D13070 AE000659 M17649 M87869 M87871 X61077 M16286 AF018169 X61079 S59351 X60142 AF043169 

entrez_X76223 X76223 

entrez_Y10141 Y10141 

entrez_Z26256 Z26256 

NOT.FOUND_entrez_W37937 W37937 

genbank_AA398722 AA398722 

AA464918_at M464918 

genbank_AA397825 AA397825 

genbank.AA412155 AA412155 

genbank.AA020736 AA020736 

genbank.AA1 01 056 AA1 01 056 

genbanleAA429515 M429515 

genbanleAAOl 5967 M01 5967 
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Table 3A shows 452 genes up-regulated In chronically diseased lung relative to normal lung. Chronically diseased lung samples represent chronic non-malignant lung diseases 
such as fibrosis, emphysema, and bronchitis. These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03 Genechip array. Gene expression data for each 
probeset obtained from this analysis was expressed as average intensity (Al), a normalized value reflecting the relative level of mRNA expression. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1: 80th percentile of Al for chronically diseased lung samples divided by the 90th percentile of Al for normal lung samples. 

R2: 80th percentile of Al for chronically diseased lung samples divided by the 90th percentile of normal lung samples, squamous cell carcinomas and 
adenocarcinomas 

R3: 70th percentile of Al for chronically diseased lung samples minus the 15th percentile of Al for all normal lung, chronically diseased lung and tumor samples 



divided by the 90th percentile of normal lung samples, squamous cell carcinomas and adenocarcinomas minus the 15th percentile of Al for all normal lung, 
chronically diseased lung and tumor samples 



Pkey 


ExAccn 


UnigenelD 


Unigene Title 


R1 


R2 


R3 


135423 


U50531 


Hs. 138751 


Human BRCA2 region, mRNA sequence CG030 


12.40 






135378 


AW961818 


Hs.24379 


MUM2 protein 






2.13 


135346 


NM_000928 


Hs.992 


phospholipase A2, group IB (pancreas) 








135235 


AW298244 


Hs.293507 


ESTs 


12.40 






135057 


U90268 


Hs.93810 


cerebral cavernous malformations 1 


11.67 






134951 


BE305081 


Hs. 169358 


hypothetical protein 




8.00 




134799 


M36821 


Hs.89690 


GR03 oncogene 




8.20 




134786 


T29618 


Hs.89640 


TEK tyrosine kinase, endothelial {venous 








134772 


mm 0Q0829 


Hs 163697 


glutamate receptor, ionotrophic, AMPA 4 


29.80 






134752 


RF246762 


Hs 89499 


arachtdonate 5-lipoxygenase 






1.93 


134749 


T28499 


Hs.89485 


carbonic anhydrase IV 






2.07 


134696 


BE326276 


Hs.8861 


ESTs 








134636 


NM 00^82 


Hs 87205 


Ivmnhnrutp antiapn fi4 fmnuss} homoloa r 


13.60 






134627 


AI018768 


Hs. 12482 


glyceronephosphate O-acyltransf erase 






1.92 


134622 


AW9751 59 


Hs.293097 


ESTs, Weakly similar to A55380 faciogenl 






1.92 


134570 


U66615 


Hs. 172280 


SWI/SNF related, matrix associated, acti 


13.20 






134561 


U76421 


Hs.85302 


adenosine deaminase, RNA-specific, B1 (h 






1.78 


134468 


NM_001772 


Hs.83731 


CD33 antigen (gp67) 




6.20 




134417 


NM 006416 


Hs.82921 


solute carrier family 35 (CMP-sialic aci 








134343 


D50683 


Hs. 8202S 


transforming growth factor, beta recepto 








134323 


BE170651 


Hs.8700 


deleted in jjyer cancer 1 








134300 


NM 001430 

I^HVI \J\I 1 *TWW 


Hs8136 


endothelial PAS domain protein 1 








134299 


AW580939 


Hs.97199 


complement component C1q receptor 








134253 


X52075 


Hs.80738 


sialophorin (gpL115, leukosialin, CD43) 


20.60 






134182 


D52059 


Hs.7972 


KIAA0871 protein 


12.20 






133985 


L34657 


Hs.78146 


platelet/endothelial cell adhesion motec 








133978 


AF035718 


Hs.78061 


transcription factor 21 








133835 


AI677897 


Hs.76640 


RGC32 protein 








133651 


AI301740 


Hs 173381 


dihvdrnnvrimidlnasp-like 2 








133633 


D21262 


Hs.75337 


nucleolar and coiled-body phosphprotein 


15.20 






133565 


AW955776 


Hs.313500 


ESTs, Moderately similar to ALU7J-1UMAN A 








133548 


AW946384 


Hs.178112 


DNA segment, single copy probe LNS-CAI/L 






1.77 


133488 


AA335295 


Hs.74120 


adipose specific 2 








133478 


X83703 


Hs.31432 


cardiac ankyrin repeat protein 






2.08 


133337 


AF085983 


Hs.293676 


ESTs 




9.60 




133200 


AB037715 

nUUw ft 1 w 


Hs. 183639 


hypothetical protein FU 10210 






1.77 


133153 


AF070592 


Hs.66170 


HSKM-B protein 


30.60 






133130 

I ww 1 uU 


AH 28606 


Hs 6557 


zinc finger protein 1 61 


22.60 






133120 


NM_003278 


Hs.65424 


tetranectin (plasminogen-binding protein 








132928 


AW?68082 

nil i uuuul 


Hs. 169449 


protein kinase C, alpha 


13.80 






132836 

1 OtUOU 


AB023177 

nDUtw lit 


Hs 29900 

1 IO»£wvUU 


KIAA0960 protein 








132799 


W73311 


Hs. 169407 


SAC2 (suppressor of actin mutations 2, 


41.60 






132742 


AA025480 


Hs.292812 


ESTs, Weakly similar to T33468 hypotheti 


40.40 






132548 


X12830 


Hs.1 93400 


interleukin 6 receptor 




7.20 




132476 


AL1 19844 


Hs.49476 


Homo sapiens clone TUA8 Cri-du-chat regi 




4.76 




132439 


AK001942 


Hs.4863 


hypothetical protein DKFZp566A1524 






1.88 


132240 


AB018324 


Hs.42676 


KIAA0781 protein 


21.20 






132210 


NIVL007203 


Hs.42322 


A kinase (PRKA) anchor protein 2 






1.99 


132199 


AL041299 


Hs.165084 


ESTs 


15.20 






131751 


T96555 


Hs.31562 


ESTs 






1.76 


131745 


AI828559 


Hs.31447 


ESTs, Moderately similar to A4601 0 X-li 


27.80 






131694 


NM 000246 


Hs.3076 


MHC class II transactivator 




4.00 




131686 


NM_012296 


Hs.30687 


GRB2-associated binding protein 2 








131676 


AI126821 


Hs.30514 


ESTs 




6.20 




131629 


Z45794 


Hs.238809 


ESTs 


21.40 






131589 


C18825 


Hs.29191 


epithelial membrane protein 2 








131536 


AA019201 


Hs.269210 


ESTs 




9.40 




131517 


AB037789 


Hs.263395 


sema domain, transmembrane domain (TM), 




3.59 




131355 


R52804 


Hs.25956 


DKFZP564D206 protein 




4.48 




131253 


R71802 


Hs.24853 


ESTs 


15.00 






131207 


AF104266 


Hs.24212 


latrophilin 






1.75 


131156 


AI472209 


Hs.323117 


ESTs 






1.84 


131066 


AW169287 


Hs.22588 


ESTs 




3.54 




131061 


N64328 


Hs.268744 


KIAA1796 protein 








131053 


AA348541 


Hs.296261 


guanine nucleotide binding protein (G pr 






1.93 


130895 


AA641767 


Hs.21015 


hypothetical protein DKFZp564L0864 simil 


16.60 






130762 


D84371 


Hs.1898 


paraoxonase 1 


12.00 
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130657 


AW337575 


Hs.201591 


ESTs 




130655 


A1831962 

niuv i jut 


Hs 17409 


nwtpinp-rinh nrntpin 1 fintpstinaN 




130589 


AL1 10226 


Hs. 16441 


DKFZP434H204 protein 








hk 189611 
no. i otu i i 


Q^Info carrioi* famtlu 11 fnrnfnn-pru mfpd 
oututo uaiiici laiiiiiy i i ^jjiuluii (juupicu 




130555 


R6Q743 


Hs 116774 


infpnrin alnha 1 
iincyuii, alalia i 




130365 


W56119 


Hs.155103 


eukaryotic translation initiation factor 


11.60 


130273 


AW972422 


Hs. 153863 


MAD (mothers against decapentaplegic, Dr 




130259 


NM_000328 


Hs.153614 


retinitis pigmentosa GTPase regulator 




130090 


H97878 


Hs.1 32390 


zinc finger protein 36 (KOX 18) 


21.20 


129958 


R27496 


Hs.1378 


annexin A3 




129898 


AI672731 


Hs.13256 


ESTs 




129875 


AA181018 


Hs. 13056 


hypothetical protein FLJ13920 


18.60 


129699 


AB007899 


Hs.12017 


homolog of yeast ubiciuitin-protein ligas 




129626 


F 13272 


Hs 1 1 1334 


ferritin liahl oolvDeDtide 




129598 


N30436 


Hs.11556 


Homo sapiens cDNA FLJ 12566 fis, clone NT 


22.63 


129593 


Al 338247 


Hs.98314 


Homo sapiens mRNA; cDNA DKFZp586L0120 (f 




129565 


X77777 


Hs 198726 


Vfl^nartivp intpstinal npntiHp recpnfnr 1 




129527 


AA769221 


Hs 270847 


Hplta-hihulin 


39.20 


129402 


W72062 


Hs.11112 


ESTs 




129385 


AA172106 


Hs. 110950 


Rag C protein 


15.20 


129315 


NM_0 14563 


Hs. 174038 


spondyloepiphyseal dysplasia, late 


12.40 


129312 


T97579 


Hs 110334 


ESTs, Weakly similar to I78885 serine/th 


20.83 


129240 


aa 36 1258 


Hs 237868 


interleukin 7 receptor 




129210 


AL039940 


Hs.202949 


KIAA1102 protein 




129122 


AW958473 


Hs.301957 


nudix (nucleoside diphosphate linked moi 




129057 


N90866 


Hs.276770 


CDW52 antigen (CAMPATH-1 antigen) 




128946 


Y13153 


Hs 107318 

1 lw- 1 \J t \J IU 


kynurenine 3-monooxygenase (kynurenine 3 




128798 


AF0 15525 


Hs 302043 


rhprnokine (C>-C, motifl rpnpntor-likp 2 

\ji Id 1 llmll IW ' W llllflliy 1 WWWjJUJI llf\w £- 




128789 
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ESTs 6.04 
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NAG-7 protein 2.15 
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hypothetical protein FLJ23191 

ESTs 
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ESTs 
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ESTs 14.33 

zinc finger protein 204 6.00 
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ESTs 12.20 

frizzled (Drosophila) homolog 4 1.75 
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hypothetical protein FLJ20761 5.60 

ESTs 2.31 
Homo sapiens cDNA FLJ 13545 fis, clone PL 

ESTs 11.25 

ESTs 2.68 
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hypothetical protein 
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KIAA0863 protein 

gb:zn98g07.s1 Stratagene fetal retina 93 
ESTs, Weakly similar to T26845 hypotheti 
gb:z!84c04.s1 Stratagene colon (937204) 
hypothetical protein DKFZp434l0428 
macrophage receptor with collagenous str 
ESTs 

Homo sapiens mRNA; cDNA DKFZp564G112 (fr 
ESTs 

Homo sapiens cDNA FLJ 14237 fis, clone NT 
ESTs 

chordin-Iike 

UM domains containing 1 

Ig superfamily receptor LNIR 

ESTs, Moderately similar to ALU7_HUMAN A 

EST 

DKFZP566F21 24 protein 

Homo sapiens serologically defined breas 

ESTs 

RAB7, member RAS oncogene family 
KIAA1563 protein 
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myeloid/lymphoid or mixed-lineage leukem 
hypothetical protein FLJ20417 
indolethylamine N-methyltransferase 
serum deprivation response (phosphatidyl 
ESTs 

sperm associated antigen 6 

period (Drosophlla) homolog 3 

hypothetical protein FLJ11273 

Homo sapiens cDNA FLJ 13569 fis, clone PL 

ESTs 

triple functional domain (PTPRF interact 
hypothetical protein FLJ 12492 
ESTs 

Homo sapiens mRNA; cDNA DKFZp564B076 (fr 
chromosome 9 open reading frame 5 
plakophilin 4 
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ESTs 

protein phosphatase 1 , regulatory (inhib 
putative dipeptidase 
hypothetical protein FLJ 1061 8 
ESTs 

KIAA1301 protein 

receptor (calcitonin) activity modifying 
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ESTs, Moderately similar to A56194 throm 

nuclear receptor coactivator 2 

Homo sapiens HSPC285 mRNA, partial cds 

ESTs 

Homo sapiens PR02751 mRNA, complete cds 
ESTs 

hypothetical protein FLJ22029 
poly(A)-binding protein, nuclear 1 
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ESTs 

DKFZP434B203 protein 

gb:zj99f01.s1 Soares_pregnantuterusJlbH 

Homo sapiens beta-1 adrenergic receptor 

ESTs 

gb:EST00057 HE6W Homo sapiens cDNA clone 
Homo sapiens cDNA FLJ 10229 fis, clone HE 
KIAA0300 protein 

Homo sapiens mRNA; cDNA DKFZp434M229 (fr 
hypothetical protein FLJ 10849 
succinale-CoA ligase, GDP-forming, alpha 
caveolin 1, caveolae protein, 22kD 
CD83 antigen (activated B lymphocytes, i 
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UM domain only 2 (rhombotin-like 1) 
protein tyrosine phosphatase, receptor t 
progastricsin (pepsinogen C) 
CUG triplet repeat, RNA-binding protein 
CDP-diacylglycero! synthase (phosphatida 
signal transducing adaptor molecule (SH3 
amine oxidase, copper containing 3 {vase 
protein kinase C-like 2 
guanine nucleotide binding protein 11 
chemokine (C-X3-C) receptor 1 
steroidogenic acute regulatory protein 
spleen tyrosine kinase 
mannose receptor, C type 1 
myeloid cell nuclear differentiation ant 
S100 calcium-binding protein A4 (calcium 
tachykinin, precursor 1 (substance K, su 
complement component 5 receptor 1 (C5a I 
gb:Human alpha satellite and satellite 3 
coagulation factor VIII, procoagulantco 
hydroxyprostaglandin dehydrogenase 15-(N 
calcitonin receptor-like 
FBJ murine osteosarcoma viral oncogene h 
enhancer of fomentation 1 (cas-likedo 
microfibrillar-associated protein 4 
gb:Human dystrophin (dpi 40) mRNA, 5' end 
G protein-coupled receptor kinase 5 
transforming growth factor, beta recepto 
solute carrier family 6 (neurotransmitte 
Charot-Leyden crystal protein 
fatty acid binding protein 4, adipocyte 
S164 protein 

amyloid beta (A4) precursor protein (pro 
HIR (histone cell cycle regulation defec 
gb:Human nonmuscle myosin heavy chain-B 
KIAA0237 gene product 
sre homology three (SH3) and cysteine ri 
Down syndrome critical region gene 1-lik 

growth differentiation factor 10 
macrophage scavenger receptor 1 
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TABLE 3B shows the accession numbers for those primekeys lacking unigenelD's for Table 3A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
"Accession" column. 



Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 



123619 
126433 
125831 
126816 
126852 
121059 
120637 
122011 
120934 
123802 
116814 
118329 
104404 
104776 
113502 
101262 
108573 
101447 
124357 
108781 
112794 
100351 
100555 



371681.1 AA602964 AA609200 
127143 1 AA325606AA099517N89423 
1522905J H04043 D60988 D60337 
122973 1 AA248234AA090985 
136135J AA399961 AA128347 
273450J AA393283 AA398628 
200885.1 AA81 1 804 AA809404 AA286907 AW977624 
7617_-2 AA431082 
177521.1 AA226198 AA226513 AA383773 
AA620448 
H50834 
N63520 



genbank_AA620448 
genbank_H50834 
genbank_N63520 
H58762_at H58762 
genbank.AA026349 
genbank_T89130T89130 
entrezj.35854 L35854 
genbank_AA086005 
entrez_M21305 M21305 
genbankJJ22401 
genbankJ\A1 28654 
genbank_R97018 
entrez_D64158 D64158 
tigr_HT2245 M69181 M81 105 U51039 



AA026349 



AA086005 

N22401 

AA128654 

R97018 
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Table 4A shows 202 genes up-regulated In samples from patients treated with chemotherapy or radiotherapy. These genes were selected from 59680 probesets on the 
Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (Al), a normalized value reflecting 
the relative level of mRNA expression. 

Pkey: Unique Eos probeset identifier number 
ExAccn: Exemplar Accession number, Genbank accession number 
UnigenelD: Unigene number 
Unigene Title: Unigene gene title 
R1: 



average of Al for samples from patients treated with chemotherapy or radiotherapy divided by the average of Al for normal lung samples. 



Pkey ExAccn 



100113 
100187 
100210 
100225 
100269 
100438 
100877 
100893 
101273 
101447 
101649 
101724 
101748 
101809 
101879 
101915 
101973 
102025 
102031 
102052 
102391 
102420 
102610 
102829 
103000 
103036 
103507 
103587 
104660 
104896 
105038 
105298 
105510 
105667 
106073 
106205 
106516 
106533 
106575 
106654 
106851 
106995 
107332 
107532 
107922 
108609 
108780 
109166 
109260 
109280 
109292 
109384 
109415 
W445 
109502 
109633 
109786 
109958 
110920 
110924 
111084 
111132 
111229 
111337 
111987 
112046 
112268 
112685 
112871 
112897 
112973 
112992 
113073 



NM_001269 

D17793 

D26361 

D28539 

NMJ301949 

M013051 

X80821 

BE245294 

Z11933 

M21305 

AW959908 

L11690 

NM.001944 

M86849 

M1 76374 

AF207881 

U41514 

U04045 

U04898 

NM_002202 

AA296874 

U44060 

U65011 

NM_006183 

NM.001975 

M13509 

AJ000512 

BE270266 

BE298665 

AW015318 

AW603733 

BE387790 

Z42047 

AA767526 

AL157441 

AW965058 

AL137311 

AL1 34708 

AW970602 

AW075485 

AI458623 

AB023139 

T87750 

AA443473 

BE1 53855 

BE409857 

AU076442 

AA219691 

AW978515 

AK001355 

AW975746 

AA219172 

U80736 

AA232103 

AW967069 

AW003785 

AI989482 

AA001266 

N47224 

AW058463 

H44186 

AB037807 

AW389845 

AA837396 

NMJJ15310 

AA383343 

W39609 

R87650 

AL1 10216 

AW206453 

AB033023 

AL1 57425 

N39342 



UnigenelD Unigene Title 

Hs.84746 chromosome condensation 1 

Hs.781 83 aldo-keto reductase family 1 , member C3 

Hs.3104 KIAA0042 gene product 

Hs.1671 85 glutamate receptor, metabotropic 5 

Hs.1 1 89 E2F transcription factor 3 

Hs.9141 7 topoisomerase (DNA) II binding protein 

Hs.27973 KIAA0874 protein 

Hs.180789 S164 protein 

Hs. 182505 POU domain, class 3, transcription facto 
gb:Human alpha satellite and satellite 3 

Hs.1690 heparin-binding growth factor binding pr 

Hs.620 bullous pemphigoid antigen 1 (230/240kD) 

Hs.1925 desmoglein 3 (pemphigus vulgaris antigen 

Hs.323733 gap junction protein, beta 2, 26kD (conn 

Hs.243886 nuclear autoantigenic sperm protein (his 

Hs.1551 85 cytosolic ovarian carcinoma antigen 1 

Hs.80120 UDP-N-acety)-alpha-D-galactosamine:polyp 

Hs.78934 mutS (E. coli) homotog 2 (colon cancer, 

Hs.21 56 RAR-related orphan receptor A 

Hs.505 ISL1 transcription factor, LIM/homeodoma 

Hs.77494 deoxyguanosine kinase 

Hs.14427 Homo sapiens cDNA: FLJ21800 fis, clone H 

Hs.30743 preferentially expressed antigen in mela 

Hs.80962 neurotensin 

Hs. 146580 enolase 2, (gamma, neuronal) 

Hs.83169 matrix metalloproteinase 1 (interstitial 

Hs.296323 serum/glucocorticoid regulated kinase 

Hs.821 28 5T4 oncofetal trophoblast glycoprotein 

Hs.14846 Homo sapiens mRNA; cDNA DKFZp564D016 (fr 

Hs.23165 ESTs 

Hs.9414 KIAA1488 protein 

Hs.26369 hypothetical protein FLJ20287 

Hs.283978 Homo sapiens PR02751 mRNA, complete cds 

Hs.22030 paired box gene 5 (B-cell lineage specif 

Hs. 17834 downstream neighbor of SON 

Hs.11 1583 ESTs, Weakly similar to I38022 hypotheti 

Hs.234074 Homo sapiens mRNA; cDNA DKFZp76tG02121 ( 

Hs.145998 ESTs 

Hs.105421 ESTs 

Hs.286049 phosphoserine aminotransferase 

gb:tk04g09.x1 NCI_CGAPj_u24 Homo sapiens 

Hs.37892 KIAA0922 protein 

Hs.183297 DKFZP566F2124 protein 

Hs.173684 Homo sapiens mRNA; cDNA DKFZp762G207 (fr 

Hs.61 460 Ig superfamily receptor LNIR 

Hs.69499 hypothetical protein 

Hs.1 17938 collagen, type XVII, alpha 1 

Hs.73625 RAB6 interacting, kinesin-like (rabkines 

Hs.131915 KIAA0863 protein 

Hs.279610 hypothetical protein FLJ10493 

Hs.188662 KIAA1702 protein 

Hs.86849 ESTs 

Hs.1 10826 trinucleotide repeat containing 9 

Hs.189915 ESTs 

Hs.21 1556 hypothetical protein MGC5487 

Hs.170267 ESTs 

Hs.1 46286 kinesln family member 13A 

Hs.133521 ESTs 

Hs.20521 HMT1 (hnRNP methyltransferase, S. cerevi 

Hs.1 2940 zinc-fingers and homeoboxes 1 

Hs.1 5456 PDZ domain containing 1 

Hs.83293 hypothetical protein 

Hs.1 10855 ESTs 

Hs.263925 LI S1 -interacting protein NUDE1 , rat homo 

Hs.6763 KIAA0942 protein 

Hs.221 1 6 CDC1 4 (cell division cycle 14, S. cerevi 

Hs.22003 solute carrier family 6 (neurotransmitte 

Hs.33439 ESTs, Weakly similar to ALU1 _HUMAN ALU 

Hs.1 2285 ESTs, Weakly similar to 155214 salivary 

Hs.3782 ESTs 

Hs.318127 hypothetical protein FLJ10201 

Hs.133315 Homo sapiens mRNA; cDNA DKFZp761J1324 (f 

Hs.103042 microtubule-associated protein 1B 



R1 

27.20 

20.60 

20.40 

20.60 

29.40 

23.50 

35.56 

43.40 

21.80 

193.60 

38.40 

198.80 

78.60 

162.20 

50.00 

26.00 

37.20 

32.00 

51.20 

13.90 

28.80 

110.60 

116.80 

2.30 

181.40 

49.20 

86.60 

42.60 

29.40 

21.50 

32.80 

20.20 

28.40 

25.40 

32.00 

40.60 

59.80 

43.40 

50.80 

53.40 

20.88 

23.60 

57.20 

49.00 

19.67 

48.17 

59.20 

28.60 

22.80 

21.00 
31.60 
24.20 
21.40 
20.40 
19.60 
24.00 
28.40 
36.00 
61.20 
24.60 
27.20 
48.00 
37.80 
26.80 
63.80 
26.40 
47.64 
22.00 
65.00 
42.00 
55.40 
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113494 T91451 Hs.86538 ESTs 22.80 

113560 T91015 Hs.268626 ESTs 22.80 

113849 AA457211 Hs.8858 bromodomain adjacent to zinc finger doma 51.80 

1 1 3950 AI267652 Hs.30504 Homo sapiens mRNA; cDNA DKFZp434E082 (fr 28.20 

5 114339 AA782845 Hs.22790 ESTs 20.20 

114365 H42169 Hs.18653 hypothetical protein FL) 14627 21.00 

114455 H37908 Hs.271616 ESTs, Weakly similar to ALU8_HUMAN ALU S 25.80 

114518 AW163267 Hs.106469 suppressor of varl (S.cerevisiae) 3-like 23.60 

114824 M960961 Hs.305953 zinc finger protein 83 (HPF1) 27.20 

10 114837 BE244930 Hs.166895 ESTs 30.20 

114974 AW966931 Hs.179662 nucleosome assembly protein 1 -like 1 20.80 

115075 AA814043 Hs.88045 ESTs 30.60 

115084 BE383668 Hs.42484 hypothetical protein FLJ 1061 8 28.86 

. _ 115291 BE545072 Hs.122579 hypothetical protein FLJ 10461 38.00 

15 115313 AA808001 Hs.1 84411 albumin 22.60 

115697 D31382 Hs.63325 transmembrane protease, serine 4 173.60 

1 1 5909 AW872527 Hs.59761 ESTs, Weakly similar to DAP1.HUMAN DEATH 27.77 

116090 A1591147 Hs.61232 ESTs 20.80 

116107 AL133916 Hs.172572 hypothetical protein FLJ20093 164.20 

20 116399 AA889120 Hs.1 10637 homeoboxA10 38.00 

1 1 7099 H93699 gb:y v1 6a1 1 .s1 Soares fetal liver spleen 21 .60 

117881 AF161470 Hs.260622 butyrate-induced transcript 1 49.40 

118091 AW005054 Hs.47883 ESTs, Weakly similar to KCC1 .HUMAN CALCI 22.40 

118138 AA374756 Hs.93560 Homo sapiens mRNA for KIAA1 771 protein, 22.00 

25 118720 N73515 gb:za49d07.s1 Soares fetal liver spleen 20.00 

118873 AI824009 Hs.44577 ESTs 19.40 

119126 R45175 Hs.117183 ESTs 111.20 

119717 AA918317 Hs.57987 B-cell CLL/iymphoma 11B (zinc finger pro 33.00 

119940 AL050097 Hs.272531 DKFZP586B031 9 protein 31.00 

3 0 1 20266 AI807264 Hs.205442 ESTs, Weakly similar to T34036 hypotheti 20.20 

120515 AA258356 gb:zr59c10.s1 Soares_NhHMPu_S1 Homosapi 25.00 

120859 AA826434 Hs.1 619 achaete-scute complex (Drosophila) homol 95.40 

120983 AA398209 Hs.97587 EST 105.20 

121054 AW976570 Hs.97387 ESTs 38.80 

35 121369 AW450737 Hs.128791 CGI-09 protein 41.60 

122335 AA443258 Hs.241551 chloride channel, calcium activated, fam 30.80 

122612 AA974832 Hs.128708 ESTs 19.60 

123130 AA487200 gb:ab19f02.s1 Stratagene lung (937210) H 33.20 

123440 A1733692 Hs.1 12488 ESTs 23.17 

40 123596 AA421130 Hs.1 12640 EST 23.00 

123619 AA602964 gb:no97c02.s1 NCI CGAP_Pr2 Homo sapiens 28.80 

124006 A1147155 Hs.270016 ESTs 77.60 

124169 BE079334 Hs.271630 ESTs 22.20 

124281 AI333756 Hs.1 11801 arsenate resistance protein ARS2 42.20 

45 124472 N52517 Hs.102670 EST 32.60 

124617 AW628168 Hs.152684 ESTs 21.80 

124631 NM.014053 Hs.270594 FLVCR protein 30.40 

124839 R55784 Hs.140942 ESTs 21.20 

125186 AA610620 Hs.181244 major histocompatibility complex, class 42.80 

50 125321 T86652 Hs.178294 ESTs 27.00 

125535 NM_013243 Hs.22215 secretogranin III 23.80 

125646 AA628962 Hs.75209 protein kinase (cAMP-dependent, catalyti 23.20 

125684 AW589427 Hs.158849 Homo sapiens cDNA: FLJ21663 fis, clone C 21.20 

125724 AL360190 Hs.295978 Homo sapiens mRNA full length insert cDN 48.80 

55 125847 AW161885 Hs.249034' ESTs 31.00 

125934 AA193325 Hs.32646 hypothetical protein FLJ21 901 21.20 

126077 M78772 Hs.210836 ESTs 49.80 

126299 AW9791 55 Hs.298275 amino acid transporter 2 21 .80 

126395 AI468004 Hs.278956 hypothetical protein FLJ 12929 71.00 

60 126433 AA325606 gb:EST28707 Cerebellum li Homo sapiens c 23.20 

126509 R47400 Hs.23850 ESTs 23.80 

126538 AB030656 Hs.17377 coronin, actin-bin ding protein, 1C 23.10 

126666 AA648886 Hs.151999 ESTs 36.00 

126812 AB037860 Hs.173933 nuclear factor l/A 20.80 

65 126872 AW450979 gb:UI-H-BI3-ala-a-12-0-Ui.s1 NCLCGAP.Su 46.29 

127046 AA321948 Hs.293968 ESTs 22.80 

127431 AW771958 Hs.175437 ESTs, Moderately similar to PC4259 ferri 30.00 

127489 AA650250 Hs.272076 ESTs 20.80 

127521 AW297206 Hs.164018 ESTs 25.20 

70 127742 AW293496 Hs.180138 ESTs 28.00 

127925 AA805151 Hs.3628 mitogen-activated protein kinase kinase 21.20 

127930 AA809672 Hs.123304 ESTs 20.54 

127968 AA830201 Hs.124347 ESTs 28.20 

127987 AI022103 Hs.124511 ESTs 19.60 

75 128116 H07103 Hs.286014 Homo sapiens, clone 1MAGE:3867243, mRNA 20.40 

128609 NMJ)03616 Hs.102456 survival of motor neuron protein interac 34.40 

\2B777 AI878918 Hs.10526 cysteine and glycine-rich protein 2 53.80 

128949 AA009647 Hs.8850 a disintegrin and metalloproteinase doma 23.00 

129168 AI132988 Hs.109052 chromosome 14 open reading frame 2 37.60 

80 129404 AI267700 Hs.317584 ESTs 28.60 

129527 AA769221 Hs.270847 delta-tubulin ' 40.80 

129574 AA026815 Hs.1 1463 UMP-CMP kinase 31.20 

129598 N30436 Hs.11556 Homo sapiens cDNA FLJ 12566 fis, clone NT 29.60 

n _ 129785 H19006 Hs.184780 ESTs 72.20 

85 129970 AV655806 Hs.296198 chromosome 12 open reading frame 4 22.20 
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31.60 
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TABLE 4B shows the accession numbers for those primekeys lacking unigenelD's for Table 4A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools {DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
"Accession" column. 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey 

123619 
126433 
126872 

106851 
118720 
120515 
117099 
101447 
123130 



CAT number 



Accessions 



371681 1 AA602964 AA609200 
127143J AA325606 AA099517 N89423 

142696.1 AW450979 AA1 36653 AA1 36656 AW419381 AA984358 AA492073 BE1 68945 AA809054 AW238038 BE01 1212 BE011359 

BE011367 BE011368 BE011362 BE011215 BE011365 BE011363 
322947 1 AI458623 AA639708 AA485409 R22065 AA485570 
genbanklN73515 N73515 
genbank_AA258356 AA258356 
321871 1 H93699H97976H80036 
entrez_M21305 M21305 
genbank_AA487200 AA487200 
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Table 5A shows 680 genes up-regulated in squamous cell carcinoma or adenocarcinoma lung tumors relative to normal lung and chronically diseased lung. These genes were 
selected from 59680 probesets on the Eos/Afrymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average 
intensity (Al), a normalized value reflecting the relative level of mRNA expression. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title . , 

R1: 70th percentile of Al for squamous cell carcinoma and adenocarcinoma lung tumor samples divided by the 90th percentile of Al for normal and chronically 

diseased lung samples. 

R2: 80th percentile of Al adenocarcinoma lung tumor samples divided by the 90th percentile of Al for normal and chronically diseased lung samples. 

R3: 80th percentile of Al squamous cell carcinoma lung tumor samples divided by the 90th percentile of Al for normal and chronically diseased lung samples. 

R4: 80th percentile of Al adenocarcinoma lung tumor samples divided by the 80th percentile of Al for squamous cell carcinoma lung tumor samples. 

R5: 70th percentile of Al for squamous cell carcinoma and adenocarcinoma lung tumor samples minus the 1 5th percentile of Al for all normal lung, chronically 

diseased lung and tumor samples divided by 90th percentile of Al for normal and chronically diseased lung samples minus the 15th percentile of Al for all 

normal lung, chronically diseased lung and tumor samples 
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Unigene Title R1 R2 R3 R4 R5 

AFFX control: GAPDH 6.76 
AFFX control: GAPDH 5.77 
AFFX control: GAPDH 5.75 
Human GABAa receptor aIpha-3 subunit 8.00 

thymidylate synthetase 5.71 
KIAA0101 gene product 3.84 
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minichromosome maintenance deficient (S. 4.52 
phosphofructokinase, platelet 5.49 
proteasome (prosome, macropain) subunit, 5.67 
E2F transcription factor 3 2.55 

chaperonin containing TCP1, subumt 5 (e 5.66 
protein disulfide isomerase-related prot 3.81 
minichromosome maintenance deficient (S. 4.50 
platelet-activating factor acetylhydrola 5.07 

uridine monophosphate kinase 4.82 

KIAA0175 gene product 3.79 

amylase, alpha 2A; pancreatic 15.65 

RAN, member RAS oncogene family 5.49 

non-metastatic cells 2, protein (NM23B) 4. 1 7 

carcinoembryonic antigen-related cell ad 7.20 

prolactin-induced protein 14.20 

collagen, type VII, alpha 1 (epidermolys 3.10 
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mitogen-activated protein kinase kinase 20.60 
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zinc ribbon domain containing, 1 8.60 
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flap structure-specific endonuclease 1 7.99 
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ret proto-oncogene (multiple endocrine n 8.00 
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gb:Human proliferating cell nuclear anti 4.69 
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potassium voltage-gated channel, Shab-re 1 2.91 
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macrophage migration inhibitory factor ( 5.69 
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kallikrein B, plasma (Fletcher factor) 1 1 2.80 

proliferating cell nuclear antigen 3.24 

small proline-rich protein 1B (cornifin) 7.90 
keratin 5 (epidermolysis bullosa simplex 8.31 

bone morphogenetic protein 2 38.80 
glutamic-oxaloacetic transaminase 2, mit 4.01 
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v-ros avian UR2 sarcoma virus oncogene h 9.09 
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H2A histone family, member A 7.00 

growth arrest and DNA-damage-inducible, 7.60 
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chymase 1, mast cell 

bullous pemphigoid antigen 1 (230/240kD) 
desmoglein 3 (pemphigus vulgaris antigen 
solute carrier family 7 {cationic amino 
myeloid cell nuclear differentiation ant 
TTK protein kinase 

gap junction protein, beta 2, 26kD (conn 
collagen, type XVII, alpha 1 
mannose receptor, C type 1 
midkine (neurite growth-promoting factor 
nucleotide binding protein 1 (E.coli Min 
aldo-keto reductase family 1, member C1 
zinc finger protein 131 (clone pHZ-10) 
interleukin enhancer binding 'factor 2, 4 
sulfotransferase family, cytosolic, 2A, 
centromere protein A (17kD) 
laminin, beta 3 (nicein (125kD), kalinin 
secreted phosphoprotein 1 {osleopontin, 
JTV1 gene 

proteasome (prosome, macropain) 26S subu 
heterochromatin-like protein 1 
DEAD/H (Asp-GIu-Ala-Asp/His) box poiypep 
chromosome segregation 1 (yeast homolog) 
chromobox homolog 1 (Drosophila HP1 beta 
macrophage stimulating 1 (hepatocyte gro 
aldehyde dehydrogenase 3 family, member 
Bloom syndrome 

a disintegrin and metalloproteinase doma 
vascular endothelial growth factor C 
RAB27A, member RAS oncogene family 
enhancer of zeste (Drosophila) homolog 2 
ubiquitin fusion degradation 1-like 
preferentially expressed antigen in mela 
melanoma antigen, family A, 9 
G protein-coupled receptor 
Human hbc647 mRNA sequence 
CUG triplet repeat, RNA-binding protein 
eyes absent (Drosophila) homolog 2 
retinoblastoma-binding protein 8 
ubiquitin carrier protein E2-C 
forkhead box M1 

gb:Homo sapiens clone 14.9B mRNA sequenc 
chaperonin containing TCP1, subunit7 (e 
transcription factor AP-2 gamma (activat 
Homo sapiens cDNA: FLJ21930 fis, clone H 
neurotensin 

ubiquitin carboxyl-terminal esterase L1 
matrix metalloproteinase 3 (stromelysin 
keratin 15 

small nuclear ribonucleoprotein polypept 
v-ski avian sarcoma viral oncogene homol 
non-metastatic cells 1, protein (NM23A) 
multifunctional polypeptide similar to S 
matrix metalloproteinase 1 (interstitial 
CDC28 protein kinase 1 
matrix metalloproteinase 1 1 (stromelysin 
NADH dehydrogenase (ubiquinone) Fe-S pro 
cadherin 3, type 1> P-cadherin (placenta 
glutathione peroxidase 2 (gastrointestin 
transmembrane protein (63kD), endoplasmi 
transforming growth factor, alpha 
chaperonin containing TCP1, subunit 3 (g 
alcohol dehydrogenase 7 (class IV), mu o 
SMA5 

sine oculis homeobox (Drosophila) homolo 
coated vesicle membrane protein 
similar to rat HREV1 07 
pyrroline-5-carboxylate synthetase (glut 
proteasome (prosome, macropain) subunit, 
translocase of inner mitochondrial membr 
sal (Drosophila)-like 2 
aminopeptidase puromycin sensitive 
transcription factor NRF 
S100 calcium-binding protein A2 
phosphoserine phosphatase 
keratin 17 

polymerase (RNA) ll (DNA directed) polyp 
5T4 oncofetal trophoblast glycoprotein 
SRY (sex determining region Y)-box 2 
POU domain, class 2, associating factor 
gb:Homo sapiens full length insert cDNA 
hypothetical protein FLJ23468 
tubby super-family protein 
ESTs 
ESTs 
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hypothetical protein DKFZp566N034 

estrogen receptor binding site associate 6.80 
RNA polymerase I subunit 

cdk inhibitor p21 binding protein 6.80 
heme-regulated initiation factor 2-alpha 1 0.29 

hypothetical protein MGC4816 
nuclear receptor subfamily 1, group 1, m 
Homo sapiens cDNA: FLJ21933 fis, clone H 
Homo sapiens mRNA; cDNA DKFZp564D016 (fr 
ESTs, Highly similar to S60712 band-6-pr 
cAMP responsive element modulator 
NPD002 protein 
hypothetical protein FU 12691 
mitotic spindle coiled-coil related prot 
chromosome 20 open reading frame 1 
hypothetical protein FU12934 
hypothetical protein MGC14833 
HBV associated factor 
ESTs, Weakly similar to I38022 hypotheti 
hypothetical protein NUF2R 
ER01 (S. cerevisiae)-like 
cytoskeleton associated protein 2 
gb:zr57e08.s1 Soares_NhHMPu_S1 Homosapi 
hypothetical protein FU20287 
DiGeorge syndrome critical region gene 8 
Homo sapiens, clone IMAGE:41 79986, mRNA, 
paired box gene 5 (B-cell lineage specif 
sema domain, immunoglobulin domain (Ig), 
B-cell CLL/lymphoma 11B (zinc finger pro 
ESTs 

heat shock 90kD protein 1, alpha 
McKusick-Kaufman syndrome 
ESTs, Weakly similar to G02075 transcrip 
downstream neighbor of SON 
hypothetical protein FU 13352 
hypothetical protein FU 10439 
mitochondrial ribosomal protein L36 
ESTs, Weakly similar to ALU INHUMAN ALU S 
high-mobility group (nonhistone chromoso 
ESTs, Weakly similar to putative p 150 [ 6,60 
cleavage and polyadenylation specific fa 
hypothetical protein, estradiol- induced 
glutamate-cysteine ligase, catalytic sub 
tyrosylprotein sulfotransferase 1 4.75 
ESTs 

Homo sapiens mRNA; cDNA DKFZp564B076 (fr 
phosphoserine aminotransferase 28.00 
deleted in lymphocytic leukemia, 1 3.00 
CGl-07 protein 

hypothetical protein FU11269 
M-phase phosphoprotein 9 
ESTs, Weakly similar to ALU5.HUMAN ALU S 
KIAA1272 protein 

RAD51 (S. cerevisiae) homolog {E coli Re 4.71 
ESTs 

nucleolar protein 1 (120kD) 

flap structure-specific endonuclease 1 2.60 
KIAA1040 protein 
programmed cell death 2 

DKFZP586E1621 protein 9.50 
solute carrier family 6 (neurotransmitie 2.71 
Homo sapiens clone 2441 6 mRNA sequence 
fibrillarin 

nucleolar protein (KKE/D repeat) 
Homo sapiens, clone IMAGE:3603836, mRNA, 
EST 

keratin 6B 3.40 
Ig superfamily receptor LNIR 2.88 
hypothetical protein FLJ21 620 7.50 
protein kinase NYD-SP15 
ESTs 

hypothetical protein FLJ 12572 
hypothetical protein FLJ 11 210 
ESTs 

gb:zm61e06.r1 Stratagene fibroblast (937 
gb:zm86a08.r1 Stratagene ovarian cancer 
hypothetical protein DKFZp434l0428 
gb:zn13b09.s1 Stratagene hNT neuron (937 
gb;zl84c04.s1 Stratagene colon (937204) 
Homo sapiens cDNA FLJ 11 448 fis, clone HE 
hypothetical protein FU20285 
KIM1 077 protein 3.00 
ESTs 

ESTs, Moderately similar to 21 09260A B c 
collagen, type XVII, alpha 1 11.21 
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5.81 
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TABLE 5B shows the accession numbers for those primekeys lacking unigenelD's for Table 5A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
"Accession" column. 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey 

117079 
124305 
101502 
109792 
126034 
102768 
126345 
127066 
127099 
119243 
125875 
112054 
126979 
126992 
122318 
114699 
114793 
108305 
108393 
100867 
123731 
109700 
120715 
113702 
115113 
101045 
108554 
108573 
119052 
126522 
126605 
10376Q 



CAT number 



Accessions 



1621717J 
242183J 
18202 -6 
754958.1 
1598157J 
44641J 
1653833J 
1703458J 
244301.1 
1774795.1 
1566433.1 
1538292.1 
171411.1 
880655 1 
292419.1 
135322.1 
150742J 
111550 1 
113411.1 
tigr_HT4586 
genbank_AA609839 
genbank_F09609 F09609 
genbank_AA292700 
genbankJ-97307 T97307 
genbank u AA256460 
entrez_J05614 J05614 
genbank_AA084948 
genbank_AA086005 
149538 1 
416020.1 
439280 1 



H92325T97125 

AW963221 AA344870 M344871 H93331 
M26958 

R49625 F10674 
H60340N91637 
U82321 H66077 
N49713N49819W03810 
R25066 R20144 R20145 Z43845 
AA347668 AW956810 Z44271 F07065 F07064 R13506 
T12603T12604 
H14480N98295 
R43590F10439 
AA210954 AA211007 
AI809521 H12174 242556 
AA429743AA442754 
AA127386 R15644 AA1 27404 
AA158245 AA1 58235 
AA071391 AA069892AA069891 
AA075211 AA075245AA075126AA074946 
U14622 

AA609839 



AA2927QQ 



AA256460 



AA084948 
AA086005 
R10889 R10888 
W31912AI167491 

M676910 AA778853 AA778865 W86800 
46922 J W42667 AI580740 AI690440 AI561350 AW467906 AW1 51450 AI825927 AL041 716 AI885600 AI74221 3 AW248624 AI955498 AA033947 

AA845593 AI623711 N68583 C00064 AA1 93567 AW0B3868 AW163216 M191595 AA522778 AI628008 AI915518 AA843508 AI926195 
AA176265 AW167963 AA9921 15 W93647 AW 1 03572 AI 862994 AI342059 AA91 1719 AA176155 AA024712 AA069988 AA205591 AI591 107 
AJ199673AI811766AI275832AI422233AI191852AI096682AI580124A)683612AA582453AA927559AA486415T32414AI084978 H44849 
H44848 H20477 T91 695 W47039 AA070055 AA024795 AA328855 AA379248 M379330 AA385580 W25920 W03688 AA448359 AA093881 
AW362477 AA089997 A1350265 W93479 N99688 M932257 AW351469 H68590 AA663402 AA069771 AW087986 AI858420 M600214 
AI970774 A1857712 AI683081 AI885584AW1 31 150 AI 56798 1 AW0027 14 AW1 89973 AW075495AW1 68303 AA9537 14 AW51 6 881 AI357375 
AI566663 AW512676 A1570580 AI023690 AA44821 6 AI079853 AI422707 M77951 6 AW026972 AW130082 AW1 62307 AW438646 M709332 
AW192394 AI167350 AI217879 A1129152 AA719509 AI350480 AA66341 8 AI 003634 AW118546 M180261 AA442833 AI268625 AA888881 
AI038759 AA846723 AI248770 AA993694 AI280335 AI885107 AW51 8649 AA641 563 AA995835 AA582521 AI276744 AA436478 AI017360 
AI620763 AI859887 N73926 AI076327 AI74161 5 A11 60617 AW1 72819 AI492005 AA677429 AA996334 AI693771 AI950039 AI245629 AI28851 5 
AI8661 86 T93293 AA173262 AA599779 AI680092 AW439316 AI084555 AI272672 AI583507 AW47321 9 AA7381 32 AW473283 AI367492 
M995410 AI689624 AA206353 AI033095 AI040382 AA87363Q AI221074 A1934840 AI418680 AA844306 R94503 AA773520 AA843169 
AA21 9425 AA629658 AI81 1 71 9 AW41 1 275 AI590981 W37907 AI591 178 AI684051 AA983238 M669347 AA976239 AA704570 A1628339 
AI884391 AI241 580 AI003539 AW1 76687 AA009650 N34566 AI333493 A11 86070 M070827 AA41 1 683 AI280884 AA872023 M207255 
AA021576 N71953 A1885888 AW076039 T15777 AI537673 AW248048 H09554 W93480 W47001 AW0791 14 AA0631 60 AA757453 R60788 
AI859431 H20478 AA218882 AA757465 AA100995 AI864135 AI934209 AA070503 H47008 AA219646 W61039 W93907 AW385050 W37967 
W78028 M1 89007 M4791 36 R93650 AA44231 2 T30287 AA847628 AA1 80262 AA009649 C03892 AW149464 AA31 0963 AA21 9693 
AA069747 R29207 AA094784 AA29361 5 AA447848 AI9841 67 N90393 C05097 N56499 AW292351 AW1 49681 AW473258 AA629322 AI004409 
AW105577 AI954937 AI811070 AA902422 AW514437 AA535460 M916877 AW517122 AA974657 M975649 AW517130 AW517129 F31737 
W07688 AA1 93645 AA378994 AA489273 F32267 W39303 AA021 181 N86810 AA406524 AA062553 AA436801 H08985 H15979 N40310 
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AA436789 AA232172 Am6Q778 W25862 R60282 AA436530 AA378894 M187461 A1940535 AA60421 0 AA0895U M360421 N88243 N84281 
AA209340 N56174 N88374AA191088AW247691 AA249013 M093111 M972536 AW298594AA375893T12139 W28186 AW243849 
AI288629 AA843996 W15260 AH88286 AW248079 R15836 

1 1 9599 genbank_W45552 W45552 

112382 genbank_R59904 R59904 

105264 genbank_AA227934 AA227934 

100071 entrez_A28102 A28102 

123315 714071.1 AA496369 M496646 



Table 6A shows 99 genes up-regulated nonsmokers with lung cancer relative to smokers with lung cancer. These genes were selected from 59680 probesets on the 
Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (Al), a normalized value reflecting 
the relative level of mRNA expression. 

Pkey; Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1: average of Al for samples from non-smokers with adenocarcinoma divided by the 90th percentile of Al for samples from smokers with adenocarcinoma 

R2; average of A) for samples from non-smokers with squamous cell carcinoma divided by the 90th percentile of Al for samples from smokers with squamous cell 
carcinoma 
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TABLE 6B show the accession numbers for those primekeys lacking unigenelD's for Table 6A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
"Accession" column. 



Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT number Accessions 

108562 36375 1 AA 1 00796 AF020589 AA074629 M075946 AA1 00849 AA085347AA1 26309 AA0793 11 AA079323 AA085274 

103439 35330 J X98266N41124 

123551 genbank_AA608837 AA608837 

123861 genbank_AA620840 AA620840 

102832 entrezJJ92015 U92015 

101972 entrez_S82472 S82472 

121558 genbankJ\A412497 AA412497 
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Table 7A shows 98 genes down-regulated in non-smokers with lung cancer relative to smokers with lung cancer. These genes were selected from 59680 probesets on the 
Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (Ai), a normalized value reflecting 
the relative level of mRNA expression. 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

Unigene! D: Unigene number 

Unigene Title: Unigene gene title 

R1 : 90th percentile of A! for samples from smokers with adenocarcinoma divided by the average of AI for samples from non-smokers with adenocarcinoma. 

R2: 90th percentile of AI for samples from smokers with squamous cell carcinoma divided by the average of AI for samples from non-smokers with squamous cell 
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Hs.82547 
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AL079646 
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103242 


X76342 


Hs.389 
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AF002246 
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Hs.5462 


105024 
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AA449563 


Hs.151393 
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BE298210 
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Hs.21103 
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ESTs 70.20 

ESTs, Highly similar to unnamed protein 64.00 

ESTs 85.20 

ESTs 96.60 
ribokinase 

fibronectin leucine rich transmembrane p 

craniofacial development protein 1 66.80 

KIAA1357 protein 

KIAA1497 protein 64.20 

zinc finger protein 36 (KOX 18} 63.80 
stanniocalcin 2 

cadherin 13, H-cadherin (heart) 

KIAA1 467 protein 64.40 

Homo sapiens cDNA FLJ11640 fis t clone HE 76.20 
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TABLE 7B shows the accession numbers for those primekeys lacking unigenelD's for Table 7A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
"Accession" column. 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accessions 

103207 30635 -4 X72790 

106566 120358 1 BE298210 AI672315 AW086489 BE298417 AA455921 AA902537 BE327124 R14963 AA085210 AW274273 A1333584 A1369742 AI039658 

AI885095 AI476470 AI287650 A1885299 AI985381 AW592624 AW3401 36 AI266556 AA456390 AI31 081 5 AA484951 

116571 genbank_D45652 D45652 

118466 genbank_N66741 N66741 

101046 entrez„K01160K01160 

101941 entrez_S77583 S77583 

103351 entrez_X8921 1 X89211 

123130 genbank_AA487200 AA487200 
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Table 8A shows 1 720 genes either up or down-regulated In lung tumors or chronically diseased lung relative to a broad collection of over 40 distinct normal body tissues. 
Chronically diseased lung samples represent chronic non-malignant lung diseases such as fibrosis, emphysema, and bronchitis. These genes were selected from 39494 
probesets on the Eos/Affymetrix Hu02 Genechip array. Gene expression data for each probeset obtained from this analysis was expressed as average intensity (Al), a 
normalized value reflecting the relative level of mRNA expression. 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : 70th percentile of Al for lung tumors divided by 90th percentile of Al for normal lung 

R2: 70th percentile of Al for chronically diseased lung divided by 90th percentile of A( for normal lung 
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dynein, cytoplasmic, tight intermediate 
mucin 4, tracheobronchial 
synaptonemal complex protein 2 
CD3-epsilon-associated protein; antisens 
adaptor-related protein complex 4, epsil 
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Homo sapiens mRNA; cDNA DKFZp564J062 (fr 
sine oculis homeobox (Drosophila) homolo 
UDP-N-acetylglucosamine:a-1,3-D-mannosid 
nuclear cap binding protein subunit 2, 2 
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gb:xt68f05.x1 NCI_CGAP_Ut2 Homo sapiens 
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gb:xt66h02.x1 NCI_CGAP_Ut2 Homo sapiens 



0.45 


1,15 


0 11 

U.00 


0.84 


0.52 


0.94 


2.76 


3.65 


1.00 


1.00 


1.68 


1.50 


1.00 


2.11 


9.38 


13.08 


3.27 


7.24 


2.54 


1.88 


1.00 


0.91 


2.63 


2.67 


5.82 


9.34 


3.66 


3.18 


2.44 


6.77 


0.44 


0.84 


4.18 


5.64 


1.85 


0.92 


2.04 


2.13 


1.44 


1.89 


0.51 


1.10 


2.64 


4.87 


5.34 


2.68 


1.00 


121 


0,52 


1.24 


1.00 


1.00 


1.58 


1.02 


2.72 


6.85 


1.00 


4.32 


2.97 


0.93 


0.80 


0.95 


1.68 


5.04 


2.70 


7.98 


4,25 


8.13 


3.91 


8.68 


2.20 


2.73 


0.54 


1.02 


4.28 


11.57 


157 


2.38 


2*94 


4.68 


3.49 


6.31 


0.80 


2.74 


1.13 


0.77 


3.14 


10.68 


3.04 


8.24 


1,80 


1.92 


1.00 


1.00 


0.53 


0.67 


2.45 


2.62 


4.88 


8.61 


1.41 


186 


1.51 


119 


0,72 


0.76 


1.17 


3.90 


4.08 


6.46 


2.50 


4.37 


5.38 


8.38 


3.26 


4.08 


2.02 


1.83 


1.32 


3.95 


0.77 


0.53 


0.24 


0.63 


3.56 


6.22 


2.28 


3.17 


0.38 


102 


2.30 


1.00 


1.86 


4.48 


0.10 


0.80 


4.54 


9.65 


0.09 


0.04 


1.00 


1.72 


4.96 


9.14 


2.06 


2.02 


1.00 


124 


1.08 


143 


1.76 


1.31 


2.30 


2.57 


3.10 


5.79 


5.06 


11.86 


5.14 


7.31 


2.83 


4.06 


1.15 


2.35 


2.20 


9.35 


4.85 


6.28 


3.21 


4.07 



125 



WO 02/086443 



PCT/US02/12476 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



304008 


AW518198 


Hs.3297 


ribosomal protein S27a 


6.50 


11.08 


304009 


AW518206 


Hs.181165 


eukaryotic translation elongation factor 


1.88 


3.27 


304024 


T03036 




gb:FB21B7 Fetal brain, Stratagene Homo s 


2.15 


3.55 


304026 


T03160 




gb:FB26F2 Fetal brain, Stratagene Homo s 


5.88 


11.80 


304028 


T03266 




gb:FB7C1 Fetal brain, Stratagene Homo sa 


5.59 


13.46 


304rnfi 

JUtUull 


1 IUUJJ 


Hs 244621 


ribosomal protein S14 


6.55 


14.43 


304046 


T54803 




gb:yb42d06.s1 Stratagene fetal spleen {9 


6.18 


12.19 


304061 


T61521 




gb:yb73g01.s1 Stratagene ovary (937217) 


2.64 


8.23 


304063 


T62536 




gb:yc04c12.s1 Stratagene lung (937210) H 


0.53 


1.61 


304097 


R25376 


Hs.1 77592 


ribosomal protein, large, P1 


6.49 


11.67 


304114 


R78946 




gb:yi87gQ2.s1 Soares placenta Nb2HP Homo 


2.90 


4.18 


304122 


H28966 




gb:ym31a06,s1 Soares infant brain 1NIB H 


1.00 


2.76 


304155 


H68696 




gb:yr78b06.s1 Soares fetal liver spleen 


0.79 


1.18 


304203 


N56929 




gb:yy82d08.s1 Soares_multipIe_sclerosis_ 


4.28 


11.34 


304234 


W81608 




gb:zd88h06.s1 SoaresJetal_hearLNbHH19W 


6.47 


11.03 


304267 


AA064862 


Hs,73742 


ribosomal protein, large, P0 


1.34 


1.16 


304270 


M06971 1 


Hs.297753 


vimentin 


3.40 


5.40 


304287 


AA079286 


Hs.78466 


proteasome (prosome, macropain) 26S sub 


2.93 


4.42 


304348 


AA1 79868 




gb:zp38g12.s1 Stratagene muscle 937209 H 


3.98 


10.96 


304415 


AA290747 


Hs. 169476 


glyceraldehyde-3-phosphate dehydrogenase 


3.32 


5.99 


304430 


AA347682 




gb:EST54044 Fetal heart II Homo sapiens 


1.00 


1.00 


304456 


AA411240 




gb:zv26g05.s1 Soares_NhHMPu_S1 Homo sapi 


1.42 


3.33 


304521 


AA464716 




gb:zx82c11.s1 Soares ovary tumor NbHOT H 


2.18 


1.15 


304526 


AA476427 




gb:zx02c05.s 1 SoaresJotal_fetus_Nb2HF8_ 


5.38 


14.11 


304542 


AA482602 


Hs. 169476 


gIyceraldehyde-3-phosphate dehydrogenase 


4.16 


8.23 


304546 


AA486074 


Hs.297681 


serine (or cysteine) proteinase inhibito 


0.55 


1.20 


304607 


AA5 13322 




gb:nh85e08.s1 NCI_CGAP_Br1.1 Homosaplen 


1.95 


2.10 


304640 


AA524440 


Hs 111334 


ferritin, light polypeptide 


2.10 


2.83 


304650 


M527489 


Hs.3463 


ribosomal protein S23 


3.33 


12.62 


304735 


AA576453 




gb:nm75h11.s1 NCLCGAP_Co9 Homo sapiens 


1.33 


0.88 


304760 


M580401 




gb:nn13g09.s1 NCI_CGAP_Co12 Homo sapiens 


3.68 


8.14 


304849 


AA588157 


Hs. 13801 


KIAA1685 protein 


2.77 


3.70 


304917 


AA602685 


Hs.284136 


PRO2047 protein 


7,16 


11.01 


304921 


AA603092 


Hs.297753 


vimentin 


2.47 


4.24 


304966 


AA613893 


Hs.282435 


ESTs 


6.78 


11.66 


304987 


AA6 18044 


Hs 300697 


immunoglobulin heavy constant gamma 3 (G 


0.90 


1.23 


305016 


AA626876 




gb:zu89h06.s1 Soares_testis_NHT Homo sap 


6.46 


10.17 


305034 


AA630128 




gb:ab99c04.s1 Stratagene lung (937210) H 


1.00 


1.00 


305079 


AA641012 




gb:nr72a12.s1 NCI_CGAP_Pr24 Homo sapiens 


5.68 


11.59 


3051 1 1 


AA644187 


Hs 303405 


ESTs 


1.48 


1.37 


305148 


AA654070 




gb:nt01g08.s1 NCLCGAPJ_ym3 Homo sapiens 


1.76 


4.61 


305159 


AA659166 


Hs.275668 


EST, Weakly similar to EF1D_HUMAN ELONG 


1.00 


2.15 


305190 


AA665955 




gb:ag57d1 2.s1 Gessler Wilms tumor Homo s 


5.31 


8.14 


305232 


AA670052 


Hs. 169476 


g lyceratde hyde-3-phosph ate de hydrogenase 


0.78 


1.18 


305235 


AA670480 




gb:ag37e01.s1 Jia bone marrow stroma Horn 


3.11 


8.66 


305245 


AA676695 


Hs.81328 


nuclear factor of kappa light polypeptid 


4.38 


7.53 


305312 


AA700201 




gb:zj44f07.s1 Soares _fetal_liver_spleen_ 


2.13 


2.66 


305322 


AA701597 


Hs.163019 


EST 


1.20 


1.40 


305394 


AA720942 


Hs.300697 


immunoglobulin heavy constant gamma 3 (G 


1.16 


0.68 


305413 


M724659 




gb:ai 1 0fOS.s 1 Soares _parathyro)d_iumor_N 


5.86 


9.87 


305447 


AA737856 




gb:nx10c08.s1 NCLCGAP_GC3 Homo sapiens 


2.21 


2.86 


305476 


AA745664 


Hs. 287445 


hypothetical protein FLJ11726 


3.36 


6.54 


305483 


AA748030 


Hs.303512 


EST 


1.00 


2.02 


305528 


AA769156 




gb:nz12e05.s1 NCI_GGAP_GCB1 Homo sapiens 


6.44 


9.10 


305612 


AA782347 


Hs 272572 


hemoglobin, alpha 2 


0.19 


0.79 


305614 


AA782866 




gb:aj09h02.s1 Soares_parathyroidJumor_N 


1.00 


1.00 


OUvJU 1 u 


AA782884 


Hs 275865 


ribosomal protein S18 


7.57 


10.20 


305617 

uuOUJl 


AA8061 24 




gb:oe29a12.s1 NCI_CGAP_Pr25 Homo sapiens 


4.78 


12.42 


305639 


AA806138 




gb:o©29c12.s1 NCLCGAP_Pr25 Homo sapiens 


0.89 


0.70 


305650 


AA807709 




gb:nw31e04.s1 NCI_CGAP_GCB0 Homo sapiens4.49 


8.71 


305690 


AA8 13477 




gb:ai67a05.s1 Soares_testis_NHT Homo sap 


4.91 


9.40 


305726 


AA828156 


Hs.73742 


ribosomal protein, large, P0 


0.19 


0.81 


305728 


AA828209 




gb:of34a02.s1 NCI_CGAP_Kid6 Homo sapiens 


5.12 


9.29 


305759 


AA835353 




gb:ak72b06.s1 Barstead spleen HPLRB2 Horn 


1.66 


4.11 


305792 


AA845256 




gb:ak84a08.s1 Barstead spleen HPLRB2 Horn 


2.34 


4.25 


305864 


AA864374 


Hs.73742 


ribosomal protein, large, P0 


0.30 


1.40 


305901 


AA872968 




gb:oh63h08.s1 NCI_CGAP_Kid5 Homo sapiens 


2.10 


5.21 


305910 


AA875981 




gb:nx21h02.s1 NCI_CGAP_GC3 Homo sapiens 


0.32 


1.01 


306015 


AA897116 




gb:am08b07.s1 Soares_NFU_T_GBC_S1 Homo s1. 56 


1.12 


306017 


M897221 


Hs. 109058 


ribosomal protein S6 kinase, 90kD, polyp 


5.21 


7.90 


306020 


M897630 


Hs. 130027 


EST 


1.96 


6.59 


306063 


AA906316 




gb:ok03g03.s1 Soares_NFL_jLGBC_S1 Homos 


7.38 


20.69 


306065 


AA906725 




gb:ok78g02.s1 NCI_CGAP_GC4 Homo sapiens 


7.19 


13.48 


306104 


AA910956 




gb:ok85h1 1.s1 NCLCGAP_Kid3 Homo sapiens 


6.50 


9.13 


306109 


AA911861 




gb:og21a07.s1 NCI_CGAP_PNS1 Homo sapiens 


4.21 


5.25 


306148 


AA917409 


Hs.288036 


tRNA isopentenylpyrophosphate transferas 


2.20 


2.70 


306242 


AA932805 




gb:oo60g04.s1 NCI CGAPJ.U5 Homo sapiens 


2.84 


5.35 


306288 


AA936900 




gb:oi53h05.s1 NCf_CGAP_HN3 Homo sapiens 


1.60 


1.12 


306325 


AA953072 


Hs.210546 


interleukin 21 receptor 


1.65 


2.26 


306353 


AA961382 


Hs.275865 


ribosomal protein S18 


3.78 


6.32 


306375 


AA968650 


Hs.276018 


EST, Moderately similar to JC4662 ribos 


4.30 


5.74 


306396 


AA970223 




gb:op09d05.s1 NCI_CGAP_Kid6 Homo sapiens 


0.95 


2.45 


306428 


AA975110 


Hs.191228 


hypothetical protein FLJ20284 


3.19 


4.10 


306442 


AA976899 




gb:oq35e09.s1 NCI_CGAP_GC4 Homo sapiens 


4.67 


7.44 


306446 


AA977348 




gb:oq72e12.s1 NCI_CGAP_Kid6 Homo sapiens 


3.92 


6.27 



126 



WO 02/086443 



PCT/US02/12476 



5 
10 
15 

20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



oriR/RR 

OUD400 


AAQ7R1RR 
AA3/0 I0D 




qnfi/R7 
OU040 / 


AAyooouo 


ns. 1 oooyo 


onccm 
OUDOIU 


A AQRR5/R 
AAy00D40 




OUuOOO 


AAyy40U4 


He 97fiHftO 
ns. Zf ouoo 


oriRRR7 
OUOODf 


A AQQ/ Rqn 

AAyy40ou 




Onfi579 
JUDO/ Z 


AAQQRRRR 

AAyyoooo 




OUOOoZ 


A A QQ ROAR 

AAyyoz40 




ouuoyo 


Ainnnqon 

AIUUUOZU 


U5 1RQ/7R 
no. ioy*tf 0 


OUODUO 


AIUUU4y/ 


no. 1 1 30UU 


oneecc 
OUODOD 


AIUU4UZ4 




qnfifi7fi 
0UOD/ D 


AIUUOOUO 


Ue Ofl/IOR 
rlo.ZOH 1 00 


q/IRRRR 
OUODOD 


AlfHRRI 5 
AIU I 00 IO 




qflR7n9 
OUO/ UZ 


AI0995R5 
AIUZZOOO 


Lie qn7R7fl 
no.OU/D/U 


0HR79R 
0U0/ ZD 


AIH97qRQ 

Aiuz/ooy 


He 979579 
no.Z/ZO/Z 


OUo/ol 


AlflqORRQ 




0OR7R7 
0UD/O/ 


Aiuooyoo 


Ua 9401 -fQ 
n&.Z43 1 10 


QHRROO 

ouooyz 


Aiuyz*fDo 






Atuyoyof 




oUDoOD 


Al JZO 1 1 1 




OHRQCR 

oUDaoo 


Al I ZD IOC 




ori7noR 


A H A 077 A 
Al I H£.l f 


Mc 1 1Q199 
no. 1 I y I ZZ 


qry7fl/1 

0U/U4 I 


Al I**hZ**0 




OUf uy I 


ai lo/Hoy 




on7i ri 

OUf IO I 


ai i oyzo i 




qfV79Q7 

ou/ zy / 


A|0flR7QP 

Aizuo/ yo 


Uq 11100/ 

no. I I IOO*f 


qfY7017 
OU / 0 1 / 


AIZUOOUO 


Ue 1/7000 
no. 14/ 000 


0fY7007 


AIZ1414Z 


Uc 9/R0R1 
r1S.Z4000 1 


oOYobY 


A 1000 ICQ 

AlzzOlOo 


Up 1/7RRC 
MS. 14/000 


307410 


AIZ41 f 10 


77H0Q 

ns. / / uoy 


'ifYTA 1 K 

OUf 410 


AIO/011R 
AIZ4Z I lo 




0/Y7/00 
0U/4Z0 


AlO/lQOnR 
AIZ40ZUD 


Uc 17QR70 

ns. i /yo/o 


o/w/oc 
oU/4zo 


AIZ400O4 




on7EM 7 
0U/O1 / 


A107RnR<? 
AIZ* OUOO 




OU/OD 1 


AlORIKqR 
AIZO IOOD 




q(Y75R1 


A10R00n7 
AlZOZZUf 




qry7RnR 
OU/DUD 


Aizyuzyo 




QH7RK7 
OU/ DO/ 


Aiqnc/oR 

AIOUD^ZO 


M«s 9QR9R9 
Do.ZyOZDZ 


qfV7RQ1 

ou/ oy i 


AI01PORC 
AIO 1 OZOO 




OU/ f U I 


AIO IOOOO 


Ue 97RR79 

nstZf 00/z 


q/Y771 R 
OUf / 10 


Aiqqq/nR 

AI0004UD 


ns. 00/00 


on77on 

0\Jf (Ov 


AiqqcnQO 
Aiooouyz 




OUf / DU 


Aiq/oqQ7 
AlO^ZOOf 




o/Y77R/ 
OUf /04 


A 10/0701 
AlO^Zf 0 I 




0/Y77RO 
OUf / 00 


A 1 0/797/ 
AIO**/ Zf 4 




0/Y77QR 
OUf fvO 


AIORHRRR 
AI00UOOD 




Of"17Qn7 

oU/oU/ 


AIOK17QQ 

ai jot /yy 




OU/oUo 


AIOC1POR 
AIOOJOZD 




OU/ ozU 


AIOCC7R1 
AlOOO/Dl 




o/Y7Ron 
JU/ooU 


AI0KR700 
AlOOo /ZZ 


Ue 97R707 
nS.Z/O/ Of 


OfY7RKO 
OUf OOZ 


AI0R^c;/1 
AI00004 I 




ou/yu^ 


AlOflri/lRO 
AlOOUtDZ 




ouf yy/ 


AlfOHO IZ 


Ue 1R11fi£; 
no. lull 00 


OUoUUZ 


Ai/OKOxn 
AI400Z4U 


Ue 0R0//0 
I1S.Z0044Z 


OUoU 1 1 


AMOQ/70 




onflnoo 
OUoUZO 


AJ/C0709 
A140Zf OZ 


U c 9R1R77 
Do.ZO 10/ / 


onRftAi 
0UoU41 


AUCRROA 
AI^O0OZ*t 


Ue 1 RQ/7R 
no. 1 004/0 


OUouoy 


AI/RPQOR 

AI40oy0o 


Uc 97RR77 
no.Zf 00 1 / 


OUoUoD 


AI/7/10R 
AI4f 4100 


Uc 1R11RR 

ns. 10 1 1 do 


OUolUl 


AI/7CORn 
AI4f OaOU 


Ue 1R11R5 
no. loll 00 


oUolUb 


AI/7RflnO 
AI4f DOUO 




Q-flQI 00 

OUolZZ 


AMRn-100 
Al4aU1 ZO 


Ue ono/n 
ns.ouy4 1 1 


onfli c/ 
0Uolo4 


AIOUUOUU 




0DR171 
OUO I f I 


AIRO0R0O 
AIOZOOOZ 


He 9QR7RR 
no.zyo/OD 


OflROI 1 
OUoZ 1 1 


AlOOf UZo 


He 97RK79 

ns.z/oo/z 


OfkQOI 0 

OUozlO 


AlOOf U41 




OftflO-lR 
OUOZ 1 D 


AIRK710R 
AlOOf 100 




qnfl91Q 

ouoz I y 


AlOOf Z*fD 




OUOZ/ I 


AIRR7R// 
A10D f 0*f*t 


Ue 9R99RQ 

ns.zozzoy 


q/ifloiQ 
ouoo i y 


Aioooyoo 


He 1R11R5 
no. IO I 1 00 


ouoooz 


AIRIORiq 

aid 100 1 y 


He m«v74Q 
no. i vol 4y 


onfl/ 1 0 
0U04 I 0 


A|R0R9t;q 

AIOODZOO 


He 1QRR11 
no. 1900 1 1 


qflR/RA 
OU04DU 


AlfifinRfifl 
AIOOUOOU 


He QRRZ.0 
no.yoo4U 


OOR/R/ 
0Uo404 


AIR79/9C 
AID/ ZHZO 


He 9771 17 
nS.Zf /II/ 


OUoooo 


AI71R9QQ 

Aif lozyy 




ouooyy 


A171QRQ0 

Aif i yoyo 




OUoolo 


A170RCQ0 
ai/ oooyo 


Ue 10177/ 
MS. IU1 / /4 


Oflflft/O 
0U0O40 


AI/*tOUHU 




OflflG70 
OUOO/O 


Al7RnRR4 
Al/OUOOt 




qf)RRQ7 


Al7R714q 




308762 


A1807405 


Hs.259408 


308778 


AI811109 




308782 


AI811767 


Hs.2186 


308808 


AI818289 




308823 


AI824118 


Hs.217493 


308875 


AI832332 
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gb:ou57e08.s1 NGI_CGAP_Br2 Homo sapiens 
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hypothetical protein FLJ23045 
gb:tr19a12.x1 NCI_CGAP_Ov23 Homo sapiens 
gb:wi09c10.x1 NCLCGAP_CLL1 Homo sapiens 
gb:wi97a07.x1 NC!_GGAP^Kid12 Homo sapien 
ESTs 

gb:tr04c11.x1 NCI_CGAP_Ov23 Homo sapiens 
eukaryotic translation elongation factor 
gb:wk52c01.x1 NCI_CGAP_Pr22 Homo sapiens 
annexin A2 

gb:at48g03.x1 Barstead colon HPLRB7 Homo 



3.35 


5.77 


^ 70 
O. f z. 


5.37 


1.00 


1.00 


6.61 


10.91 


1R 9n 

ID.ZU 


31.83 


9 51 
z.o 1 


6.52 


1.42 


3.13 


4.91 


8.68 


1.96 


8.60 


U. I I 


0.45 


9.56 


17.28 


1.86 


3.60 


1 A7 

1.4/ 


1.19 


1 9R 
J.ZO 


9 fli 
z.00 


q qi 
o.y i 


5 91 
o.z 1 


3.33 


6.06 


3.77 


7.46 


9 19 
Z. IZ 


2.85 


6.10 


10.52 


1.72 


1.56 


2.00 


4.70 


9.12 


12.56 


4.88 


8.52 


3.55 


6.44 


2.46 


4.65 


5 R4 
0.04 


10.13 


q 1R 
0. IO 


r 15 
0. 10 


9 no 

Z.UZ 


q 7q 

0.(0 


n 79 
u. /z 


0.48 


9 qR 

Z.00 


1 51 
0.0 1 


9 Rn 
z.ou 


5.44 


1 1R 
0. 10 


7.67 


1.00 


1.00 


3.40 


11.20 


4.74 


15.51 


3.50 


7.19 


1.76 


2.44 


1.59 


1.31 


1.90 


2.13 


0.45 


0.99 


1.51 


0.99 


1 nn 


1.00 


A 59 
4.0Z 


12.58 


1 AO 
I.4Z 


1.00 


ft R7 
0.0/ 


9 61 


0.00 


7.68 


n w 
u.oo 


0.86 


7.94 


21.57 


2.05 


3.32 


3.18 


5.21 


3.13 


4.99 


1 nn 

I.UU 


3.01 


5 RR 

0.00 


12.64 


^ 7Q 
0. f y 


5.83 


n ^R 

U.00 


0.88 


4.36 


6.06 


1.80 


1.98 


*3 ^R 

0.00 


4.14 


1 ^n 

I.OU 


3.87 


or 
.00 


8.72 


2.70 


3.86 


0.66 


1.33 


9 AR 
Z.40 


4.86 


2.43 


2.14 


3.34 


3.79 


A R1 

4.0 1 


4.78 


4.87 


7.94 


2.40 


6,35 


2.45 


3.33 


1.24 


1.41 


3.16 


4.82 


1.79 


2.68 


4.87 


8.27 


^ on 

0.3U 


5.64 


3.32 


5.12 


0. 1 I 


2.36 


0 QR 
0.30 


3 RQ 
o.oa 


n R9 
U.OZ 


0.99 


2.76 


5.59 


3ll7 


6^30 


1.00 


1.00 


2.94 


5.15 


4.41 


8.34 


1.85 


1.92 


2.52 


3.80 



127 



Hs.177 



Hs.78202 
Hs.1 19598 
Hs.265884 

Hs.1 80842 



Hs.299426 



WO 02/086443 

308879 AI832763 Hs.75968 

308886 AI833240 

308898 AI858845 

308934 AI865023 

5 308966 AI870704 

308979 AI873111 

309045 AI910902 

309051 AI911975 

309069 AI917366 

10 309083 AI922426 

309105 A1925503 

309122 AI928178 

309128 AI928816 

309164 AI937761 

15 309177 A1951118 

309288 AI991525 

309299 AW003478 

309303 AW004823 

309411 AW085201 Hs.244144 

20 309437 AW090702 Hs.278242 

309459 AW117645 Hs.65114 

309476 AW1 29368 

309499 AW1 36325 Hs.279771 

309529 AW150807 Hs.181357 

25 309532 AW151119 

309626 AW192004 Hs.297681 

309641 AW194230 Hs.253100 

309675 AW205681 Hs.253506 

309693 AW237221 Hs.181357 

30 309695 AW238011 Hs.295605 

309700 AW241170 Hs.179661 

309747 AW264889 

309769 AW272346 

309782 AW275156 Hs.156110 

35 309783 AW275401 Hs.254798 

309799 AW276964 

309866 AW299916 

309903 AW339071 Hs.300697 

309923 AW340684 

40 309928 AW341418 

309931 AW341683 

309933 AW341936 

309964 AW449111 Hs.257111 

310002 AI439096 Hs.323079 

45 310096 AW1 36822 Hs.1 72824 

310098 AI685841 Hs.161354 

310109 AI203094 Hs.148633 

310112 AW197233 Hs.147253 

310115 A1611317 Hs.223796 

50 310121 AW195642 Hs.148901 

310146 A1206614 Hs.197422 

310193 AI627653 Hs.147562 

310255 AW450439 Hs.1 53378 

310261 AI240483 Hs.201217 

55 310264 A1915771 Hs.74170 

310275 AI242102 Hs.213636 

310282 AI243332 Hs.156055 

310290 AW013815 Hs.149103 

310333 AI253200 Hs.145402 

60 310346 AI261340 Hs.145517 

310385 A1263392 Hs.156151 

310443 AW119018 Hs.164231 

310444 AW1 96632 Hs.252956 
310446 AI275715 Hs.145926 

65 310468 AI984074 Hs.196398 

310477 AI948801 Hs.171073 

310512 AW275603 Hs.200712 

310514 AI681145 Hs.160724 

310524 AW082270 Hs.12496 

70 310547 AI302654 Hs.208024 

310584 AI653007 

310608 AI962234 

310624 A1341594 

310636 AI814373 

75 310648 AI347863 

310694 A1654370 

310695 AI472124 
310714 A1418446 
310722 AI989803 

80 310756 AI916560 

310764 AI376769 

310848 A1459554 

310851 AW291714 Hs.221703 

310854 AI421677 Hs.161332 

85 310858 AI871000 Hs.161330 



Hs.156304 
Hs.196102 

Hs.164175 
Hs.156672 
Hs.157752 
Hs.157757 
Hs.157882 
Hs.157289 
Hs.158707 
Hs.1 671 72 
Hs.161286 



thymosin, beta 4, X chromosome 
gb:at76d10.x1 Barstead colon HPLRB7 Homo 
gb:w!32d10.x1 NCLCGAPJJtl Homo sapiens 
phosphatidylinosito) glycan, class H 
gb:wI47h01.x1 NCI_CGAP_Ut1 Homo sapiens 
gb:w!52h05.x1 NCI_CGAP_Brn25 Homo sapien 
gb:tq39f01.x1 NCLCGAPJJtl Homo sapiens 
gb:wd78d01.x1 NCLCGAP_Lu24 Homo sapiens 
SWI/SNF related, matrix associated, act 
ribosomal protein L3 
ESTs 

gb:wo95a11.x1 NCI_CGAP_Kid11 Homo sapien 
ribosomal protein L13 

gb:wp84b09.x1 NCI_CGAP_Brn25 Homo sapien 
gb'.wx63g05.x1 NCI_CGAP_Br18 Homo sapiens 
ESTs 

gb:wq66c06.x1 NCLCGAPJ3C6 Homo sapiens 
gb:ws93a08.x1 NCl_CGAP_Co3 Homo sapiens 
EST 

tubulin, alpha, ubiquitous 
keratin 18 

gb:xe14b05.x1 NCI_CGAP_Ut4 Homo sapiens 
Homo sapiens clone PP1596 unknown mRNA 
lamlnin receptor 1 (67kD, ribosomal pro 
gb:xg33e10.x1 NCLCGAPJJtl Homo sapiens 
serine (or cysteine) proteinase inhibit 
EST, Moderately similar to GHHU Ig gamm 
EST, Moderately similar to ATPN JHUMAN A 
laminin receptor 1 (67kD, ribosomal prot 
mannosidase, alpha, class 2A, member 2 
tubulin, beta polypeptide 
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gb:xs13c10.x1 NCLCGAP__Kid11 Homo sapien 
immunoglobulin kappa constant 
EST 

gb:xp58h01.x1 NCl_CGAP_Ov39 Homo sapiens 

gb:xs44c01.x1 NCi_CGAPJ<id11 Homo sapien 

immunoglobulin heavy constant gamma 3 (G 

gb:hd05g08.x1 Soares_NFLJLGBC_S1 Homo s 

gb:hd08c03.x1 Soares_NFL_T_GBC_S1 Homos 

gb:hd13d01.x1 Soares_NFL_T„GBC„S1 Homos 
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ESTs, Weakly similar to B48013 proline-r 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 
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ESTs 
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metallothionein 1E (functional) 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 
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ESTs 
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ESTs 

ESTs, Highly similar to AC004836 1 simil 
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ESTs 

ESTs 

gb:Human endogenous retrovirus H proteas 
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Homo sapiens mRNA full length insert cDN 

ESTs 

ESTs 
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ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 



3.38 


7.96 


3.06 


2.65 


2.45 


3.44 


4.14 


6.76 


1.00 


1.00 


7.15 


11.10 


0.61 


0.59 


1.78 


4.42 


3.27 


5.88 


2.39 


3.34 


5.54 


17.78 


1.00 


2.92 


1.38 


5.55 


2.43 


3.11 


0.81 


0.97 


4.86 


7.46 


4.36 


9.43 


2.88 


7.54 


4.30 


7.14 


2.49 


3.11 


2.88 


4.55 


2.08 


6.60 


2.82 


3.55 


4.78 


3.95 


1.18 


4.40 


4.46 


12.06 


147 


1.39 


5.68 


15.20 


1.00 


1.00 


5.45 


9.61 


1.41 


1.25 


5.00 


8.35 


5.76 


11.90 


0.42 


0.69 


1.00 


4.11 


1.68 


1.44 


3.02 


5.04 


1.05 


1,18 


2.30 


3.67 


7.41 


13.71 


1.20 


12.70 


4.90 


18.29 


1.99 


3.07 


0.20 


0.47 


1.51 


1.22 


0.31 


0.76 


2.06 


5.83 


2.92 


3.55 


1.25 


0.84 


1.00 


2.71 


9.50 


15.31 


2.85 


4.18 


4.26 


10.63 


3.28 


4.40 


0.26 


0.86 


5.43 


8.19 


3.15 


8.06 


2.19 


3.12 


1.17 


1.91 


4.81 


9.95 


5.96 


7.79 


2.90 


4.63 


0.85 


1.01 


2.18 


3.85 


3.39 


5.19 


1.00 


1.00 


3.87 


8.12 


3.30 


7.33 


0.72 


1.44 


3.26 


3.46 


2.39 


4.08 


5.60 


6.49 


4.91 


9.09 


1.85 


1.71 


0.17 


0.69 


5.40 


13.22 


4.82 


6.27 


1.76 


3.51 


1.14 


6.85 


8.46 


13.01 


4.76 


7,37 


2.84 


1.96 


1.00 


2.32 


6.37 


7.94 


6.07 


9.84 



PCT/US02/12476 



128 



WO 02/086443 



PCT/US02/12476 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



80 



85 



110RR4 


Al9945 ( iR 


Hs 161399 


ESTs 


0.87 


0.78 


0 IUO/U 


T47764 


Hs 119917 
no. i oici i f 


ESTs 


1.00 


3.63 


110896 


AW1 57731 

nYi lui f o i 


Hs 270982 


ESTs Moderate! v similar to ALU7 HUMANA 


7.07 


16.68 


310922 


AW1 95634 


Hs 170401 


ESTs 


1.00 


1.00 




AI5R0910 


Hs 263912 


ESTs 


10.08 


17.66 


11 09*17 


AW1 90974 

f\VV 1 3U3 1 t 


Hs 196918 


ESTs 


2.18 


3.18 


mono 


Al 591 810 


Hs 171050 


ESTs 


3.06 


6.64 


311012 


AW298070 


Hs 241097 


ESTs 


1.23 


3.77 


311034 


Al 564023 


Hs 311389 


ESTs, Moderately similar to PT0375 natur 


2.44 


2.09 


311074 




Ue 1QQ848 


ESTs 


6.04 


14.19 


311134 


AIQ90R4Q 


Hs 19R971 


ESTs 


3.54 


6.96 


01 1 1 /*♦ 


AW450559 


Hs 905457 


npriayin 

UOllOAIIJ 


0.65 


0.95 


311187 


Al 63 8374 


Hs.224189 


ESTs 


246 


2.78 


311220 


AI656D4Q 


Hs 196532 


ESTs 


1.10 


2.52 


311230 


AI989808 


Hs. 197663 


ESTs 


1.41 


1.75 


311236 


Al 653378 


Hs. 197674 


ESTs 


2.18 


2.11 


311242 


AW016812 


Hs.200266 


ESTs 


0.63 


5.11 


311258 


AI671221 


Hs 199887 


ESTs 


1.00 


1.41 


311277 


AW072813 

nvvui tu I u 


Hs 270868 


ESTs, Moderately similar to ALU4_HUMAN A 


2.56 


1.94 


311294 


AA826425 


Hs 291829 


ESTs 


1.04 


2.69 


311308 


F12664 


Hs 49Q00 


ESTs 


1.96 


6.70 


311351 


AI682303 


Hs.201274 


ESTs 


4.77 


9.38 


11 1 19(1 


AW192Q97 

rtVVUOi.U\7 f 


Hs 202280 


ESTs 


2.80 


6.06 


311405 


AW290961 


Hs.201815 


ESTs 


3.80 


11.66 


311409 


AI698839 




gb:wd31f02.x1 Soares_NFL_T_GBC_S1 Homos 


3.84 


6.94 


311420 


AI91R991 


Hs 209867 


ESTs 


5.30 


12.56 


311443 


AI791591 

Ml / O IJ*- 1 


Hs 192206 

no. 1 3t£UU 


ESTs 


4.39 


6.09 


311467 


AI914909 


Hs 175377 


ESTs 


1.00 


1.04 


311479 


AIQ11R79 


no.t l l ocJ3 


ESTs 


2.76 


5.61 


311488 


R57390 


Hs 301064 


arfaptin 1 


2.50 


5.73 


31 1495 


AW100D77 


Hs 221358 


ESTs 


3.63 


6.09 


311511 


AW4445RR 
r\ v v Trt u u u 


Hs 210303 


ESTs 


2.00 


2.87 


311534 


AW1 30351 


Hs.243549 


ESTs 


0.31 


1.33 


31 1 537 


AI805121 


Hs.211828 


ESTs 


3.69 


5.85 


311543 


AI681360 


Hs.201259 


ESTs 


1.73 


1.34 


311551 


AW449774 


Hs.296380 


POM (P0M121 rat homolog) and ZP3 fusion 


3.31 


6.12 


311557 


AI819230 


Hs 211238 


interleukin-1 homolog 1 


1.00 


1.00 


311558 


Z44432 


Hs.63128 


KIAA1292 protein 


2.25 


3.41 


0 1 1 333 


AW008271 


Hs 265848 


similar to rat mvomeoalin 


2.68 


5.90 


311563 


AI922143 


Hs 211334 


ESTs 


2.39 


3.32 


1115RR 

O I IOOU 


AIR97R14 

MlQ£ f Out 


Hs 211227 


ESTs 


2.47 


3.85 


311616 


AW450R75 

AAVVtUUU/ v* 


Hs 212709 


ESTs 


1.00 


1.00 


311621 


AIQ94107 


H<5 9114R4 


ESTs 


4.16 


6.74 


infill 


AI99R45R 


Hs 2110R1 


ESTs 


2.17 


3.76 


111RRR 
0 1 luuo 


AW191R74 
MVV 1 SOU/ n 


Hs 940044 


ESTs 


2,60 


3.12 


311672 


R11807 


Hs 20914 


hypothetical protein FLJ23056 


2.79 


5.18 


111681 

0 1 1 UUU 


AW181718 

/AW lUwf UU 


Hs 212644 


ESTs 


0.19 


0.96 


311700 


R49601 


Hs. 171495 


retinoic acid receptor, beta 


6.28 


8.83 


311714 


AW1117R5 


Hs 246831 


ESTs, Weakly similar to CIKGJHUMAN VOLTA 


5.00 


8.17 


311735 


AW29441 6 


Hs 144687 


Homo sapiens cDNA FLJ12981 fis, clone NT 


0.96 


0.72 


311743 


T99079 


Hs.191194 


ESTs 


1.00 


1.95 


311783 


AIRR947R 


Hs 13528 


hvnnthptifal nrotein FLJ 14054 

Hj^pL'LJIV'UvOI fjl U toil 1 1 l—\J IfWtfl 


0.16 


0.77 


311785 


AI056769 


Hs 133512 


ESTs 


1.34 


3.97 


311799 


AA7R07Q1 

fV\l ou/ »? 1 


Hs 14014 

I lo. t*tu i*t 


ESTs, Weakly similar to KIAA0973 protein 


8.52 


13.32 


111R19 


rtWiiuucf O 


Hs 254325 


ESTs 


3.58 


3.91 


311823 


AI0R9499 


Hs 111297 


ESTs 


1.40 


1.72 


311877 


AA149RQ1 


Hs 85139 


G protein -coupled receptor 39 


0.95 


0.91 


111RR6 

O 1 1 uuu 


AA59971R 


Hs 132554 


ESTs 


0.88 


0.87 


111R96 


AW90R447 

/AVV&UUT^t/ 




gb:UI-H-BI1-afg-g-02-0-Ul.s1 NCI_CGAP__Su 


1.66 


1.13 


311910 


N28365 


Hs 22579 


Homo sapiens clone CDABP0036 mRNA sequen 


1.66 


2.30 


311923 


T60843 


Hs. 189679 


ESTs 


0.42 


2.63 


311933 


AI597963 


Hs.1 18726 


ESTs 


1.88 


3.02 


311959 


T67262 


Hs 124733 


ESTs 


2.02 


2.33 


311960 


AW4401 33 


Hs 189690 


ESTs 


3.87 


6.62 


311967 


Al 382726 


Hs.1 82434 


ESTs 


5.80 


8.14 


311975 


AA804374 


Hs 272203 


Homo sapiens cDNA FLJ20843 fis, clone AD 


0.98 


3.26 


119005 

O 1 tUUu 


7-7845O 


Hs 11941 

1 19. lost 1 


ESTs 


0.12 


1.39 


312028 


T78886 

1 f uuuu 


Hs 284450 


ESTs 


3.78 


4.92 


312046 


4)580018 


Hs. 268591 


ESTs 


4.11 


7.32 


112056 


T83748 


Hs 268594 

1 IO.CuUvv't 


ESTs 


2.36 


3.08 


312064 


AA676713 


Hs.191155 


ESTs 


3.34 


5.28 




AW303760 


Hs 13685 


ESTs 


1.60 


1.15 


119093 


T91809 


Hs 121296 


ESTs 


0.68 


0.85 


312094 


278390 




gbiHSZ78390 Human fetal brain S. Meier-E 


3.05 


4.48 


312097 


AI352096 


Hs 112180 


zinc finger protein 148 (pHZ-52) 


4.52 


9.70 


1191 1ft 


TR5119 


Hs 17R9Q4 
no. I / o^at 


ESTs 


2.40 


2.60 


312128 


AI059R0Q 


Hs 17631 

i io. i / uu i 


Homo sapiens cDNA FLJ201 1 8 fis, clone CO 


2.39 


3.53 


312147 


T89855 


Hs 195648 


ESTs 


0.67 


1.03 


312175 


AA953383 


Hs. 127554 


ESTs 


5.85 


10.60 


312179 


A1052572 


Hs.269864 


ESTs 


2.41 


3.32 


312201 


AI928365 


Hs.91139 


solute carrier family 1 (neuronal/epithe 


0.24 


0.89 


312207 


H90213 


Hs.191330 


ESTs 


2.20 


4.55 


312220 


N74613 




gb:za55a07.s1 Soares fetal liver spleen 


4.28 


11.13 


312252 


AI128388 


Hs.143655 


ESTs 


1.64 


1.57 


312304 


AA491949 


Hs.269392 


ESTs 


0.12 


2.47 



129 



WO 02/086443 



PCT/US02/12476 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



80 



85 



010010 

OlZOlO 


AlA/OOEOQO 


Ue> 1^9QR1 
nS.140yo 1 


CO 1 S 


3.46 


5.69 


312319 


AA21 6698 


It. 1QQ70n 

HS.IoU/oU 


1 lka protein 


R 7R 
0. fa 


4.40 


01 O0O1 
31Z3Z1 


KbbzlU 


n. 10RQ07 

HS.iobyo/ 


CO 1 S 


0.44 


1.74 


0001 

312331 


AA0Z00IZ 


Uo OQQ1 ni 
HS.ZoyiUl 


qIucosg regulated protein, 58 kD 


3.73 


5,96 


OH OqqQ 


AA0Z40y4 


ns. 1 00044 


ESTs 


3.07 


0.95 


9199R9 
J 1 ZOOO 


AIR7RRRQ 
AID/ 0000 


Wc 1R1RR7 
(15. 10 100/ 


CO 1 5 


10.08 


16.73 


91997R 
O 1 ZO / O 


AI97R0QR 


ns. 1 / a4uo 


noil Hit/ictnn ni/plti 97 
Ucll UIVIolUll l/yl/lc £1 


2.78 


3.71 


9.1 997R 


Dconon 

rwzuoy 


U c 179717 
no. 1 1 £.1 1 / 


ESTs 


1.00 


1.00 


qi9qflQ 

O 1 4L.OOS 


Ml QUO I4U 




nh't749.h19Yl NCI PfiAP Brn52 Homo saoien 
yu.UC40ll it. a 1 IN wi_uOnr_Di 1 1 Jt nuitiu oapicti 


2.37 


3.98 


oh 9407 
0 1 Z40 / 


AAQQ.R09R 

AAyyouzo 




ah- RH4-RT0629-1 20200-01 1-b10 BT0629 Homo 


4.06 


5.41 


0 I Z44U 


Ainciiqq 
MIUOI 100 


u c iqoqic 
ns. 1000 10 


Hnmn canipnc mRNA- rRNA nKF7n7fi1 119.94 /f 


1.00 


1.00 


qi9ZR1 
01Z401 


Koyycjy 


u c i7ficoq 
ns. 1 /oooy 


CO i s 


4.96 


10.04 


01 OAXQ 
0lZ4OO 


AJ1R7R07 
AllO/OO/ 


ns. 1 4oyz4 


CO 15 


1.11 


1.00 


•31 0KO7 
01Z0U7 


AHR0177 

Anooi / / 


Ue 1vl9RR9 

ns. 140000 


FQTc 
CO 1 S 


5.89 


8.24 


01 oc90 

Ol ZOZU 


A}7/19t:Q1 
At/4Z0yi 


ns.zuooyz 


CO 1 O 


3.30 


8.92 


01 OC/Q 
31 Z04O 


AIRRROOQ 
AlOObZZa 


U<» 1RQ^9R 
no. IOy4ZD 




1.38 


1.65 


01ORR/1 
31 Z004 


uoikoo 
MZ10ZU 


Ue 9RHRR 

ns.oouoo 


CO I s 


0.40 


0.77 


01 OGQ9 
01 Z080 


A11Q9199 

ai iyo izz 


Uc 1941A1 
nS. IZ4I4I 


CO 1 s 


0.13 


0.94 


01 OCQQ 

oizoyy 


AIQftCn70 

AIObOU/3 


Ue 19R79H 

ns. I/O/ zu 


to I S 


3.75 


5.29 


oi oroo 


AAU4040 I 


Uc iRR9nn 
ns. ioozuu 


CO 1 s 


6.78 


12.93 


31Zb4o 


ucoi 01 
MoZlZl 


Up iQ9nn7 
ns. lyouu/ 


CO 1 S 


0.38 


1.13 


•51 orrr 
OlZbbb 


AIO/IOCQO 

AIZ4U00Z 


Uc 914R7R 
nS.Z140/o 


F<5Te 
CO 1 S 


0.98 


2.03 


010RQQ 

31 zboy 


A\AMG0/1R1 
AW40U401 


ns.zuoyoo 


FQTc 
Co 1 S 


0.21 


0.61 


OlZol / 


U7K/1GQ 

n/04oy 


Ue 999/19R 

ns.zoo4Zo 


CQTe 

to 1 S 


1.51 


0.85 


0.1 9R4R 
0 I Z04D 


A\A/1R910/1 
AW IOZ IU4 


u c 9nnR7Q 

ns.zuuo/y 


ESTs 


8.93 


13.78 


01 0070 

Olzo/0 


AIR00071 

Aioyuu/ 1 


Ue 9R9RR9 

ns.zooooz 


to 1 S, VvcoMy olilllldi IU Ulllldllicu piuiclil 


4.20 


6.23 


Ol OOQO 
OlZoyO 


aioicoo>i 
AIU10ZU4 


Ue 179Q99 

ns. i / luce 


Co 1 S 


2.67 


3.15 


o«i onno 

oizyuz 


A\A<0007Q7 

AW292/y/ 


Ue 10001 R 

hs.iouoio 


CCTe VA/oaWli/ ctmilar M T9D9 UllMAMTRANQ 

Co i s, vveaKiy similar to i zL/o_nuiviAiH iiaaivo 


1.19 


0.71 


01 oooc 

31 zyzo 


mooqcq 
NyUobo 


Ue 971 RQR 

Ms.z/ioyo 


CO IS 


2.50 


4.25 


01 0/10£ 

olzyob 


A )£»£}< CQ4 
AIOOIOO 1 


u^. ioico<; 
nS.IZioZO 


CQTe 

CO ( S 


1.00 


1.17 


312975 


AI04U0UO 


Ue 9Q91 10 

ns.zyo i iy 


CQTe WoaUu cimilar \f\ AI 1 17 HI IMAM AI 1 ) 3 
Co iSj vveaKiy oiiiuidi iu Ai_u/_nuiviAiM alu o 


2.30 


4.80 


312978 


N24oo7 


Ue oooenn 

Hs.zyzouu 


CQTe 

CO IS 


0.80 


1.05 


31 ZboO 


A A/lQ7n/10 

AA497043 


Ue 1 1RAQI* 

ns.noooo 


CQTe 

Co 1 s 


3.12 


3.60 


01 OQQyl 

31 zyo4 


NZOd/1 


Ue 177997 
HS.l / loot 


CQTe 
CO 1 S 


2.03 


2.13 


01 onno 
313UUU 


AH/17/10 
AI14/41Z 


Uc 14RRR7 
nS.14D00/ 


CQTe 
CO 1 5 


5.52 


8.42 


01 onoa 
313U29 


a A701 con 
AA/310ZU 


Ue 17nK0yl 
MS.1/U0U4 


CQTe 
to 1 5 


0.96 


1.39 


01 onoa 
OlOUOy 


AMiQOQn 

Ai4iyzyu 


Ue 1/fQQQO 

ns.i4yyyu 


co i s, Wedwy simiidr iu uiiiidiiicu piumui 


6.48 


13.20 


0100/ia 
010U4y 


Avvzyouoo 


Ue 11Q9R7 

ns. i iyoo/ 


to 1 s 


6.44 


10.73 


01 oorr 
0 I ouoo 


rtioo j you 


U c 19RRRA 

ns. 1 00004 


ESTs 


1*51 


2,04 


oi onco 


noinic 
uo luiu 


Uc 19R9R9 

ns. I zoooz 


ESTs 


0.25 


1.50 


qiqn7o 

0 1 OUf U 


AMOonoq 
MI4ZZUZ0 


Ue 1R199R 

no. 1 0 1 000 


ESTs 


8.56 


11.60 


9190Q7 

oi ouy 1 


Alfi7f?1RA 
MtO/0 1 04 


nS.ZU400y 


FQTc 

CO 1 o 


3.72 


4.56 


01 01 on 

010! OU 


A\A/il4Q171 
MVV44y I / I 


u c 1RRR77 

ns. iooo/ / 


ESTs 


3.28 


5.06 


oioioe 
0 1 0 1 00 


IN0yZ04 


Ue 9RR010 

ns.^oou iu 


ESTs 


0.49 


1.36 


9191R9 


AlO/nQOQ 
AIZ4U00O 


Uc 1997R0 

ns. ioz/ou 


FQTc 
CO 1 s 


5.36 


5.52 


0100m 
31 321 U 


M7//\77 
N/4U/ / 


Ue 1070/19 

ns.iy/U40 


CQTe 
CO 1 S 


0.30 


0.66 


01 OOOR 

31 3Z30 


AVA/0OQ1CO 

AWZOol oy 


Uc R9R1 9 
MS. 000 I 0 


CQTe WoaWlu cimilar In AI 1 11 HI IMAM AI U Q 
to i s, weaKiy similar 10 alu i_nuiviAiH alu o 


5.16 


8.76 


01 OOOQ 

OlOZOy 


vviyooz 


Ue 19A170 
nS. IZ4 1 / U 


CQTe 
to i s 


1.00 


3.87 


01 oocc 
31 3200 


MOO ICC 

iMyonbb 


Ue 1017R/ 
HS.1Z1 /04 


CQTe lAfoalrtu cimilar in too tint liar JotHi 

to i s, weaKiy similar 10 icoiiouidr icmi 


0.74 


2,06 


01 00R7 

OlOZb/ 


A)77nnno 
AI//UUUO 


Ue 19QRR9 

ns. izyooo 


CQTe 
to 1 S 


0.23 


1.30 


313275 


Aino7cnvi 
AlU2/bU4 


Ue 1 RQRRO 

ns.ioyoou 


CQTe 
CO 1 S 


6.68 


9.57 


01 oooo 
0 lozyu 


AI7R00/7 
AI/O0Z4/ 


Ue 9QR>iq 

ns.zyo4o 


Unmn canipnc pRMA Fl I19.1fiq fie rtnnp NT 
nuiliu od|JJwMo uuivm i l.j iu iuvj no, uiuiie n i 


1.34 


1.07 


Ol OOQO 


AI0ROQQ1 

Aioozyyi 


Ue 0O9191 

ns.zuzizi 


to I s, vveaKiy similar 10 eiiv prumin [n.b 


2.00 


4.32 


Ol OOOK 

31J320 


AMonci 1 
AI4ZUb1 1 


Ue 197009 

ns.iz/ooz 


CQTe 
to 1 S 


1.20 


2.27 


31 3357 


A\A/n7>IO/IQ 

AvVU/4o4o 


Ue 901 R01 
HS.ZU10U1 


CQTe 
CO 1 S 


4.02 


5.33 


01 OOQO 

3133y3 


A1R7>IRQR 

Alb/4boO 


Ue 9001/11 

HS.ZUU141 


CQTe 
CO 1 S 


1.36 


2.84 


01 OOQO 

313399 


AW3/boby 


Ue 10/1007 

ns.iy4uy/ 


CQTe 

to IS 


2.58 


5.26 


01 9/M/1 

313414 


AlO>11R/n 

AIZ4104U 


Ue 199099 

ns. lozyoo 


FQTc 
tO 1 5 


6.57 


15.07 


313417 


A A*7jI A 1 C1 

AA/41151 


Ue 1 O7O0O 

HS.lo/oZo 


CQTe 

CO 1 S 


0.63 


3.01 


01 0/1 G7 


A AR7RncO 
AAO/OUOZ 


Uc 1Q9999 

ns. \so£.co 


Unmn canipnc pnMA Fl I11fi4fifk rlnnp HF 


2.78 


4.70 


010>IQQ 

3io4yy 


A10R1 OQQ 

Aizoioyu 


Uc 1/1RORR 

ns. 140U00 


klAA19/lR nrnlpin 
rSIAA I040 proiclll 


0.91 


2.37 


oioci c 

31 301 D 


AAUzyuoo 


Uo 19R14R 

ns. 1 00 1 40 


ESTs 


3.41 


7.08 


01 QCCC 

313000 


A AR0QC17 
AAOZaO I / 


Ue 11RR09 

ns. i ioouz 


FQTc 
CO 1 S 


0.23 


0.70 


01 ORRQ 

oioooy 


A1070/110 

aiz/04 iy 


Ue iqciAR 
ns. 1 00 1 40 


hunnlhpftrat nrnfpin Fl I19.QRA 

nypuineiicdi proiciii plj 10004 


1.88 


1.00 


313570 


AAU41400 


Ue 90Q919 

ns.zuyoiz 


FQTc 

to I s 


0.73 


2.27 


Ol OCOQ 

31363b 


A l7COn7C 

AI/03U/0 


Ue 10/IC97 
MS.lU4bZ/ 


nonno sapiens cuina tlj iu ioo ns, uune nt 


1,00 


1.72 


010RR9 


AA7jM1K1 

AA/4U101 


u c iqoA9R • 
ns. 1 ou4zo 


FQTc 
CO I s 


0.20 


1.42 


01 0R71 

3130/1 


W4yoZO 


Ue 10/1R19 

ns.lU4oio 


DO/1 0 linmnlnn 

Kr4z nomuiog 


1.00 


1.00 


01 0R79 
OlOO/Z 


A\AMRQQQ1 

AW4booyi 


Uc 1 99Q/1R 

ns. izzy40 


CQTe 
CO 1 5 


3.46 


5.80 


01 ofion 
oiooyu 


A1/109RQ1 

Ai4yooy I 


Uc 7R14R 
no, / 0 I4D 


nlafptpWpnrlnfhplial rail arlhpcinn mnlpn 
UldlclBl/clliiUlllclldl w>oii duncoiuii uiuico 


0.51 


0.97 


01071 1 
310711 


A A9QPfl7n 

AAoyou/u 


Up iqq>L7l 

ns. 1004/ 1 


FQTc 
CO 1 S 


0.18 


1.01 


oi 0700 
0 10/ ZO 


AAf|7nA19 
MrtU / X Z 




nh*7mRRn10c1 Qfratanpnp nptiropnithpliitm 


1.08 


1.03 


01 070R 
313/20 


AI7/MRQ7 
A!/ 4400/ 


Uc 9R7R0R 
nS.ZO/OUO 


CQTe 
CO IS 


2.13 


2.99 


qi 0774. 
OIO/ /*t 


AW19RR9.R 


Hq 14dRR9. 
no. J 44000 


ESTs 


1.38 


1.19 


010704 
0 10/ 04 


AA3 I UO 1 4 


Ue 19/lQnc; 
no. i04yuo 


FQTc 

CO 1 0 


3.88 


5.78 


0 io/yu 


A\A/fl7Q.ERQ. 

Avvu/oooy 


Uc 1770/19 

ns. 1 / /U40 


FQTc 
CO 1 S 


0.22 


2.06 


01OQO9 

313H0Z 


A\A/07inOO 
AVVZ/ lUZZ 


Ue 19990A 

ns. ioozy4 


FQTc 
CO 1 S 


1.15 


0.91 


01 OQO/1 

313U34 


AVAM1D77Q 

AW41o/ / y 


Ue 1 1 XQQQ 

riS.114ooy 


CQTe 
CO IS 


0.68 


3.14 


oiqoqc 
0 10000 


AIOOOHJO 


He 1 ^Qf1R7 
no. 1 09UO/ 


ESTs 


5.74 


&88 


313852 


H18633 


Hs.123641 


protein tyrosine phosphatase, receptor t 


o!ie 


1.14 


313854 


AW470806 


Hs.275002 


ESTs 


2.09 


4.06 


313865 


AA731470 


Hs.163839 


ESTs 


3.41 


4.09 


313871 


AW471088 


Hs.145950 


ESTs 


5.28 


6.83 


313883 


AI949384 




gb:nu76d01.s1 NC!_CGAP_Alv1 Homo sapiens 


2.90 


10.91 


313915 


AI969390 


Hs.1 63443 


Homo sapiens cDNA FLJ11576 fis, clone HE 


1.00 


1.00 



130 



WO 02/086443 

313926 AW473830 Hs.171442 

313948 AW452823 Hs.135268 

313978 AI870175 Hs.13957 

313983 AI829133 Hs.226780 

314035 AA164199 Hs.270152 

314037 AW300048 Hs.275272 

314040 AA166970 Hs.1 18748 

314067 AW293538 Hs.51743 

314103 AI028477 Hs.132775 

314107 AA806113 Hs.189025 

314113 AA218986 Hs.118854 

314124 AW1 18745 Hs.9460 

314126 AA226431 

314128 AA935633 Hs.194628 

314151 M236163 Hs.202430 

314184 AW081795 Hs.233465 

314192 AW290975 Hs.118923 

314244 AL036450 Hs.103238 

314253 AA278679 Hs.189510 

314262 AW086215 Hs.246096 

314320 AA811598 Hs.275809 

314332 AL037551 Hs.95612 

314335 AA287443 Hs.142570 

314340 AW304350 Hs.130879 

314351 AA292275 Hs.193746 

314376 AI628633 Hs.324679 

314443 AA827125 Hs.1 92043 

314458 AI217440 Hs.143873 

314466 AA767818 Hs.122707 

314478 AI521173 Hs.1 25507 

314482 AL043807 Hs.134182 

314506 AA833655 Hs.206868 

314519 R42554 Hs.210862 

314529 AL046412 Hs.202151 

314546 AW007211 Hs.16131 

314562 AI564127 Hs.143493 

314579 AW197442 Hs.116998 

314580 AW451832 Hs.255938 
314585 AA918474 Hs.216363 
314589 AW384790 Hs.153408 
314592 AA435761 Hs.192148 

314603 AA418024 Hs.270670 

314604 AA946582 Hs.8700 
314606 M418241 Hs.188767 
314648 AA878419 

314699 AI038719 Hs.132801 

314701 AI754634 Hs.131987 

314710 AI669131 Hs.290989 

314750 AI095005 Hs.1 351 74 

314767 AW135412 Hs.164002 

314801 AA481027 Hs.109045 

314817 AI694139 Hs.192855 

314835 AI281370 Hs.76064 

314852 AI903735 

314853 M729232 Hs.153279 

314940 AW452768 Hs.162045 

314941 M515902 Hs.130650 
314943 AI476797 Hs.184572 
314955 M521382 Hs.192534 
314973 AW273128 Hs.300268 
315004 AA527941 Hs.325351 
315006 AI538613 Hs.298241 
315033 AI493046 Hs.146133 
315035 A1569476 Hs.177135 
315056 AI202703 Hs.152414 
315069 AI821517 Hs.105866 
315071 AA552690 Hs.152423 
315073 AW452948 Hs.257631 
315078 AA568548 Hs.190616 
315080 AA744550 Hs.1 36345 
315120 AA564991 Hs.269477 
315175 AI025842 Hs.152530 
315193 AI241331 Hs.131765 
315196 AA972756 Hs.44898 
315200 AI808235 Hs.307686 
315254 AI474433 Hs.179556 
315353 AW452608 Hs.279610 
315397 AA218940 Hs.137516 
315403 AW362980 Hs.163924 
315431 AA622104 Hs.1 84838 

315454 AI239473 

315455 AW393391 Hs.1 5691 9 
315473 AI681671 Hs.312671 
315483 AW512763 Hs.222024 
315526 A1193048 Hs.128685 



CO 1 S 


3.40 


Co 1 S 


5.77 


CO IS 


n ar 


to 1 S 


d in 


PCTe 
to 1 5 


5.88 


ESTs 


1.00 


ESTs 


7.60 


r\IMrt 1 OHKI \ll Ulo II I 


1.86 


ESTs 


2.90 


F<3T<: 
CO 1 5 


2.00 


ESTs 


0.91 


Hnmn <?anipn<* mRN)A* rDNA DKF7n547C244 ffr 


2.53 


nh*nMflh1? q1 MCA CGAP Pr1 Mnrno ^aniens 


3.13 


ESTs 


2.90 


ESTs 


4.15 


ESTs 


3.44 


ESTs 


1.00 


ESTs 


2.88 


CO 1 b 


4.98 


CO lb 


0.38 


CO 1 b 


3.34 


CO 1 o 


2.85 


Unmn canione Hnnn OAROQ mRNA cpniipnrp 


4.35 


FST<: Mnrtarafplv "similar fn ntitafivp. n15 


0.77 


CO 1 o 


3.07 


ESTs 


4,10 


ESTs 


6.20 


pen**: 

CO lb 


0.58 


ESTs 


2.53 


nCAH-hny nrntpin 
L/CAL/-UUA piuiciii 


3.94 


CO 1 b 


1,30 


Hnmn <5ariipnc. cDNA FLJ 14056 fis clone HE 


3.28 


I-DOa, Drain, 1 


3.12 


ESTs 


3.43 


nypuuitmudi piuiciii ruj izo/o 


1.38 


to I S 


2.29 


CO lb 


3.87 


to is, ivioaeraieiy similar io imami^uu piu 


0.10 


ESTs 


U8 


Homo sapiens cDNA FLJ 10570 fis, clone NT 


1.00 


ESTs 


0.90 


ESTs 


4.56 


deleted in liver cancer 1 


3.42 


ESTs 


2.97 


gb:EST391378 MAGE resequences, MAGP Homo1.42 


ESTs 


3.66 


ESTs 


0.03 


EST 


3.40 


ESTs 


2.80 


ESTs 


3.20 


hypothetical protein FLJ 10498 


1.00 


ESTs 


0.91 


ribosomal protein L27a 


5.75 


gb:MR-BT035-2001 99-031 BT035 Homo sapien 


1.68 


ESTs 


0.60 


ESTs 


10.10 


ESTs 


0.31 


cell division cycle 2, G1 to S and G2 to 


2.18 


ESTs 


2.59 


ESTs 


1.05 


EST 


5.64 


Transmembrane protease, serine 3 


0.52 


ESTs 


2.46 


ESTs 


0.34 


ESTs 


2.10 


ESTs 


1.00 


Homo sapiens cDNA: FU21274 fis, clone C 


1.78 


ESTs 


1.17 


ESTs 


3.00 


ESTs 


1.00 


ESTs 


0.64 


ESTs 


0.61 


ESTs 


1.06 


Homo sapiens clone TCCCTA001 51 mRNA sequ 


0.48 


EST 


3.76 


ESTs 


5.37 


hypothetical protein FLJ 10493 


1.00 


fidgelin-like 1 


3.38 


ESTs 


2.04 


ESTs 


2.36 


gb:qh36f02.x1 Soares NFL_TJ3BC„S1 Homos 


3.46 


ESTs 


3.78 


ESTs, Moderately similar to OVCA1 


0.89 


transcription factor BMAL2 


2.32 


ESTs 


1.67 



WO 02/086443 



PCT/US02/12476 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



80 



85 



316630 

U 1 UUUU 




Hs 127780 


ESTs 


1.05 


1.01 


315R41 
0 I OOH 1 


AHRR933 

Ml IOOZOO 


1931^9 


Qnprm aQcnriafprl anfinpn 4 


0.85 


0.56 


315552 


AW445034 

MVV*f £ tOUO'r 


l-k 9^6*178 


ESTs 


1.00 


2.22 




AA737415 


u q 169826 


ESTs 


2.66 


2.48 


31KK77 

O J 00/ f 


MVVO 1 OOtO 


H<; 179R3 


hunnthpHral nrnfpin PI I10RQD 

1 ly UUll ICULdl UIULUMI rLU IUU9U 


2.20 


2.25 


Q1CCQ7 
J 1 UOOj 


AI9RR3QQ 


Hs 140489 


ESTs 


1.00 


1.04 


O 10009 


AW0793R7 


1*iR258 


Homo saDiens mRNA- cDNA DKFZd434B1272 (f 


0.14 


1.05 


31 RR93 

O I OUZO 


AA3R407R 

MMOOHU / O 


Hs 258189 


ESTs 


7.44 


12.56 


0 1 00OH 


AAR370R6 

MMO0 /UOO 


H«; 99n f iR5 


ESTs 


0.50 


1.40 


315RRR 

•J 1 OUUU 


AA91 2347 


Hs 136585 


ESTs 


0.43 


1.22 


31 5677 


AIQ39RR9 


Hs 164073 

no. 1 utui 0 


ESTs 


0.60 


1.39 


31 R7(]R 
O ID l UD 


AW440749 


Hs I'WSfi 


hvnothplinal nrotein FLJ20202 


2.18 


3.77 


31^707 

OI0/U/ 


A141R056 

MIH 1 OUOO 


no. 1 0 1 1 ou 


ESTs 


2.88 


2.63 


31^730 

0 I 0/ OU 




Hq 901 691 


ESTs 


0.11 


0.60 


O I 0/HO 


AIR9176Q 

MI0Z 1 / 09 


Hq 1Q1RRR 
no. 1 9 1 oou 


ESTs 


3.50 


7.25 


315791 


AA678177 




gb*zi15a05.s1 Soares fetal_liver spleen_ 


1.78 


2.63 


315flni 
O I OOU i 


AAR97769 


Hs 266134 


ESTs 


4.31 


6.23 


31 5R90 


A1652022 


Hs.258785 


ESTs 


2,35 


3.01 


315R7R 

O 1 JO / u 


AARR3336 


Hs 189046 


ESTs 


2.12 


2.64 


3150.05 


AI821911 


Hs 209452 


ESTs 


1.03 


1.97 


31 RQ93 


AI0697RQ 

MIUO£/ 09 


Hs 133263 
no. 1 ojauu 


ESTs 


2.63 


5.06 


31RQ64 
O I 09O4 


AW276810 
MWt/ OO iu 


Hs 254859 


ESTs, Moderately similar to ALU5_HUMAN A 


1.21 


0.85 


o I oy /o 


AAR30RQ3 
MM0OUO9O 


H<5 1 1 Q7RQ 
no. I 1 9/ 09 


ESTs 


3.09 


3.41 


O IOW I 


A)94fi«;R4 


Hs 190745 
no. i9t/f **u 


Hnmo sflnipns cDNA' FLJ21326 fis clone C 


2.20 


6.82 


316011 


AW61RQ53 


Hs 901379 


ESTs 


0.35 


1.63 


316012 


AA7R4Q50 

r\r\l Ut9UU 


Hs 119898 


ESTs 


6.56 


8.13 




AlQR34flQ 
MI900 / tU9 


He 1RQ99R 

no. 1 09££0 


ESTs 


5.69 


10.69 


31 ROAR 
O l0U*+O 


Al7907*iQ 
MI/ZU/ 09 


Uc 994Q71 


ESTs 


2.84 


10.45 


31 R07R 

O I OU / U 


AW9Q7RQ5 

MVV £9/ 09U 


Hs 116424 

no. 1 1 uttt 


ESTs 


0.30 


1.05 


316124 


A)3flRflfi9 


He 1R702R 

no. Ivi ytu 


ESTs 


1.00 


1.43 


316151 


airoroir 

MIOUOU l O 


Hs 156520 

no. 1 uutjt.u 


ESTs 


5.80 


9.03 


-51 R1 Q7 
0 ID JO/ 


AWMR9QQ 

MVVO 1 0t99 


Hq 1Q9953 
no. 1 9tto o 


ESTs 


1.20 


3.96 


31R904 


aa731 c >oq 

MM/ O IOU9 


Uc 190967 


ESTs 


4.92 


6.94 


31R232 


AI/V2Q7R53 

MVV^9/ OUO 


Hs 251203 


ESTs 


1.48 


1.60 


31 6275 


AI671041 
MIO/ 1 U*t l 


Hs 292611 


ESTs, Moderately similar to AlU1_HUMAN A 


5.86 


12.14 


316291 


AW376Q74 


Hs 156704 


ESTs 


2.73 


2.69 


31R303 


AA740QQ4 


Hs 209609 


ESTs 


1.53 


1.26 


316344 


AA744^1fi 

r\r\/ HHO IO 


no. 1 (.uu iu 


ESTs 


3.66 


8.34 


11 R34R 


Alfl9R47R 


Hs 157447 

no. 1 *ji tti 


ESTs 


3.51 


6.69 


31R3R6 
O 1 UJUU 


Ml Oit/ OHO 


Hq 910776 


ESTs 


2.50 


4.33 


qi coon 


Ml 0900/ O 


Hq 1R44QR 
no. 1 0*t*t9U 


ESTs 


l!l6 


2.16 


31R470 
0 1 04 /U 


AAflnQQn9 

MM0U99UZ 


Hq 943R13 


ESTs 


5.40 


10.34 


o i uouy 


AA7R731fl 

MM/ O/ O 1 U 


Hq 9Q1766 
no. 4.9 1 1 uu 


ESTs 


2^46 


2.89 


31R514 

O I 00 l*r 


AA7RRD37 

MM/ OOUO/ 


Hs 291671 
noxo iuf 1 


ESTs 


4.70 


6.04 


0 IOO 19 


AIQ9QHQ7 
M19^,9U9/ 




ob-od10r11 s1 NCI CGAP GCB1 Homo saDiens 


4.41 


9.70 


31RR0Q 
0 1 0OU9 


AW9Q9 I ?9fl 

MVVZ9Z0ZU 


Hq 1990R2 

no, l^£UO£ 


ESTs 


1.00 


2.89 


0 1 0000 


AHOCCDC 
Ml I^OOOO 


Hq 197955 
no. 1 £.1 900 


ESTs 


2.61 


3.72 


31R7fin 
OIO/UU 


AW17931R 

MVV I / tO 1 0 


Hs 959961 

MO.£U<&9U 1 


FSTs Weaklv similar to ALU1 HUMAN ALU S 


3.46 


4.64 


316711 


Al 743791 


Hq 9R5316 


ESTs, Moderately similar to ALU7_HUMAN A 


4.45 


6.95 


316713 


Aif)Qnfi71 


Hs 134807 


hypothetical protein FLJ 12057 


0.30 


2.40 


316715 


AI44fi9fifi 


Hs 170673 


ESTs Weaklv similar to AF1 26780 1 retin 


0.20 


1.45 


316787 


AW3RQ770 


Hs 130351 

no . 1 ouv/u 1 


ESTs 


405 


5.53 


31fiflOQ 
o i oouy 


AAR9*iR3Q 

MMOt0OO9 


Hq 902938 


ESTs 


2.25 


3.82 


31RR11 

0 I 00 I 1 


AAQ99flR0 

MM9^£VJ0U 


Hs 132471 
no. 1 sjtsii 1 


ESTs 


1.00 


1.32 


31RR19 
0 IOO \ c 


AW1 3^04^ 

MVV IOOUHO 


Hs 232001 


ESTs 


3.28 


4.70 


0 1 00 1 0 


AAR9717R 
MMOi/ I /O 


Hq 19431R 
no. I t*to 1 0 


ESTs 


0.67 


1.81 


Q1 RROA 
0l00Z4 


A AR0741R 
MMOO / H 1 0 


He 1 949QQ 
no. \£.*t£.vo 


ESTs 


3.53 


6.00 


qi CR07 


AI3Rnd9Q 
M100U t fZ9 


Hq 179445 
no. 1 / cH*+u 


ESTs 


0.72 


1.56 




AW9QR1 1 Q 
MVVt90 1 1 9 


Hq 909R3R 
rio.^U^OOO 


ESTs 


1.64 


2.97 


0 1 090 I 


AA1343RR 

MM 1 OHOOO 


Hq 5754R 


ESTs 


1.45 


1.08 


Q1R070 


AAflfini79 


Hq 13940R 
no. 1 Jituu 


ESTs 


1.00 


1.53 


316971 


AARRI1919 


Hs 170991 


ESTs 


1.08 


1.96 


3iRQQn 

O 1 039U 


AARR1R11 


Hq 130643 
no. 1 ouutu 


ESTs 


5.44 


10.04 


317001 


AIR97Q17 


Hs 233694 


hvnnthptinal nrotein FLJ 11 350 


3i56 


4.37 


Oil UUO 




Hs 143707 


ESTs 


0.69 


1.37 


317061 
0 1 /UO 1 


AA8739R3 

MMO/ <J£.\JO 


Hs 126233 


ESTs 


6.18 


12.72 


317128 


AA971374 


Hs 125674 


ESTs 


1.87 


2.66 


317129 


H 12523 


Hs 78521 


Homo sapiens cDNA: FLJ21193 fis, clone C 


4.12 


6.64 


317137 


AW341'5fi7 


Hs 125710 

1 10. 1 tui 1 \j 


ESTs 


2.82 


5.12 


317196 


Al 348258 


Hs 153412 


ESTs 


1.98 


2.51 


317212 


AIRRR4RR 
Miouw^uu 


Hs 148294 

no. itotst 


ESTs 


1.86 


2.83 


317223 


AW2Q7Q20 


Hs 130054 

ltd* 1 OUUOt 


ESTs 


0.83 


1.57 


317994 

0 I / £.CH 


UOO/ uu 


Hq Q3099 

rlo.90U£9 


Qnarn/n^tpnnpntin rwcv and kazal-like d 


2.74 


0.86 


317266 


AAQnR9RQ 

MM9UU£09 


Hq 903614 


ESTs 


1.00 


1.00 


317282 


AIR07444 


Hs 176101 
no ■ ifuiui 


ESTs 


2.60 


4.21 


317285 


A\A/37fiRR9 


Hq 9990R0 


ESTs 


1.96 


3.49 


31 7302 


AA0nR7flQ 

MM9UOf U9 


Hq 136664 
no. 1 ujjuh 


ESTs 


7^16 


8.32 


317304 
0 1 / OU*f 


A\A/44QRQQ 

MVV < r 1 r9099 


Hq 1301R4 
no. I ou 1 oh 


ESTs 


1.38 


2.28 


317320 


AA927151 


Hs. 130452 


ESTs 


3^58 


8.13 


317413 


AW341701 


Hs!l 26622 


ESTs 


2.08 


4.92 


317417 


AA918420 


Hs.145378 


ESTs 


3.06 


4.79 


317452 


AA972965 


Hs.1 35568 


ESTs 


4.22 


9.21 


317519 


AI859695 


Hs.126860 


ESTs 


1.88 


4.15 


317521 


A1824338 


Hs.1 26891 


ESTs 


3.12 


4.55 


317529 


AI916517 


Hs.1 26865 


ESTs 


2.73 


3.34 



132 



WO 02/086443 



PCT/US02/12476 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



80 



85 



317570 


AI733361 


Hs.127122 


ESTs 


1.00 


2.43 


317571 


AAQ38663 


Hs 199828 


ESTs 


5.20 


11.95 


\J 1 1 03U 




Hs 192123 


ESTs 


0.33 


1.56 


31 7627 


A134R110 


Hs.1 32553 


ESTs 


1.50 


1.39 


Oil uuu 


A1733310 


Hs. 127346 


ESTs 


0.48 


1.46 


317659 


AA961216 


Hs. 127785 


ESTs 


4.18 


7.14 


317674 


AW294909 


Hs.1 32208 


ESTs 


2.92 


3.20 


317686 


AA969051 


Hs. 187319 


ESTs 


1.00 


1.01 


317692 


AI307659 


Hs.1 74794 


ESTs 


5.33 


9.59 


317701 


AI674774 


Hs. 128014 


ESTs 


1.00 


1.00 


317711 


AI733015 


Hs.272189 


ESTs 


5.13 


7.81 


317722 


AJ733373 


Hs.1 281 19 


ESTs 


2.50 


6.03 


317756 


AA973667 


Hs.1 28320 


ESTs 


1.59 


1.30 


317777 


AH 43525 


Hs.47313 


KIAA0258 gene product 


1.00 


2.48 


317799 


AI498273 


Hs.1 28808 


ESTs 


1.78 


2.11 


317803 


AA983251 


Hs, 128899 


ESTs 


0.80 


1.06 


317821 


AI368158 


Hs.70983 


PTPL1 -associated RhoGAP 1 


0.17 


0.68 


317848 


AI820575 


Hs.1 29086 


Homo sapiens cDNA FLJ 12007 fis, clone HE 


5.30 


8.16 


317850 


N29974 


Hs.1 52982 


hypothetical protein FLJ 131 17 


1.30 


2.28 


317861 


AW341064 


Hs.1 291 19 


ESTs 


2.18 


5.93 


317865 


AI298794 


Hs.1 291 30 


ESTs 


4.48 


8.20 


317869 


AW295184 


Hs.1 291 42 


deoxyribonuclease II beta 


0.44 


0.99 


317881 


AI827248 


Hs.224398 


Homo sapiens cDNA FLJ11469 fis, clone HE 


4.06 


2.23 


317890 


AI915599 


Hs.1 29225 


ESTs 


4.68 


7.48 


317899 


A1952430 


Hs.150614 


ESTs, Weakly similar to ALU4__HUMAN ALU S 


3.14 


3.37 


317986 


AI005163 


Hs.201378 


ESTs, Weakly similar to T12545 hypotheti 


0.28 


1.66 


318001 


AW235697 


Hs.1 30980 


ESTs 


5.12 


9.97 


318016 


AI016694 


Hs 256921 


ESTs 


1.86 


4.50 


318023 


AW243058 


Hs.1 31155 


ESTs 


2.92 


5.22 


318054 


AW449270 


Hs.232140 


ESTs 


3.92 


6.37 


318068 


A1024540 


Hs.1 31 574 


ESTs 


1.21 


1.27 


318117 


AI208304 


Hs.250114 


ESTs 


0.86 


1.17 


318187 


AI792585 


Hs.1 33272 


ESTs, Weakly similar to ALUC_HUMAN !!!! 


5.90 


6.98 


318223 


AI077540 


Hs.1 34090 


ESTs 


1.05 


0.90 


318240 


AIQ85377 


Hs.143610 


ESTs 


3.10 


2.40 


318255 


A1082692 


Hs.1 34662 


ESTs 


0.02 


1.05 


318266 


AI554341 


Hs.271443 


ESTs 


6.12 


10.55 


31 8330 


AI093840 


Hs.1 43758 


ESTs 


4.98 


7.90 


318369 


AI493501 


Hs.170974 


ESTs 


2.46 


5.62 


318428 


AI949409 


Hs.1 94591 


ESTs 


0.77 


0.45 


318458 


Al 149783 


Hs.1 58438 


ESTs 


3.54 


4.92 


318467 


AI151395 


Hs.1 44834 


ESTs 


4.56 


5.62 


318473 


AI939339 


Hs.1 46883 


ESTs 


2.08 


4.05 


318476 


AI693927 


Hs.265165 


ESTs 


4.22 


8.07 


318487 


Al 167877 


Hs.1 4371 6 


ESTs 


1.47 


1.05 


318488 


AI217431 


Hs.1 44709 


ESTs 


1.40 


4.14 


318491 


T26477 


Hs.22883 


ESTs, Weakly similar to ALU8.HUMAN ALU S 


1.84 


1.90 


318499 


T25451 




gb:PTHI188 HTCDL1 Homo sapiens cDNA 573 


2.58 


5.20 


318537 


AA377908 


Hs.1 3254 


ESTs 


3.26 


4.18 


318538 


N28625 


Hs. 74034 


Homo sapiens clone 24651 mRNA sequence 


0.35 


107 


318547 


R20578 


Hs.90431 


ESTs 


3.22 


4.60 


318552 


R18364 


Hs.90363 


ESTs 


4.87 


9.06 


318575 


R55102 


Hs.1 07761 


ESTs, Weakly similar to unnamed protein 


1.91 


1.98 


318580 


T34571 


Hs 49007 


poly(A) polymerase alpha 


2.74 


6.22 


318587 


AA779704 


Hs. 168830 


Homo sapiens cDNA FLJ 121 36 fis, clone MA 


0.85 


2.46 


318596 


AI470235 


Hs.1 72698 


EST 


4.88 


4.93 


318622 


T48325 


Hs.237658 


apolipoprotein A-ll 


4.80 


12.51 


318629 


N25163 


Hs.8861 


ESTs 


0.39 


1.04 


318637 


AA243539 


Hs.9196 


hypothetical protein 


1.72 


3.57 


318648 


T77141 


Hs.184411 


albumin 


6.27 


9.91 


318650 


AA393302 


Hs.1 76626 


hypothetical protein EDAG-1 


3.96 


8.84 


318671 


AA1 88823 


Hs.299254 


Homo sapiens cDNA: FLJ23597 fis, clone L 


1.53 


0.81 


318679 


T58115 


Hs.10336 


ESTs 


1.00 


2.19 


318711 


AI936475 


Hs.1 01 282 


Homo sapiens cDNA: FLJ21238 fis, clone C 


3.05 


3.18 


318725 


AI962487 


Hs.242990 


ESTs 


1.08 


2.46 


318728 


Z30201 


Hs.291289 


ESTs, Weakly similar to ALU1JHUMAN ALU S 


0.77 


1.33 


318740 


NM_002543 Hs.77729 


oxidised low density lipoprotein {lectin 


0.25 


1.49 


318776 


R24963 


Hs.23766 


ESTs 


1.00 


3.01 


318784 


H00148 


Hs.5181 


proliferation-associated 2G4, 38kD 


2.70 


3.86 


318816 


F07873 


Hs.21273 


ESTs 


3.90 


7.13 


318865 


H10818 




gb:ym04f10.r1 Soares infant brain 1NIB H 


2.25 


3.56 


318879 


R56332 


Hs.1 8268 


adenylate kinase 5 


1.78 


5.00 


318881 


243224 


Hs.1 24952 


ESTs 


4.79 


14.13 


318894 


F08138 


Hs.7387 


DKFZP564B1 16 protein 


5.31 


7.00 


318901 


AW368520 


Hs.301528 


L-kynurenine/alpha-aminoadipate aminotra 


1.03 


0.91 


318925 


Z43577 


Hs.21470 


ESTs 


2.23 


3.80 


318936 


AI219221 


Hs.308298 


ESTs 


1.86 


7.16 


318982 


Z44140 


Hs.269622 


ESTs 


5.84 


9.79 


318986 


Z44186 


Hs.169161 


ESTs, Highly similar to MAONJHUMAN NADP- 


1.00 


1.00 


319041 


Z44720 


Hs.98365 


ESTs, Weakly similar to weak similarity 


3.38 


6.11 


319103 


H05896 


Hs.4993 


KIAA1313 protein 


1.00 


1.07 


319170 


R13678 


Hs.285306 


putative selenocysteine lyase 


3.79 


5.03 


319196 


F07953 


Hs.1 6085 


putative G-protein coupled receptor 


1.00 


2.98 


319199 


F07361 


Hs.13306 


ESTs 


3.53 


5.66 


319242 


F11472 


Hs.12839 


ESTs 


5.87 


7.26 



133 



WO 02/086443 



PCT/US02/12476 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



319263 


T65331 


Hs 81360 


Homo sapiens cDNA: FLJ21927 fis, clone H 


1.81 


1.57 


0 1 y&Dr 


F11Rn9 


Uo COH Q 
nb.DO IO 


ESTs 


1.10 


4.72 




R1 "3474 


Uq oqn9R3 


ESTs 


4.80 


10.40 


319279 


Tfi5flQ4 


no. I lui ( 


(Y3L147 nrnfpm 


1.50 


2.11 


319282 


AA4R115R 
rVrtnQ lOOO 


nb. i zo/ o 


ESTs 


1.00 


1.00 




VVU/ Out 


l-k 7Qfl59 
no. / yooy 


Ironcfnrnninn nr/iu/fh faptor hots rof'pnfrt 
U diioiui i mi iy yiuwin iciuiui] ucio icucuiu 


0.18 


0.68 


319291 


WRR57R 
VVOOOf o 




hunnlhpftfsl nrnrpin Fl J9909Q 


0.26 


0.62 


O 1 UC.UO 


F12119 


H«; 1 9583 
no. i tuuw 


ESTs 


3.13 


4.50 


319312 


245481 




gb:HSC2QE041 normalized infant brain cDN 


1.10 


1.00 




H 54954 




cctq MnHpra^Plv similar tn A! i )5 HI IMAM A 
nolo, iviuuci aLciy oiiiiiiai 10 nLUJ i tuivinii n 


0.16 


0.73 


319391 

O t COS I 


R06304 


Hs 13911 


ESTs 


1.26 


2.43 


o i yoyo 


HR7130 
noi ioo 


H<? 301743 


ESTs 


0.70 


0.76 


31939R 
o i yoyo 


AA35Q754 


u- 101196 

no. i \? i i ou 


ESTs 


2.45 


3.59 


319407 


R0539Q 
r\UOOty 




nh'VfiQ1h04 r1 finarpe fplal liupr Qnlppn 


2.00 


3.54 


319425 


T82930 




gb:yd39f07.r1 Soares fetal liver spleen 


428 


8.81 


319433 


R06050 


Hs 191198 


ESTs 


6.15 


14.13 


319437 


AA282420 


Hs.111991 


ESTs, Weakly similar to Y48A5A.1 [C.eleg 


3.26 


5.68 


319466 


AIR0Q937 


Hs 116417 


ESTs' 


1.76 


5.65 


319471 


R06546 


Hs 19717 

1 IOi I v l If 


ESTs 


4.29 


4.84 


319480 


R06933 


Hs.1 84221 


ESTs 


1.00 


1.00 


319484 


T91772 




nh'vri59a10 e1 Snare 1 ; fpfal liupr snleen 


2.81 


4.88 


319486 

U 1 0"<JU 


AI382429 


Hs.250799 


ESTs 


2.08 


2.82 


319508 

O 1 Owl/O 


TQ9R9R 


Hs 270104 


ESTs, Moderately similar to ALU8_HUMAN A 


2.80 


4.39 


319593 
O 1 oo^o 


TR9499 

1 OOtOO 


Uq iqi184 

no. loll u*t 


ESTs 


1.55 


3.25 


319545 


R83716 


Hs 14355 


Homo sapiens cDNA FLJ 13207 fis, done NT 


1.65 


1.19 


31Q54R 
o i yoHo 


RflQRQ9 
r\uyoy^ 




ntvvf93h19 r1 finarpe fpfal liupr enlppn 


5.11 


8.54 


o I yo Ji. 


AA09R10R 

MMUSU 1 OO 


Hs 90403 


ESTs 


1.89 


3.36 


O-fQCOO 

o i yoo/. 


1 oiyyo 


Hq 9^ni , i4 
noxou i oh 


hvnnrhfltiral nmtatn Fl J12Q73 

I lyjJOU ICllUdl OIUICIM iLO ItJI o 


3.48 


4.82 


o i yooo 


n7RflflR 


H<s 9R3RR3 
no.ZOOOOO 


phrnmnsnmp ft nnpn rpadinn framfl 4 

Ulil uiiiuoui i ic o uuoii icauiny naiiiu "r 


0.26 


0.82 


?1 QR04 

O 1 3UU*t 


R11679 


Hq 9Q7753 
no.4.y/ 1 oo 


uimpnKn 

VIIIICHUII 


1.68 


3.41 


o i youy 


AW947514 


He 19993 
no. I 


hunnfhpHnal nrnipin Fl 191103 
iiyuouicuudi piuicin r uo c i iuo 


3.06 


4.24 


319611 


H 14957 




nh'umlQrIO r1 Rnarpt; infant hrain 1 Ml R H 
y u.y 1 1 1 1 ou i u.i i ouaics it iiai u main imiu ii 


2.76 


4.24 


o i yooo 


AA7701R3 
/VAX / vl IOO 


no. 1 1 jj i o 


nnrhflraf*tpri7prl hvnnfhalamne nrofpln HTO 
u iiu i idi aL/ici i^cu i iy uuii ioiqi i iuq uiul^hi iiiu 


2.61 


3.55 


31 9657 

O 1 SUul 


R1 9897 


Hs 106604 

1 IOi lUVJUVt 


ESTs 


5.32 


7^68 


319658 

O 1 3UJD 


R13432 


Uq 1R7481 

no. i ui *tu i 


ivntrnnhin namma 1 
oy i ui uui hi i| yaiiiiiicj i 


3.35 


5.00 


31Q661 


H08035 

nooooo 


Hs 21398 


FSTs MouGratelv similar to A Chain A H 


5.18 


12.55 


31QR69 


H063R9 


Hs 21400 


ESTs 


1.58 


1.56 


O 1 O f uo 


R15372 


Hs 22664 


ESTs 


1.00 


1.22 


319742 


T77668 


Hs 21162 


ESTs 


2.48 


3.13 


319748 


R18178 


He 995R66 
no.^yoooo 


Homo saDiens mRNA' cDNA DKFZd434N1923 (f 

nuniu oauicno iiirNinn, uunn t-utuTin i o&u 


3.02 


4.85 


*}1Q779 
o iy/ t & 


R7fifin 
f\l OOOO 


He: 99R4R 
rio.ttOHo 


ESTs 


4.36 


11.61 


oiy/oo 




Uq 117414 
no. t i / HI**- 


KIAA1390 nrntpin 


2.56 


3.68 


o i youo 


RQOQC7 

Kazoo » 


He 9713R0 


MLoIu nrthntr^n rsf mnncp nnlv/Hnm 
iiisciy uiuiuiuy oi niuuoc uoiyuuui 


4.63 


6.56 


11QR19 
o jyo \e. 


M74RR0 

IN f HOOU 


He 964330 
no.i&OHoou 


M.ar'ulQnhinnneinp amiHnhurfrnla^p /arid p 

N duy lofji in lyuaii ic di i nuui lyui uiaac ^auiu u 


0.63 


1.32 


319R34 
o i yoo*r 


AA0719R7 




nh'7mR1n01 M Rtratanpnp fihrnhlast /937 

uo*4*i 1 iu i yu i «i i uiiuiuyciic iiui uuicjol 


0.30 


0,94 


319R7R 
o I jo r u 


T78517 


Uc 1^941 

1 IO. 1 J3t 1 


ESTs 


3.99 


6^44 


o i yoo£ 


AA95RQR1 

rv-\<COOoO 1 




ESTs 


5.09 


7.36 


319912 


T77559 


no.on i uo 


Homo sapiens cDNA FLJ 13634 fis, clone PL 


3.24 


3^21 


31 9935 


H79460 


Hs 271722 


ESTs, Weakly similar to ALU1 HUMAN ALU S 


4.40 


9.42 


31 9944 


T79248 


He 133510 
no. i oo«j iu 


ESTs 


3.31 


5.39 


319947 


AA1 60967 


Hs 14479 

1 IO- 1 fc t*T# & 


Homo sapiens cDNA FLJ 141 99 fis, clone NT 


2.90 


4.95 


qiqqfi9 
o i yyoz 


HOR350 

nuoooo 


He 135056 
no. ioouoo 


Human PIMA epnitpnpp from rlnnp RP5-850F9 

nuuidii UI in OCUUUI IwC 11 UI 1 1 UIUI IO 1 M V UJUU^ 


1.81 


1.57 


•39nnn7 

0£UUUr 


AA33R314 

M/AOOOO 




ntvFST40943 Fnrinmptrial tnmnr Homo sanie 

uu.tu i ^uo*tu cuuuiiiouiaj iuiiiui 1 iuiiiw oapiv 


3.42 


6.29 


•jonnift 

dc\J\J t O 


TR19R3 




nh'uH40hOQ r1 ^narpt; fptal liupr enlppn 
yo.yunoiiuo.i I oodico iciai uvci opicoii 


2.77 


5.14 


o^uoou 


HR37RQ 
no j / oo 


Hs 296288 


ESTs Weaklv similar to KIAA0638 orotein 

LO 1 O] V¥wQJM¥ OlIlMlul WJ l\lnr\UUvU |JIULt>|IJ 


4.10 


6.69 




A1R9Q779 


l-le 9Q9RR4 
no.^y^oo'r 


F^Tq Weaklv similar to A46010 X-linked 

COIO| vvoaMy oil 1 nidi io/^*tuoiu /\ inii\cu 


3.27 


3.27 


O^UUHU 


AA933R71 


He R7164 
no. Or lot 


hunnthptiral nrntpin Fl .114001 

I ly^JOli ICUUal [JIUICIII IL.U l*tUU 1 


1.81 


1.64 


320047 


TRR564 
1 DOUGH 


He 309956 

no.OU££00 


EST 


3.38 


7.36 


o^uuqo 


AA07410R 

nnUi *r I OO 


He 190R44 

no. 1 £\JOHH 


FOX 19 fnrkhpad fapfor 
rvAJt iuiimicou iduiui 


5.90 


16.73 


o^uoyu 


H5R13R 

noo 1 OO 


He 117915 
no. i uoiu 


ESTs 


2.08 


4.47 


QonnQQ 

o^uuyy 


AW41 1 307 


He 114311 
noi i i*to i i 


f.nn45 rrpll Hiuielon rurlft 45 R pprpvfc 

UUOHJ ^UCII UIVIOIUII UyUlCJ HO, w.uCICVIo 


1.00 


1.00 


320112 


T92107 


Hs 188489 


ESTs 


2.27 


2.06 


320140 


H94179 


Hs 119023 


SM09 fstruotiiral maintenance of chromoso 


1.00 


1.00 


3901RR 

jcy i uo 


AW419200 


Hs 172318 

no. 1 1 £.<j io 


ESTs 


1.26 


1.00 


320193 

0£U 1 so 


AA831259 


Hs 17132 

1 IO« 1 * 1 \j£m 


ESTs 


2.58 


6.23 


320195 


R62203 


Hs 24321 


Homo sanlene cDNA FLJ 12028 fis clone HE 

I iuiiio odpiwlio wuiin i i— u ilv^u no, utuiiw i 1 1— 


2.85 


4.53 


320199 


R78659 


Hs 29792 


ESTs 


0.40 


0.94 


390903 


AL049927 


Hs 124776 

1 IOi 1 «£" / I w 


Hnmo eaDipne mRNA 1 cDNA DKFZd564N1 1 16 (1 

1 IUIIIU OdpiCIIO llll\i^i/i, uuiin i^/ixi i—yj\j\j~tiv i i iu \i 


0.84 


1.18 


320219 


A A 397564 


He 197011 
no. it/vi i 


hjhiilninfpr<;fiHal npnhrifie antinpn 


1.00 


1.17 


320220 


AF054910 


Hs.127111 


tektin 2 (testicular) 


0.18 


1.09 


320225 


AcncoqQq 
Mruooyoy 


He 198931 

no. 1 £0£0 1 


R antinpn familv R 1 /nrn^talp ae^onia 

vjdiiuycii, idililiy d, i yuiuoidic ooouuia 


5.26 


13.75 


320231 


H031 3Q 
nuo i oy 


He 94RR3 


ESTs 


1.59 


1.93 


3909R0 


NM n03fif)R 

INIVI UuOOUO 


Hs 131924 

no. i \j i o*.*t 


d nrofpin-fniinlpd rp^pntor 65 


1.38 


4.56 


OlUZOf 


rtLunyoo/ 


He 139571 
no, lO^o/ I 


Hnmo eanf'pne mRWA' rDWA DKF7n5fi4P01fi (fr 


1.00 


1.92 


3909RR 


nuuu i 3 


Hs 151293 


Hnmo eaniane rDNA FLJ 10664 fis clone NT 

riUJIIU OdplCIIO UL/lirA 1 Lu IUUU^t HO, UIUII^ Ii 1 


5.58 


5.70 


320322 


ART77T74 

MrU/ I Of *t 


He 13Q399 
no. i oy 0£.£ 


pmall nrolinp-rirh nrn{pin 3 

Ol 1 1 all UIUIIIIO HUH pi U IOI II o 


1.41 


1.01 


^90395 


AMR797R 
r\t i o / y / o 


He 139R51 
no. i oouo i 


powpnlin 9 
udvcoiiii c 


0.05 


0.67 


OtUOOU 


AFfl9fifin4 


He 141660 
no. i*t ioou 


rhlorldp rhannpl 9 
uiiiunuc uiidiiuci c 


2.17 


1.26 


320339 


H 10807 


Hs.281434 


Homo sapiens cDNA FLJ 14028 fis, clone HE 


1.81 


£32 


320388 


H16065 


• Hs.31286 


ESTs 


1.00 


3.22 


320402 


R22291 


Hs.23368 


Homo sapiens clone FLC0578 PR02852 mRNA, 


1.41 


1.36 


320413 


AA203711 


Hs.173269 


ESTs 


2.31 


3.61 


320432 


R62786 


Hs.124136 


ESTs 


11.25 


20.78 


320436 


AA253352 


Hs.293663 


ESTs 


2.22 


3.49 


320438 


W24548 


Hs.5669 


ESTs 


3.53 


8.14 



134 



WO 02/086443 



PCT/US02/12476 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



80 



85 



320448 
320451 
320484 
320499 
320514 
320521 
320526 
320527 
320536 
320556 
320564 
320587 
320635 
320639 
320648 
320651 
320664 
320676 
320683 
320689 
320696 
320714 
320727 
320771 
320794 
320822 
320824 
320830 
320843 
320849 
320853 
320896 
320921 
320927 
320957 
320997 
321045 
321046 
321052 
321059 
321062 
321067 
321102 
321130 
321142 
321155 
321158 
321170 
321199 
321206 
321225 
321236 
321244 
321270 
321317 
321318 
321325 
321342 
321356 
321418 
321420 
321430 
321453 
321467 
321468 
321491 
321498 
321504 
321510 
321513 
321516 
321565 
321577 
321581 
321582 
321587 
321626 
321628 
321642 
321669 
321687 
321688 
321693 
321700 
321701 



A1240233 

R26944 

AA094436 

R32555 

AB007978 

N31464 

AW374205 

R34672 

AA331732 

AF054177 

AF056209 

244524 

R54159 

M243258 

N48521 

AA489268 

AI904216 

AA1 32650 

R59291 

AA334609 

AW135016 

AI445591 

U96044 

A1793266 

AA281993 

AF100780 

AF120274 

AJ132445 

AA317372 

D60031 

A1473796 

AB002155 

R94038 " 

AI205786 

A1878933 

H22544 

W88483 

H27794 

AW372884 

A1092824 

R87955 

AF131782 

AA018306 

H43750 

AI817933 

AA336635 

AA700289 

N53742 

AW385512 

H54178 

AL080073 

AW371941 

AF068654 

R83560 

AI937060 

AB033041 

AB033100 

AA1 27984 

R93443 

AI739161 

AI368667 

U05890 

N50080 

X13075 

M514198 

H70665 

AW295517 

W02356 

AA703650 

H84972 

AI382803 

A1525773 

H84260 

AA019964 

M1 43755 

H95531 

AA295430 

H87064 

AW085917 

H95404 

AA625149 

H97646 

AA700017 

N55160 

AW390923 



Hs.80887 

Hs.180777 

Hs.296267 

Hs.24321 

Hs.158278 

Hs.24743 

Hs.111314 

Hs.324522 

Hs.1 37224 

Hs.14570 

Hs.1 59396 

Hs.1 67456 

Hs.80506 

Hs.7395 

Hs.26549 

Hs.1 11334 

Hs.91251 

Hs.300511 

Hs.26638 

Hs.171929 

Hs.1 72780 

Hs.181125 

Hs.1 17176 

Hs.91226 

Hs.1 94679 

Hs.1 94689 

Hs.266416 

Hs.34744 

Hs.34771 

Hs.135904 

Hs.271580 

Hs.199538 

Hs.213923 

Hs.92023 

Hs.293650 
Hs.269055 
Hs.240770 
Hs.126465 
Hs.241411 
Hs.241438 

Hs.125494 
Hs.298351 
Hs.99598 

Hs.1 72982 

Hs.226469 
Hs.251414 
Hs.18192 



Hs.6298 

Hs.137507 

Hs.300646 

Hs.222024 

Hs.271770 

Hs.161075 

Hs.132743 

Hs.82845 

Hs.38540 

Hs.292549 

Hs.255436 

Hs.268980 

Hs.255748 

Hs.108551 

Hs.159235 

Hs.266514 

Hs.28803 
Hs.21858 

Hs.96322 
Hs.161051 
Hs.247084 
Hs.294110 

Hs.123158 
Hs.1 73737 
Hs. 167260 
Hs.42568 



v-yes-1 Yamaguchi sarcoma viral related 
Homo sapiens mRNA; cDNA DKFZp564M0264 (f 
follistatin-like 1 

Homo sapiens cDNA FLJ 12028 fis, clone HE 

KIAA0509 protein 

hypothetical protein FLJ20171 

ESTs 

ESTs 

ESTs 

hypothetical protein FLJ22530 

peptidylglycine alpha-amidating monooxyg 

Homo sapiens mRNA full length insert cDN 

small nuclear ribonucleoprotein polypept 

hypothetical protein FLJ23182 

Homo sapiens mRNA for KIAA1708 protein, 

ferritin, light polypeptide 

hypothetical protein FLJ11198 

ESTs 

ESTs, Weakly similar to unnamed protein 
ESTs, Weakly similar to A54849 collagen 
ESTs 

gb:yq04a10.r1 Soares fetal liver spleen 
immunoglobulin lambda locus 
poly{A)-binding protein, nuclear 1 
ESTs 

WNT1 inducible signaling pathway protein 

artemin 

claudin 14 

Homo sapiens mRNA; cDNA DKFZp547C136 (fr 

ESTs 

ESTs 

uropiakin 1B 
inhibin, beta C 
ESTs 

core histone macroH2A2,2 
gb:yn69f11.r1 Soares adult brain N2b5HB5 
ESTs 
ESTs 

nuclear cap binding protein subunit 2, 2 
ESTs 

Homo sapiens mRNA full length insert cDN 
Homo sapiens clone 24941 mRNA sequence 
gb:ze40d08.r1 Soares retina N2b4HR Homo 
ESTs 

ASPL protein 

hypothetical protein MGC5338 
gb:yu76f1 1 ,r1 Soares fetal liver spleen 
ESTs 

gb:yy56d10.s1 Soares_multiple_sclerosis_ 
Homo sapiens cDNA FLJ 1241 7 fis, clone MA 
Homo sapiens mRNA; cDNA DKFZp564B1462 {f 
Ser/Arg-related nuclear matrix protein ( 
gb;Homo sapiens isolate AN.1 immunoglobu 
gb:yv76c06.s1 Soares fetal liver spleen 
KIAA1151 protein 
KIAA1215 protein 

KIAA protein (similar to mouse paladin) 

transcription factor BMAL2 

ESTs 

ESTs 

ESTs 

gb:H.sapiens (D1G3) mRNA for immunoglobu 

Homo sapiens cDNA: FLJ21930 fis, clone H 

gb:Human 2a12 mRNA for kappa-immunoglobu 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

hypothetical protein FLJ 11 342 
gb:ys90g04.r1 Soares retina N2b5HR Homo 
ESTs 

trinucleotide repeat containing 3 

gb:ys76e02.r1 Soares retina N2b4HR Homo 

hypothetical protein FLJ23560 

ESTs, Moderately similar to ALU6_HUMAN A 

ESTs 

ESTs 

gb:af70c12.r1 Soares_NhHMPu_S1 Homosapi 
Homo sapiens cDNA FLJ 12830 fis, clone NT 
ras-related C3 botulinum toxin substrate 
ESTs 
ESTs 



1.42 


3.46 


0.87 


0.81 


0.65 


1.18 


3.44 


7.15 


6.44 


13.62 


1.48 


1.04 


3.66 


7.87 


3.16 


5.63 


2.83 


5.83 


128 


1.00 


1.22 


0.81 


1.84 


2.44 


1.00 


6.25 


2.60 


2.30 


1.00 


1.53 


0.14 


0.79 


5.02 


8.84 


3.63 


5.37 


0.37 


1.31 


1.27 


1.02 


3.53 


4.60 


1.06 


0,85 


1.35 


1.49 


0.04 


0.82 


2,96 


4.33 


0.10 


0.79 


1.16 


1.11 


1.06 


1.75 


1.36 


1.47 


5.30 


7.49 


1.00 


1.00 


5.90 


2.55 


2.20 


1.17 


0.18 


1.46 


1.67 


2.18 


3.26 


3.62 


2.25 


4.55 


2.69 


4.25 


2M 


2.56 


1.69 


0.53 


2.76 


5.20 


4.79 


7.41 


1.79 


4.27 


1.00 


3.14 


8.73 


15.36 


3.04 


5.03 


4.62 


8.39 


2.21 


4,46 


5.69 


8.01 


4.00 


7.32 


4.17 


4,63 


1,00 


1.00 


2.18 


9.13 


3.80 


5,26 


1.81 


1,65 


1.00 


1.00 


0.44 


0,93 


4.94 


4.93 


3.10 


4.66 


2.28 


2.54 


1.13 


0.97 


2.42 


3.35 


1.60 


3.11 


0.42 


0.72 


2.46 


6.50 


1.00 


125 


3.19 


6.24 


2.28 


3.86 


2.14 


3.94 


2.78 


5.37 


3.06 


7.19 


4.89 


7.82 


1.00 


1.73 


4.88 


6.73 


1.00 


2.08 


2.26 


4.52 


1.95 


3.83 


0.47 


102 


1.52 


1.38 


2.17 


2.45 


4.31 


6.95 


2.82 


3.28 


0.51 


108 


4.57 


7.46 


1.00 


100 



135 



WO 02/086443 



PCT/US02/12476 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



321709 
321710 
321775 
321777 
321779 
321829 
321846 
321879 
321883 
321899 
321911 
321949 
321955 
321956 
321987 
321991 
322002 
322035 
322044 
322057 
322060 
322070 
322083 
322091 
322125 
322130 
322147 
322166 
322173 
322178 
322179 
322186 
322196 
322212 
322221 
322277 
322278 
322284 
322288 
322320 
322336 
322339 
322366 
322372 
322374 
322378 
322388 
322416 
322419 
322425 
322431 
322450 
322465 
322467 
322473 
322509 
322523 
322527 
322560 
322566 
322585 
322635 
322641 
322653 
322664 
322687 
322692 
322694 
322708 
322712 
322766 
322770 
322794 
322810 
322818 
322820 
322872 
322882 
322887 
322913 
322926 
322929 
322968 
322971 
322981 



N25847 

N35682 

AI694875 

AI637993 

N42729 

D81993 

AA281594 

AL1 09670 

AA426494 

N55158 

AF026944 

R49202 

AI651866 

AL1 10177 

AL133612 

AL1 33627 

AA328801 

AL137517 

AW340926 

N92197 

AI341937 

U80769 

AF074982 

AI819863 

R93901 

R98978 

AF085919 

AF085958 

H52567 

H56535 

H92891 

H67346 

W87895 

AF087995 

AI890619 

AI640193 

AF086283 

AI792140 

AL037273 

AF086419 

AA308526 

W17348 

AW404274 

W25624 

AI394663 

AF064819 

AI815730 

AA223183 

AA248987 

W37943 

M069222 

AA040131 

AA137152 

AF1 16826 

AA744286 

T52172 

W80398 

AF147359 

AI916847 

W87285 

AA837622 

AA679084 

AA007352 

AI828854 

AA011522 

AI110759 

AA018117 

AH 10872 

AF1 13674 

AA021328 

AW068805 

AA045796 

A1608591 

A1962276 

AW043782 

AI377755 

AA827228 

AW248508 

AI986306 

AI733737 

AI825940 

AI365585 

AI905228 

C15953 

AA493252 



Hs.108923 

Hs.259743 

Hs.202312 

Hs.202312 

Hs.163835 

Hs.8966 

Hs.87902 

Hs.302809 

Hs.46901 

Hs.29468 

Hs.293797 

Hs.181694 

Hs.1 95689 

Hs.1 32882 

Hs.272759 

Hs.158923 

Hs.84522 

Hs,306201 

Hs.154679 

Hs.210322 
Hs.226031 
Hs.106243 

Hs.1 17767 
Hs.1 14176 



Hs.269187 
Hs.211516 
Hs.134877 
Hs.1 79662 
Hs.226389 

Hs.49265 
Hs.7886 

Hs.76152 

Hs.122492 

Hs.153943 

Hs.1 221 16 

Hs.201877 

Hs.247474 

Hs.298442 

Hs.14084 

Hs.34892 

Hs.141892 

Hs.25144 

Hs.286049 

Hs.180340 

Hs.266935 

Hs.302213 

Hs.1 931 97 

Hs.270947 
Hs.269587 



Hs.256042 
Hs.258538 



Hs.60843 

Hs.279812 

Hs.283773 

Hs.23607 

Hs.288467 

Hs.122682 

Hs.38991 

Hs.127444 

Hs.293616 

Hs.1 20695 

Hs.126943 

Hs.279727 

Hs.86149 

Hs.68837 

Hs.211192 

Hs.146246 

Hs.83484 

Hs.212760 

Hs.159577 



RAB38, member RAS oncogene family 
ESTs 

Homo sapiens clone N11 NTera2D1 teratoca 
Homo sapiens clone N11 NTera2D1 teratoca 
ESTs 

tumor endothelial marker 8 

ESTs 

ESTs 

KIAA1462 protein 

ESTs 

ESTs 

EST 

ESTs 

ESTs 

KIAA1457 protein 

Homo sapiens mRNA; cDNA DKFZp434K0722 (f 
ESTs 

hypothetical protein DKFZp56401278 
gb:xy5lb10.x1 NGI_CGAP_Lu34.1 Homosapie 
synaptotagmin 1 

gb:qt10e03.x1 NCI_CGAP_GC4 Homo sapiens 
Homo sapiens mRNA for KIM1766 protein, 
ESTs, Highly similar to KIAA0535 protein 
ESTs 

gb:yq16c12.r1 Soares fetal liver spleen 

ESTs 

ESTs 

gb:yr88b03.r1 Soares fetal liver spleen 

gb:yt85d04.r1 Soares_pineaLgland_N3HPG 

gb:yt88g03.r1 Soares_pineal_gland_N3HPG 

gb:yt94c02.s1 Soares_pineaLgland_N3HPG 

ESTs 

ESTs 

ESTs 

nucleosome assembly protein 1-)ike 1 
ESTs 

gb:zd46f01.r1 Soares Jetal_heart_NbHH19W 
ESTs 

pellino (Drosophila) homolog 1 
gb:zd78d03,r1 Soares_fetal_heart_NbHH19W 
decorin 

gb:zb18c07.x5 Soares_fetalJung_NbHL19vV 

hypothetical protein 

ESTs 

ESTs, Moderately similar to Osf2 [M.musc 
DESC1 protein 

hypothetical protein FLJ21032 

adaptor-related protein complex 3, mu 1 

ring finger protein 7 

KIAA1323 protein 

ESTs 

ESTs 

phosphoserine aminotransferase 

putative protein-tyrosine kinase 

tRNA selenocysteine associated protein 

ESTs 

ESTs 

gb:Homo sapiens full (ength insert cDNA 

ESTs 

ESTs 

gb:zh69c01 .r1 Soares Jeta!Jiver_spleen_ 
gb:zh90h08.r1 Soares_feta!Jiver_spleen_ 
ESTs 

striatin, calmodulin-binding protein 

gb:zi03g07.r1 Soares_fetal_liver_spleen_ 

gb:AF074666 Human fetal liver cDNA libra 

potassium voltage-gated channel, shaker- 

PRO0327 protein 

clone FLB1727 

hypothetical protein FLJ11109 

Homo sapiens cDNA FLJ 12280 fis, clone MA 

ESTs 

S100 calcium-binding protein A2 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens cDNA FLJ 14035 fis, clone HE 

phosphoinositol 3-phosphate-binding prot 

ESTs 

ESTs 

ESTs 

SRY (sex determining region Y)-box 4 
hypothetical protein FU 13649 
ESTs 



1 nn 
j.uu 


1 00 


9 Q7 


5 9fi 


i fin 

l.UU 


1 nn 

l.UU 


1 fiR 
I .DO 


n 45 


n on 
u.yu 


0.90 


z.oy 


j.oy 


1 11 

0. 1 1 


7 g4 


6.49 


9.58 


0.28 


0.95 


0.39 


0.95 


6.20 


10.76 


4.62 


10.51 


2.89 


5.47 


0.32 


1.25 


1.00 


1.83 


4.00 


6.47 


2.10 


3.48 


1.00 


1.90 


3.20 


9.67 


1 15 


1 n7 

I.Uf 


4.59 


7.68 


9 7fl 


4.52 


0. IU 


1 59 


1 5Q 
i.oy 


1.75 


2,06 


5.27 


10.12 


16.49 


n Q4 


0.64 


A (\Q 


fi fi7 

O.Uf 


1 4fi 


4.85 


fi AA 


9 54 


A 59 


7.50 


n 15 

U. IO 


0.98 




5.04 


3.42 


4.84 


n R9 


2.14 


3 62 


3.98 


1.00 


1.00 


n fifi 

U.OO 


2.76 


n 71 

U.J I 


0.70 


2.02 


2.76 


9 Q9 


4.44 


ft 50. 

O.OU 


11.56 


n ri 

U.O l 


1.34 


7.37 


12.07 


4.78 


10.50 


1.00 


1.00 


7.09 


8.49 


3.20 


5.80 


1.64 


1.57 


0.83 


1.00 


3.96 


5.22 


5.18 


12.67 


^ 41 


9 9^ 
ceo 


1 nn 


1 10 

I.OU 


1 71 
l.f O 


9 oi 
z.uo 


1 nn 

l.UU 


9 97 


9 75 


5.49 


1 91 


1 97 


4.57 


8.81 


1.00 


1.42 


4.18 


6.94 


2.40 


4.85 


2.94 


4.64 


0.48 


0.38 


1 Q9 


2.18 


A 14 


6.75 


i in 

o.ou 


5.00 


1 nn 

i.OU 


1 79 

l.f £. 


1 nn 

l.UU 


3.43 


*3 9ft 


3.86 


1.63 


1.53 


1 51 

1. 00 


1 OR 

I.UD 


19 nfi 


1 Q4 


a no 


fi QO 


1 70 
i.iCU 


1.63 


n 91 

U.£ I 


1 cn 


9 OA 


1.63 


5.26 


1.22 


2.80 


2.24 


2^38 


&61 


4.02 


5.79 


0.30 


1.14 


2.06 


1.13 


1.18 


2.00 


2.28 


2.61 



136 



WO 02/086443 

322988 C18727 Hs.171941 ESTs 0.39 
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TABLE 8B shows the accession numbers for those Pkeys in Table 8A lacking unigenelD's. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
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"Accession" column. 



Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey 



CAT number Accessions 



144 



WO 02/086443 



PCT/US02/12476 



324580 
301882 
324804 
324889 
302697 
302711 
302742 
318499 
310624 
302847 
304122 
303598 
311409 
312094 
319312 
319407 
319425 
320007 
320018 
319484 
318865 
312220 
319546 
312389 
319611 
312437 



311896 
319834 
321102 
321158 
321199 

305528 
321270 
314126 
320714 
306442 
306446 
306458 
306510 
306557 
306572 
306582 
306656 
306686 
306751 
308011 
306892 
308106 
308154 
306956 
306958 
308213 
308216 
308219 
308588 
308599 
308643 
308673 
308697 
308778 
308808 
308875 
308886 
308898 
308966 
308979 
303011 
303077 
305016 
305034 
305072 
305148 
305190 
303978 
303990 
303998 
303999 
305235 
305312 
305413 
305447 
321244 



328264J 

275087J 

398093J 

1515978.1 

4321 9_1 

4541 9_1 

458_39 

364430J 

34624_4 

458.105 

77271.-5 

270283.1 

837264.1 

797889.1 

1540116.1 

1688823 1 

1689571.1 

229683.1 

1815987.1 

1691553.1 

1535937.1 

1671 607 J 

243305.1 

902067.1 

1566863.1 

291472.1 



579192.1 
11 2523 J 
80531.1 
410938.1 
212379.1 

28832.-3 

1662057.1 

177666.1 

743644.1 

AA976899 

M977348 

AA978186 

AA988546 

AA994530 

AA995686 

AA996248 

AI004024 

AI015615 

AI032589 

A1439473 

A1092465 

AI476803 

AI500600 

AI125111 

A1125152 

AI557041 

AI557135 

AI557246 

AI718299 

A1719893 

A1745040 

AI760864 

A1767143 

A1811109 

AI818289 

A1832332 

AI833240 

AI858845 

A1870704 

AI873111 

41689.1 

44060J 

M626876 

AA630128 

M641012 

AA654070 

AA665955 

AW513315 

AW515465 

AW516449 

AW516611 

AA670480 

AA700201 

M724659 

M737856 

29327.1 



AA492588 AA492498 AA492571 
T78054T79888M398185 

AI692552 AI393343 A1800510 AI37771 1 F24263 M661876 

D31010D30991 D31168 D31166 D31465 

AJ001409 AJ001410 

L08442D51348 

L12061 

T25451 AA585296AA585305 

U88896 U88898 AA916056 T03285 AI341594 AI359534 AI634031 U88897 

X98941 X98942 X98943 X98953 X98949 

H28966 

M382814 AA402411 AA412355 
AI698839 AI909260 AI909259 
Z78390T97427 
Z45481 F12393T74437 
R05329 R01555 R08276 
T82930 R02424T85145 

M336314 T82938 AA327744 AW967388 AA639967 T10753 

T83263T85731 T85730 

T91772 R07257 R07098 

H10818F07831 Z43072 

N74613T98756T98589 

R09692 R09414AA346353 

AI863140W80703 R43474 

H14957 R56522 R11908 

BE080180 AW827313 AW231970 AA995028 M428584 AW872716 AW892508 AW854593 AA578441 AW975234 AA664937 AA984131 
AA528743 AA552874 AA564758 AW063245 AI267534 AW070190 AW893483 AA770330 AA906928 AA906582 AA758746 AA551717 
AW06331 1 AA429538 

AW206447 AI248530 AI084433 AI400976 R16553 
AA071 267 T65940 T6451 5 AA071 334 
AA01 8306 H38925M001 221 
H79670 H47798AA700289 

N34524 AA305071 AW954803 M502335 AI433430 AI203597 AW026670 AW265323 AW850787 AA317554 AW993643 AW835572 

AW385512AI334966 W32951 H62656 H53902 R88904 AW835732 

AA769156 

N59537 N78278R83560 
AA226431 AA226569 AA488748 
R91883AI445591 



AF090405 AF090407 AF090406 
AF163305AF163307AF163303 



AF068654 AF068656 AF068655 



145 



WO 02/086443 



305614 


AA782866 


305637 


AA806124 


305639 


AA806138 


305650 


AA807709 


305690 


AA813477 


305728 


AA828209 


305759 


AA835353 


305792 


AA845256 


307041 


AI144243 


307091 


AI167439 


307181 


A1189251 


305901 


M872968 


305910 


AA875981 


307415 


AI242118 


307426 


AI243364 


307517 


A1275055 


307551 


AI281556 


307561 


AI282207 


307608 


AI290295 


307691 


AI318285 


307730 


AI336092 


307760 


AI342387 


307764 


AI342731 


307796 


AI350556 


309045 


AI910902 


309051 


AI911975 


307807 


AI351799 


307808 


AI351826 


307820 


AI355761 


307852 


AI365541 


309122 


AI928178 


309164 


AI937761 


309177 


AI951118 


307902 


AI380462 


309299 


AW003478 


309303 


AW004823 


309476 


AW129368 


309532 


AW151119 


309747 


AW264889 


309769 


AW272346 


309799 


AW276964 


309866 


AW299916 


302679 


31 1853 J H65022M1 86889 


309923 


AW340684 


309928 


AW341418 


309931 


AW341683 


309933 


AW341936 


302705 


31765 1 U09060U09061 


302789 


34161 1 AJ245067AJ245070 


304006 


AW517947 


304024 


T03036 


304026 


T03160 


304028 


T03266 


304046 


T54803 


304061 


T61521 


304063 


T62536 


302802 


34487 1 Y08250Y08245 


304114 


R78946 


304155 


H68696 


304203 


N56929 


304234 


W81608 


304348 


AA179868 


304430 


AA347682 


304456 


AA411240 


304521 


AA464716 


304526 


AA476427 


304607 


AA513322 


304735 


AA576453 


304760 


AA580401 


306015 


AA897116 


306063 


AA906316 


306065 


AA906725 


306104 


AA910956 


306109 


AA911861 


306242 


AA932805 


306288 


AA936900 


306396 


AA970223 


330568 


NOT.FOUND entrez U56244 


330599 


15323.-12 U90437 


331131 


genbank_R54797 R54797 


331203 


NOT.FOUND.entrez T82310 


331531 


genbank_N51343 N51343 


331547 


467396.1 AA828597N54811 


332074 


genbank_AA5990l2 AA599012 



PCT/US02/12476 



146 



WO 02/086443 



PCT/US02/12476 



TABLE 8C shows the genomic position for those Pkeys in Table 8A lacking unigene ID's and accession numbers. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed. 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication entitled The DNA 

sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 
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. etat. 
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333512 


Dunham 


. et.al 


Plus 
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333524 


Dunham 
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. et.al. 


Plus 
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3335R5 


Dunham 

UUIil Idi 1 1, 


. et.al. 


Plus 


6234778-6234R94 


333R18 


Dunham 


. eta). 


Plus 


6562391-6562566 


333R97 


Dunham 
UUl II Idi 1 1, 


. et.al. 
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UUtUUOH UUt-UaUO 


333R2R 


Dunham 
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. etal. 


Plus 


6629004-6629233 




Dunhsrn 
UUl II lal II, 


et.al. 
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Dunham 
uui ii lai 1 1, 


et.al. 


Plus 
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1 UUO££t}"l UUU4UO 


333750 

uudl UU 


Dunham 
uui ii lain, 


. et.al. 


Plus 


7608165-7608234 


3337R3 

UUU/ Du 


Dunham 
uui nidil I, 


. etat. 


Plus 


7R92491-7R92R3fl 
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3337R7 


Dunham 

UUIIUdlll, 


. et.at. 


Pius 


7fiq44n7-7fiQ4R23 
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3337RR 

vJJJ/ uu 


Dunham 

UUI II lal II, 


. et.al. 


Plus 


7R95440-7R95RQ7 
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3337R9 

UUU ( uu 


Dunham 
uui ii lain, 


. et.al. 


Plus 


7fiQfifi?5-7fiQfi707 

/ UC/UU£U r U JU f vf 


333772 


Dunham 

UUI U lal 1 1, 


. etal. 


Plus 


7706773-7706902 


333777 


Dunham 
uui ii idi 1 1, 


. et.al. 


Plus 


7746805-7746916 


333846 


Dunham 
uui ii tai ii, 


. et.al. 


Plus 


8008623-8008757 


333884 


Dtinham 

UUl 11 lal 1 1, 


. etal. 


Plus 


8153960-8154161 


333RR7 


Dunham 

UUlllldll 1) 


. et.al. 


Plus 


8154882-8155025 


33330,1 
UUUOc? f 


Ditnham 
UUI Midi II, 


et.al. 


Plus 


R15R437-R15R7fiQ 

O 1 JUtJf U 1 Jul 


333RQ9 


Dunham 
uui ii idi 1 1, 


, et.al. 


Plus 


R1 5RR25-R1 57001 

U 1 JUu£J-U 1 Ul UU 1 


333043 


Dunham 
UUI llldll 1, 


et al 


Plus 


R5R34Q7-R5R3R97 


333Q54 


Dunham 
uutiiidiii. 


. et.al. 


Plus 


R5R31flfi-R5R3335 

UJUw IOU 'UJUwJtM 


333QRR 


Dunham 
uui ii lal It, 


. et.al. 


Plus 


RR55R43-RR55R2R 


333QRR 

OJJ3UU 


Dunham 
uui iiiaii I) 


etal. 


Plus 


RfiRin04-RRR1241 


3340S1 


Dunham 
uutiiidiiif 


. et.al. 


Plus 


9686941-9687077 


334094 


Dunham 
uui ii idi 1 1, 


. et.al. 


Plus 


9889953-9890105 


334113 


Dunham 
uui nidil i, 


. et.al. 


Plus 


10282459-1 02R2597 


334161 


Dunham 
uuiuidiiij 


. et.al. 


Plus 


10599033-10599180 


33491 Q 


Di tnham 
UUI Midi II, 


et.al. 


Plus 


1271R1R0-1971R3R4 

\Cl 1 U 1 UU Itl lUvJOH 


334939 


Dunham 
uui II lai 1 1, 


etal. 


Plus 


1 305R5R9-1 305RR93 

1 UUUU UUC7- 1 yUJUUJO 


334333 


Dunham 

UUI itldll 1, 


. et.al. 


Plus 


13603544-13603657 

IwUUOJtT IOUUUUOI 


33437R 


Dunham 
uutiiidiii, 


, et.al. 


Plus 


13907239-13907370 


334382 


Dunham 


. eta). 


Plus 


13915866-13916036 


334562 


Dunham ! 

UUI 11 lal 1 1| 


. etal. 


Plus 


14QR7R47-1 49R7Q40 
i*ti7u i otf i tao i otu 


334588 


Dunham 

UUI II lal 1 1, 


. et.al. 


Plus 


15032740-15032817 


114R1R 


Uui ii (dill, i 


eta). 


Plus 


1R17fi123-1 f il7fi47n 
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334R33 


Dunham 1 
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et.al. 


Plus 
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334RRR 


Dunham 1 
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eta!. 


Plus 
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334891 


Dtinham i 
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etal. 


Plus 


1 9299770-1 9299944 


334Q34 


Dunham 1 

UUI II Idl 1 1, 1 


. et.al. 


Plus 


20103970-20104058 


335015 

OJJu 1 U 


Dunham 

uutiiidiii, i 


. etal! 


Plus 


20682792-20682945 


335120 

UUU \£XI 


Dunham 1 
uui II lal ii, i 


etal. 


Plus 


21436286-21 436384 

i lfuU£UU"£ l*TUUuU*r 


335125 


Dunham 1 

UUllllalll, l 


et.al. 


Plus 


21441390-21441471 

fclt*rlu3U"tlTtl*tl 1 


335179 


Dunham, 1 


. etal! 


Plus 


21634405-21634526 


335188 


Dunham, 1 


et.al. 


Plus 


21669118-21669328 


335211 


Dunham, 1 


etal. 


Plus 


21774611-21774680 


335361 


Dunham, 1 


etal. 


Plus 


22807292-22807445 


335379 


Dunham, 1 


etal. 


Plus 


22899306-22899420 


335414 


Dunham, 1 


etal. 


Plus 


23235546-23235684 


335416 


Dunham, 1 


et.al. 


Plus 


23237354-23237465 


335496 


Dunham, 1 


etal. 


Plus 


24164386-24164545 


335497 


Dunham, 1 


etal. 


Plus 


24167666-24167869 


335558 


Dunham, 1 


etal. 


Plus 


24740167-24740347 


335586 


Dunham, 1 


etal. 


Plus 


24990333-24990497 


335686 


Dunham, 1 


etal. 


Plus 


25439839-25439920 


335784 


Dunham, 1 


etal. 


Pius 


25942710-25942792 


335823 


Dunham, 1 


etal. 


Plus 
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335983 


Dunham, 1 


etal. 


Plus 


27938968-27939070 


335995 


Dunham, 1 


etal. 


Plus 


28009044-28009184 


336021 


Dunham, 1 


etal. 


Plus 
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OO0U04 


Di inham 


pt at 
. cl.dl. 


Plus 


29014404-29014590 


perron 


n 1 inham 


pt a) 
. cl.dl. 


Plus 


29022963-29023165 


OOD lUf 


UUlllldll), 


pt al 
. cl.dl. 


Plus 


29987731-29987869 


OOOOO*- 


r^i inh^m 

uuniidiu, 


p| al 
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983890-985529 


OOQQOO 


riunham 
UUI llldlll, 
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Df inham 


etal. 
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986296-986670 
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[^imham 
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Plus 
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336636 


Dunham 
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988418-989185 


336637 
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. et.al. 
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989276-990813 


336638 


Dunham 
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. et.al. 


Plus 


991906-993240 




Dunham 

UUI 111 dl II, 


. et.al. 


Plus 


1896402-1896478 


33RRQ4 


Dunham 
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. et.al. 


Plus 


2420546-2420616 


336721 


Dunham 

UUI II IOI II) 


. et.al. 


Plus 


3371522-3371586 


336Qnn 

oooyuu 


Dunham 

UUlllldll 1, 


. et.al. 


Plus 


10236423-10236523 


33RQ4R 


Dunham 
uui 11 lai 1 1, 


. et.al. 


Plus 


12692290-12692381 


33709R 


Dunham 
uuiii lain. 


. et.al. 


Plus 


16644817-16644942 


OO/Vu't 


Di inham 


et.al. 
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17821742-17821922 
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UUlllldll 1, 


etal. 
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23478943-23479145 


OO/ I 00 


nhinham 
UUIllldlll, 


et.al. 


Plus 


23943606-23943696 


3371 R4 

OO/ I 04 


Di inham 
UUIII Idl 1 1, 


et.al. 


Plus 


23973949-23974016 


OOf £UO 


Dunham 

UUI llldl II, 


. et.al. 


Plus 


28011979-28012034 


3379QQ 

OO/ £39 


Dunham 
uuiiiiciii i. 


. etal. 


Plus 


29022656-29022775 


3373RQ 

OO/ OOcJ 


Dunham 
UUIllldlll, 


et.al. 


Plus 


31401509-31401579 


3374Q3 

Jul *tOO 


Dunham 

UUI IUC1I II, 


. et.al. 


Plus 


33330760-33330981 


337549 


Dunham 

UUIII ICtl 1 lj 


. et.al. 


Plus 


34474472-34474531 


337755 

OO/ ' OO 


Dunham 

UUIII IOI 1 1, 


. et.al. 


Plus 


3971764-3971900 


337fiflQ 


Di inham 

UUI IUCII Uf 


. et.al. 


Plus 


4449069-4449193 


337R71 

00/0/ I 


Dunham 
UUlllldll), 


Pt at 
. cl.dl. 


Plus 


5443027-5443101 


337QSR 

OO/ 300 


Dunham 
UUIllldlll, 


. et.al. 


Plus 


6969162-6969270 


33RnnR 

OOOUUU 


Dunham 
uui nidi 1 1, 


. et.al. 


Plus 


7697068-7697236 


ooouoo 


UUIllldlll, 


. et al. 


Plus 


8092128-8092271 


ooO I IU 


UUlllldll 1, 


at a) 
. cl.dl. 


Plus 


10384481-10384621 


OOO I 1 t 


UUIllldlll, 


et.al. 


Plus 


10391398-10391600 


?3R145 

000 1 no 


Hnnham 
UUIllldlll, 


Pt al 
. cl.dl. 


Plus 


11386629-11386692 


33R14R 
OOO i HO 


Himham 
UUIllldlll, 


. etal. 


Plus 


11448985-11449085 


338179 


rinnham 
UUIllldlll, 


eta). 


PJus 


12808775-12808833 


33R1Q7 

OOO I %JI 


Dunham 
UUI iiidii 1, 


. et.al. 


Plus 


13638107-13638181 


33R97Q 


Di inham 
uuuiidi ii. 


. et.al. 


Plus 


16168944-16169091 


338316 

OOOO 1 O 


Dunham 
uuiiiiaiit, 


. etal. 


Plus 


17089711-17089988 


338322 


Dunham 
UUlllldll 1, 


. etal. 


Plus 


17132477-17132547 


338357 


Dunham 

UUIII lain. 


. et.al. 


Plus 


18062184-18062402 


338359 


Dunham 

uui 11 1 cai i i. 


. et.al. 


Plus 


18074402-18074501 


33R36R 

\J000Q\J 


Dunham 

UUI It lai 1 1, 


. et.al. 


Plus 


18252026-18252189 


33R374 
ooOOr 4 


Dunham 

uui 11 1 di u, 


. et.al. 


Plus 


18371200-18371282 


338414 


Dunham 
UUIllldlll, 


etal. 


Plus 


19345573-19345660 


33R41R 

0004 1 O 


Dunham 
UUIllldlll, 


. etal. 


Plus 


19435506-19435596 


OOOuU 1 


Hi inham 
UUIllldlll, 


et.al. 


Plus 


21244713-21244828 


oooouo 


Dunham 
UUIllldlll, 


et.al. 


Plus 


21221871-21221953 


OOOOcO 


uuniidi 11, 


. et.al. 


Plus 


21509763-21509864 
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UUIllldlll, 


. et.al. 


Plus 


24404720-24404899 


OoDOUh 


Dunham, 


at al 


PJus 
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r^i inham 
UUIllldlll, 


et.al. 


Plus 


27792166-27792272 


oooo/ y 


r^i inham 

uunndrn, 


ot al 
. cl.dl. 


Plus 


28410653-28410734 


00090/ 


r^rmham 
UUIllldlll, 


pt al 
. cl.dl. 


Plus 


29160655-29160725 


•aqoooq 
oooyyo 


UUIUIdl II, 


pt a) 


PJus 


30077787-30078184 


0QQn47 

ooautf 


rinnham 
UUIllldlll, 


et.al. 


Plus 


30760793-30760968 


OOo I UU 


Pli inham 
UUIllldlll, 


et.al. 


Plus 


31141580-31141765 


339114 


Dunham 
UUIllldlll, 


et.al. 


Plus 


31456454-31456519 


339121 


UUIllldlll, 


. etal. 


Plus 


31583467-31583536 


33Ql7n 

O03 I / If 


Dunham 

UUI llldlll, 


. et.al. 


Plus 


32216399-32216527 


33Q9Q3 


Dunham 

UUI II leu 1 1, 


. etal. 


Plus 


33223671-33223819 


339R5R 

00£OOU 


Dunham 

UUI II 101 1 1, 


. et.al. 


Minus 


1339607-1339397 


oqoqoo 

O 


Di inham 

UUI II ICUII, 


. etal. 


Minus 


2628296-2628109 
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oo^yot 


Hunham 
UUI II Idl 1 1, 


et.al. 


Minus 


2632606-2632457 


oo<;yyo 


UUIllldlll, 


nt a\ 
. cl.dl. 


Minus 


2711704-2711565 


ooouoo 


Hnnham 
UUIllldlll, 


at al 

. Cl.dl. 


Minus 


3028925-3028811 


OO0U3 / 


riunham 
UUIllldlll, 


et.al. 


Minus 


3204124-3204036 


333121 


Dunham 

UUIllldlll, 


. etal. 


Minus 


3308446-3308358 
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Plunham 
UUIllldlll, 


. etal. 
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3309596-3309531 


333193 

OOO f £o 


Dunham 
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etal. 
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3310817-3310749 
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Pit inham 
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. et.al. 
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3377220-3376309 
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et.al. 
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4308400-4308304 
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Hnnham 
UUIllldlll, 


at at 
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6466335-6465727 
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Dunham 
UUIllldlll, 


pf al 
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6467090-6466768 


O003U4 


Dunham 
uuuiidiu, 


et.al. 
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8217374-8217261 
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OO 


Dunham 

UUIllldlll, 


. et.al. 
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Di inham 
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11921456-11921205 


334222 


Dunham, 


. etal. 
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12732417-12732289 


334223 


Dunham, 


.etal. 


Minus 


12734365-12734269 


334255 


Dunham, 


. etal. 


Minus 


13200776-13200692 


334492 


Dunham, 


. et.al. 


Minus 


14478333-14478172 


334648 


Dunham, 


.etal. 


Minus 


15363301-15363222 


334787 


Dunham, 


. etal. 


Minus 


16299093-16298937 


334933 


Dunham, 


. etal. 


Minus 


20078117-20077991 
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334945 


Dunham, i. et.al. 


Minus 


20138885-20138637 


334967 


Dunham, I. et.al. 


Minus 


20173311-20173218 


334990 


Dunham, 1. eta). 
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20341159-20341087 


335093 


Dunham, 1. et.al. 
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21297367-21297214 


335288 


Dunham, 1. et.al. 


Minus 


22304275-22303770 


335289 


Dunham, (. etal. 


Minus 


22305950-22305708 


335548 


Dunham, 1. et.al. 


Minus 


24662773-24662673 


335551 


Dunham, 1. et.al. 


Minus 


24679828-24678961 


335619 


Dunham, 1. et.al. 


Minus 


25082677-25082498 


335620 


Dunham, 1. etal. 


Minus 


25092561-25092434 


335621 


Dunham, 1. et.a). 
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25098878-25098767 


335682 


Dunham, i. et.al. 
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25421215-25421093 


335755 


Dunham, ). et.al. 
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25763806-25763747 


335814 


Dunham, 1. et.al. 


Minus 


26320043-26319845 


335815 


Dunham, 1. et.al. 


Minus 


26320518-26320421 


335835 


Dunham, 1. eta!. 


Minus 


26393311-26393245 


335851 


Dunham, I, etal. 


Minus 


26604863-26604742 


335868 


Dunham, 1, eta). 


Minus 


26711437-26711300 


335896 


Dunham, !. et.al. 


Minus 


26977639-26977558 


335936 


Dunham, 1. et.al. 


Minus 


27360474-27360400 


335948 


Dunham, 1. et.al. 


Minus 


27555924-27555788 


336066 


Dunham, 1. et.al. 


Minus 


29241080-29240842 


336205 


Dunham, 1. et.al. 


Minus 


30477456-30477311 


336275 


Dunham, 1. etal. 


Minus 


32086675-32086536 


336292 


Dunham, 1. etal. 


Minus 


32818035-32817927 


336331 


Dunham, 1. et.al. 


Minus 


33594527-33594371 


336419 


Dunham, 1. etal 
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34052568-34052445 


336675 


Dunham, 1. etal. 
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2020758-2020664 


336684 


Dunham, 1, etal. 


Minus 


2158060-2157993 


336716 


Dunham, i. etal. 


Minus 


3259952-3259862 


336798 


Dunham, 1. etal. 


Minus 


5888954-5888757 


337043 


Dunham, 1. etal. 


Minus 


17407330-17407251 


337046 


Dunham, 1. etal. 


Minus 


17610892-17610821 


337128 


Dunham, 1. eta!. 


Minus 


22215251-22215034 


337192 


Dunham, 1. etal. 


Minus 


24591853-24591771 


337194 


Dunham, I. et.al. 


Minus 


24610510-24610359 


337229 


Dunham, 1. etal. 


Minus 


26716579-26716481 


337325 


Dunham, 1. et.al. 


Minus 


30015948-30015800 


337497 


Dunham, 1. et.al. 


Minus 


33371317-33371258 


337500 


Dunham, 1. etal. 


Minus 


33376212-33376158 


337603 


Dunham, 1. etal. 


Minus 


1299296-1299194 


337605 


Dunham, 1. et.al. 


Minus 


1346555-1346397 


337671 


Dunham, I. etaL 


Minus 


3260634-3260547 


337786 


Dunham, 1. etal. 


Minus 


4133203-4133081 


337862 


Dunham, !. etal. 


Minus 


5347658-5347550 


338083 


Dunham, ). etal. 


Minus 


9318438-9318301 


338158 


Dunham, I. et.al. 


Minus 


11794465-11794343 


338161 


Dunham, 1. et.al. 


Minus 


12124716-12124658 


338182 


Dunham, 1. et.al. 


Minus 


12824919-12824827 


338189 


Dunham, 1. et.al. 


Minus 


12878594-12878478 


338199 


Dunham, 1. et.al 


Minus 


13760865-13760780 


338215 


Dunham, 1. et.al 


Minus 


14055447-14055355 


338469 


Dunham, 1. etal 


Minus 


20520387-20520242 


338549 


Dunham, 1. etal. 


Minus 


22049171-22049081 


338561 


Dunham, I. et.al. 


Minus 


22311966-22311856 


338671 


Dunham, !. etal. 


Minus 


24508421-24508346 


338676 


Dunham, 1. etal. 


Minus 


24637427-24637369 


338726 


Dunham, I. et.al. 


Minus 


25926206-25925618 


338779 


Dunham, 1. etal. 


Minus 


27030151-27029795 


338871 


Dunham, I. etal, 


Minus 


28301708-28301611 


338872 


Dunham, ). et.a). 


Minus 


28300921-28300790 


338966 


Dunham, 1. et.al. 


Minus 


29614876-29614749 


339229 


Dunham, 1. et.al. 


Minus 


32722330-32722199 


339264 


Dunham, 1. et.al. 


Minus 


32975145-32975053 


325228 


6381940 Plus 


2630-2694 




325235 
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162154-162264 


329588 


3962484 Plus 


1169-1619 




329560 


3962491 Plus 
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329541 


3983503 Minus 
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TABLE 9A: Potential Therapeutic, Diagnostic and Prognostic targets for Therapy of Lung Cancer 

Table 9A shows about 1312 genes up-regulated in lung tumors (including squamous cell carcinomas, adenocarcinomas, small cell carcinomas, granulomatous and carcinoid 
tumors) relative to normal body tissues. These genes were selected from about 59680 probesets on the Eos/Affymetrix Hu03 Genechip array. 

Table 9B show the accession numbers for those Pkey's lacking UnigenelD's for table 9A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed In the 
"Accession" column. 

Table 9C show the genomic positioning for those Pkey's lacking Unigene ID'S and accession numbers in table 9A. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Average of lung tumors (including squamous cetl carcinomas, adenocarcinomas, small cell carcinomas, granulomatous and carcinoid tumors) divided by the 

average of normal lung samples 

Average of non-malignant lung disease samples (including bronchitis, emphysema, fibrosis, atelectasis, asthma) divided by the average of normal lung sample 
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3.82 
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Hs.1 54672 


methylene tetrahydrofolate dehydrogenase 
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Hs.1 54868 
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Hs.1 55421 
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1.71 
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426496 


D31765 


Hs 170114 
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phospholipase A2, group IVA {cytosofic, 
tumor necrosis factor receptor superfami 
Ras-GTPase-activating protein SH3-domain 
cyclin-dependent kinase 5, regulatory su 
ESTs 

ribosomal protein S6 

sine oculis homeobox (Drosophila) homolo 

ESTs 

ESTs 

ESTs, Weakly similar to T331 88 hypotheti 
hairy/enhancer-of-split related with YRP 
ESTs, Weakly similar to LEU5J-1UMAN LEUKE 
guanine nucleotide binding protein 4 
oncostatin M receptor 
guanine deaminase 
calmodulin-Iike 3 

tumor necrosis factor receptor superfami 
nuclear cap binding protein subunit 2, 2 
estrogen-responsive B box protein 
DKFZP434B061 protein 
cathepsin B 
ESTs 

polymerase (DNA directed), theia 

ESTs, Moderately similar to ALU8_HUMAN A 

chloride channel, calcium activated, fam 

ESTs 

ESTs, Weakly similar to T17288 hypotheti 
ATP-binding cassette, sub-family C {CFTR 
desmoglein 2 
ESTs 

desmoglein 1 

ESTs, Weakly similar to 2004399A chromos 
glypican 1 

zinc finger protein 131 (clone pHZ-10) 
ESTs, Weakly similar to 178885 serine/th 
gap junction protein, beta 6 {connexin 3 
ESTs, Weakly similar to unknown protein 
ESTs 

doublesex and mab-3 related transcriptio 
Homo sapiens cDNA: FLJ23494 fis, clone L 
gap junction protein, beta 2, 26kD (conn 
SRB7 (suppressor of RNA polymerase B, ye 
ESTs 

gb:EST382704 MAGE resequences, MAGK Homo 
ESTs 

gb:MR2-HT0377-!50200-202-e03 HT0377 Homo 
granin-Hke neuroendocrine peptide precu 
hypothetical protein DKFZp434A1315 
endothelial differentiation, iysophospha 
novel protein 

integrin, alpha 3 (antigen CD49C, alpha 
ESTs 

5'-3' exoribonuclease 2 

small inducible cytokine subfamily A (Cy 

uroplakin 1B 

integrin, alpha 2 (CD49B, alpha 2 subuni 
STG protein 

cadherin 3, type 1, P-cadherin (placenta 
Homo sapiens mRNA: cDNA DKFZp434L1226 (f 
EST 

Transmembrane protease, serine 3 

Homo sapiens, clone IMAGE:3544662, mRNA, 

phosphate cytidylyltransferase 1, cholin 

matrix metalloproteinase 13 (coilagenase 

SCGlO-like-protein 

hypothetical protein FLJ10377 

hypothetical protein FU 10244 

Homo sapiens cDNA FU1 1346 fis, clone PL 

S100 calcium-binding protein P 

gb:zr03f12.r1 Stratagene NT2 neuronal pr 

ESTs 

ESTs 

Homo sapiens cDNA: FLJ21274 fis, clone C 
ESTs, Weakly similar to ALU8_HUMAN ALU S 
potassium channel TASK-4; potassium chan 
granzyme K (serine protease, granzyme 3; 
ESTs, Moderately similar to T03094 A-kin 
ESTs, Weakly similar to JC7328 amino aci 
UDP-N-acetyl-aIpha-D-galactosamine:polyp 
ESTs, Weakly smlar to KIAA1074 protein 
Homo sapiens PRO0593 mRNA, complete cds 
Homo sapiens cDNA: FLJ231 17 fis, clone L 
hypothetical protein MGC4485 
ESTs 

PRO0327 protein 
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polymerase (RNA) II (DNA directed) polyp 
clone HQ0310PRO0310p1 
Ihrombospondin 1 

Homo sapiens cDNA FLJ11660 fis, clone HE 

lymphocyte antigen 6 complex, locus D 

kinesin-like protein 2 

transcription factor BMAL2 

ESTs, Weakly similar to ALUB.HUMAN !!!! 

ESTs 

caspase 6, apoptosis-retated cysteine pr 
aldc-keto reductase family 1, member C2 
ESTs 

beta tubulin 1 , class VI 
calcium/calmodulin-dependent protein kin 
ESTs 

immunoglobulin kappa constant 

lung type-l cell membrane-associated gly 

ESTs 

KIAA0205 gene product 
phosphoserine aminotransferase 
hypothetical protein PR01966 
hypothetical protein PRO2013 
presenilins associated rhomboid-like pro 
ESTs 

ESTs, Weakly similar to T17226 hypotheti 
ESTs 

gb:tr34b07.x1 NCLCGAP j>23 Homo sapiens 
Homo sapiens cDNA: FLJ23523 fis, clone L 
Homo sapiens cDNA FLJ 13445 fis, clone PL 
ESTs, Highly similar to A35661 DNA excis 
ESTs 

Homo sapiens cDNA FLJ 11 980 fis, clone HE 
Homo sapiens cDNA FLJ 12676 fis, clone NT 
ESTs 

Homo sapiens, Similar to RIKEN cDNA 2810 
phorbol-1 2-myristate-1 3-acetate-induced 
Ig superfamily receptor LNIR 
ESTs 

Homo sapiens clones 24714 and 24715 mRNA 
Target CAT 

dyskeratosis congenita 1, dyskerin 
ESTs 

flap structure-specific endonuclease 1 
ESTs 

immunoglobulin lambda locus 
cyciin-dependent kinase inhibitor 2C {p1 
ESTs 
ESTs 

interleukin-1 homolog 1 

ESTs 

ESTs 

Homo sapiens, clone IMAGE:3682908, mRNA 
H.sapiens polyA site DNA 
uncharacterized bone marrow protein BM03 
ESTs 

KIAA1313 protein 

ESTs 

ESTs 

hypothetical protein FLJ10719; KIAA1794 
hypothec protein FLJ10774; KIAA1709 
fibrinogen-like 1 
hypothetical protein FLJ11301 
nucleolar protein (KKE/D repeat) 
protein regulator of cytokinesis 1 
hypothetical protein FLJ23588 
wingless-type MMTV integration site fami 
WD repeat domain 4 
ESTs 
ESTs 
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ESTs 
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down syndrome critical region protein DS 

ESTs 
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hypothetical protein DKFZp547Dl55 

ESTs 
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guanine monphosphate synthetase 
cDNA for differentially expressed C01 6 g 
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1.00 


448278 


W07369 


Hs.1 1782 


ESTs 


0.97 


1.90 


448290 


AK002107 


Hs.20843 


Homo sapiens cDNA FLJ 11 245 fis, clone PL 


1.00 


1.00 


448296 


BE622756 


Hs.10949 


Homo sapiens cDNA FLJ 14162 fis, clone NT 


2.42 


2.17 


448357 


BE274396 


Hs.108923 


RAB38, member RAS oncogene family 


1.44 


1.08 


448390 


AL035414 


Hs.21068 


hypothetical protein 


1.00 


43.00 


448469 


AW504732 


Hs.21275 


hypothetical protein FLJ 1101 1 


2.63 


2.49 


448569 


BE382657 


Hs.21486 


signal transducer and activator of trans 


1,84 


2.53 


448663 


BE614599 


Hs.1 06823 


hypothetical protein MGC14797 


3.29 


46.00 


448672 


AI955511 


Hs.225106 


ESTs 


1.00 


21.00 


448733 


NM_005629 


Hs.1 87958 


solute carrier family 6 (neurolransmitte 


1.82 


1.08 


448741 


BE614567 


Hs.1 9574 


hypothetical protein MGC5469 


2.48 


1.92 


448757 


AI366784 


Hs.48820 


TATA box binding protein (TBP)-associate 


23.53 


20.00 


448775 


AB025237 


Hs.388 


nudix (nucleoside diphosphate linked moi 


2.34 


1.97 


448826 


AI580252 


Hs.293246 


ESTs, Weakly similar to putative p1 50 [H 


74.07 


62.67 


448830 


AL031658 


Hs.22181 


hypothetical protein dJ310O13.3 


1.37 


1.31 


448844 


AI581519 


Hs.1 77164 


ESTs 


1.00 


31.00 


448988 


Y09763 


Hs.22785 


gamma-aminobutyric acid (GABA) A recepto 


1.84 


1.95 


448993 


AI471630 




KIAA0144 gene product 


1.63 


1.49 


449003 


X76342 


Hs.389 


alcohol dehydrogenase 7 (class IV), mu o 


1.00 


1.00 


449029 


N28989 


Hs.22891 


solute carrier family 7 (cationic amino 


1.97 


2.26 


449040 


AF040704 


Hs.149443 


putative tumor suppressor 


0.97 


1.56 


449048 


Z45Q51 


Hs.22920 


similar to S68401 (cattle) glucose indue 


27.13 


90.00 


449053 


AI625777 


Hs.344766 


ESTs 


8.33 


44.00 


449054 


AF148848 


Hs.22934 


myoneurin 


73.85 


104.00 


449101 


AA205847 


Hs.23016 


G protein-coupled receptor 


2.58 


27.00 
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449167 


T05095 


Hs.19597 


449207 


AL044222 


Hs.23255 


449228 


AJ403107 


Hs. 148590 


449230 


BE613348 


Hs.211579 


449305 


A1638293 




449318 


AW236021 


Hs.78531 


449448 


060730 


Hs,57471 


449467 


AW205006 


Hs.1 97042 


449523 


NM_000579 


Hs.54443 


449722 


BE280074 


Hs.23960 


449976 


H06350 


Hs.1 35056 


450001 


NMJJ01044 


Hs.406 


450098 


W27249 


Hs.8109 


450101 


AV649989 


Hs.24385 


450149 


AW969781 


Hs.1 32863 


450193 


AI916071 


Hs.1 5607 


450221 


AA328102 


Hs.24641 


450372 


BE218107 


Hs,202436 


450375 


AA009647 


Hs.8650 


450447 


AF212223 


Hs.25010 


450568 


AL050078 


Hs.25159 


450589 


AI701505 


Hs.202526 


450684 


AA872605 


Hs.25333 


450701 


H39960 


Hs.288467 


450705 


U90304 


Hs.25351 


450832 


AA401369 


Hs.1 90721 


450937 


R49131 


Hs.26267 


450983 


M305384 


Hs.25740 


451105 


AI761324 




451110 


AI955040 


Hs.265398 


451253 


H48299 


Hs.26126 


451291 


R39288 


Hs.6702 


451320 


AW498974 




451380 


H09280 


Hs.1 3234 


451386 


AB029006 


Hs.26334 


451437 


H24143 


Hs,31945 


451462 


AK000367 


Hs.26434 


451524 


AK001466 


Hs.26516 


451541 


BE279383 


Hs.26557 


451592 


AI805416 


Hs.213897 


451635 


AA018899 


Hs.1 271 79 


451743 


AA401369 


Hs. 190721 


451806 


NMJ)Q3729 


Hs.27076 


451807 


W52854 




451871 


AI821005 


Hs.1 18599 


451952 


AL120173 


Hs,301663 


452012 


AA307703 


Hs.279766 


452046 


AB018345 


Hs.27657 


452194 


AI694413 


Hs.332649 


452206 


AW340281 


Hs.33074 


452240 


AA401369 


Hs.1 90721 


452256 


AK000933 


Hs.28661 


452281 


T93500 


Hs.28792 


452291 


AF015592 


Hs.28853 


452295 


BE379936 


Hs.28866 


452304 


AA025386 


Hs.61311 


452340 


NMJJ02202 


Hs.505 


452349 


AB028944 


Hs.29189 


452367 


U71207 


Hs.29279 


462401 


NM_007115 


Hs,29352 


452410 


AM 3361 9 




452461 


N78223 


Hs.1 081 06 


452571 


W31518 


Hs.34665 


452613 


AA461599 


Hs.23459 


452699 


AW295390 


Hs.213062 


452705 


H49805 


Hs.246Q05 


452747 


AF1 60477 


Hs.61460 


452787 


AW294022 


Hs.222707 


452795 


AW392555 


Hs.1 8878 


452823 


AB012124 


Hs,30696 


452833 


BE559681 


Hs.30736 


452838 


U65011 


Hs.30743 


452862 


AA401369 


Hs.1 90721 


452865 


AW173720 


Hs.345805 


452934 


AA581322 


Hs.4213 


452946 


X95425 


Hs.31092 


452976 


R44214 


Hs.1 01 189 


453028 


AB0Q6532 


Hs.31442 


453095 


AW295660 


Hs.252756 


453102 


NM.007197 


Hs.31664 


453103 


AI301052 


Hs.153444 


453120 


AA292891 


Hs.31773 


453153 


N53893 


Hs.24360 


453160 


A1263307 


Hs.239884 


453197 


AI916269 


Hs.109057 



KIAA1694 protein 

nucleoporin 155kD 

protein related with psoriasis 

melanoma eel) adhesion molecule 

gb:tt09b07.x1 NCLCGAPJ3C6 Homo sapiens 

Homo sapiens, Similar to RIKEN cDNA 5730 

ESTs 

ESTs 

chemokine (OC motif) receptor 5 
cyciin B1 

Human DNA sequence from clone RP5-85QE9 

solute carrier family 6 (neurotransmitte 

hypothetical protein FLJ21080 

Human hbc647 mRNA sequence 

Zic family member 2 (odd-paired Drosophi 

Homo sapiens Fanconi anemia complementat 

cytoskeleton associated protein 2 

ESTs 

a disintegrin and metaHoproteinase doma 

hypothetical protein P15-2 

Homo sapiens cDNA FLJ 10784 fis, clone NT 

ESTs 

interieukin 1 receptor, type IJ 

Homo sapiens cDNA FU 12280 fis, clone MA 

Iroquois homeobox protein 2A (IRX-2A) ( 

ESTs 

ATP-dependant interferon response protei 

ER01 (S. cereWs/aeHike 

gb:wi60b11.x1 NCLCGAP_Co16 Homo sapiens 

ESTs, Weakly similar to transformations 

claudin 10 

ESTs 

diacylglycerol kinase, zeta (104kD) 
ESTs 

spastic paraplegia 4 (autosomal dominant 
hypothetical protein FIJ1 1071 
hypothetical protein FLJ20360 
hypothetical protein FU10604 
plakophilln 3 
ESTs 

cryptic gene 
ESTs 

RNA 3'-terminal phosphate cyclase 
hypothetical protein FLJ23293 similar to 
ESTs 
ESTs 

kinesln family member 4A 
KIAA0802 protein 

olfactory receptor, family 2, subfamily 

Homo sapiens, done IMAGE:3606519, mRNA, 

ESTs 

Homo sapiens cDNA FLJ 10071 fis, clone HE 

Homo sapiens cDNA FLJ 1 1041 fis, clone PL 

CDC7 (cell division cycle 7, S. cerevisi 

programmed ceil death 10 

ESTs, Weakly similar to S 10590 cysteine 

ISL1 transcription factor, LIM/homeodoma 

ATPase, Class VI, type 1 1 A 

eyes absent (Drosophlla) homolog 2 

tumor necrosis factor, alpha-induced pro 

Homo sapiens mRNA; cDNA DKFZp434E2321 (f 

transcription factor 

ESTs 

ESTs 

ESTs 

ESTs 

Ig superfamily receptor LN1R 
KIAA1718 protein 
hypothetical protein FLJ21620 
transcription factor-tike 5 (basic helix 
KIAA0124 protein 

preferentially expressed antigen in meia 
ESTs 

ESTs, Weakly similar to A47582 B-cell gr 
hypothetical protein MGC16207 
EphAS 
ESTs 

RecQ protein-tike 4 
ESTs 

frizzled (Drosophila) homolog 10 
ESTs 

pregnancy-induced growth inhibitor 
ESTs 

H2B histone family, member L 

ESTs, Weakly similar to ALU5_HUMAN ALU S 



1.61 


2.36 


2.36 


1.56 


1.15 


1.15 


206.65 


151,00 


17.28 


45.00 


26.39 


35.00 


1.00 


1.00 


1.00 


1.00 


56,80 


216.86 


150.03 


1.00 


2.16 


2.85 


1.17 


1.45 


1.79 


2.38 


1.00 


69.00 


1.00 


1.00 


29.85 


34.00 


1.00 


1.00 


1.00 


1.00 


51.26 


93.00 


123.20 


181,00 


1.00 


19.00 


1.00 


23.00 


1.00 


100.00 


1.89 


1.55 


1.00 


45.00 


25.17 


17.00 


90.92 


90.00 


3,33 


1.70 


15.02 


124.00 


1.00 


143.00 


3.02 


2.29 


1.00 


1.00 


2.92 


18.00 


6.90 


6.67 


35.75 


72.00 


1.00 


69.00 


1.83 


2.10 


1.13 


1.07 


1.88 


1.33 


1.00 


1.00 


1.52 


1,92 


4.95 


17.00 


13.55 


31.00 


1.55 


35.00 


1.81 


2.53 


1.00 


22.00 


3.43 


2.26 


56.59 


19.00 


1.67 


4.09 


9.31 


53.00 


13.42 


17.00 


39.03 


94.00 


153.01 


340.00 


1.95 


23.00 


42.33 


61.00 


1.17 


2.14 


1.00 


13.00 


1.09 


1.42 


54.49 


53.00 


1.00 


32.00 


1.26 


1.99 


24.47 


35.00 


54.61 


102,00 


1.39 


1.32 


1.00 


26,00 


1.00 


100 


112.87 


1.29 


1.00 


1.00 


1.00 


1.00 


7.91 


75.00 


3.16 


1.92 


174.35 


1.00 


98.26 


17.00 


1.55 


1.00 


1.73 


1.19 


1.00 


1.00 


1.58 


1.98 


1.80 


1.60 


0.77 


1.50 


1.00 


1.00 


1.00 


too 


1.23 


1.20 


1.00 


83.00 


1.00 


30.00 


1.00 


134.00 



166 



WO 02/086443 PCT/US02/12476 



453210 


AL133161 


Hs.32360 


hypothetical protein FLJ 10867 


1.69 


1.93 


453240 


AI969564 


Hs.166254 


hypothetical protein DKFZp566l133 


1.00 


1.00 


453317 


NM 002277 


Hs,41696 


keratin, hair, acidic, 1 


119 


1.27 


453323 


AF034102 


Hs.32951 


solute carrier family 29 {nucleoside tra 


4.90 


4.11 


453331 


AI240665 


Hs.8850 


ESTs 


199.42 


340.00 


453392 


U23752 


Hs.32964 


SRY (sex determining region Y)-box 1 1 


1.00 


16.00 


453431 


AF094754 


Hs.32973 


glycine receptor, beta 


1.00 


1.00 


453439 


AI572438 


Hs.32976 


guanine nucleotide binding protein 4 


3.44 


5.17 


453459 


BE047032 


Hs.257789 


ESTs 


2.84 


5.58 


453563 


AW6089Q6.comp 


Hs.181163 


hypothetical p 


rotein MGC5629 


453533 


AA357001 


Hs.34045 


hypothetical protein FU20764 


1.74 


1,60 


453775 


NM 002916 


Hs!35120 


replication factor C (activator 1) 4 (37 


19.49 


1.00 


453830 


AA534296 


Hs.20953 


ESTs 


24.92 


25.00 


453857 


AL080235 


Hs. 35861 


DKFZP586E1621 protein 


167.59 


66.00 


453867 


AI929383 


Hs.33032 


hypothetical protein DKFZp434N185 


1.00 


39.00 


453883 


AI638516 


Hs.347524 


cofactor required forSpl transcripliona 


1.97 


1.58 


453884 


AA355925 


Hs 36232 


KIAA0186 gene product 


63.89 


20.00 


453900 


AW003582 


Hs.226414 


ESTs, Weakly similar to ALU8_HUMAN ALU S 


20.41 


16.00 


453922 


AF053306 


Hs.36708 


budding uninhibited by benzimidazoles 1 


7.09 


22.00 


453941 


U39817 


Hs.36820 


Bloom syndrome 


29.75 


19.00 


453964 


AI961486 


Hs,12744 


ESTs 


1,00 


1.00 


453968 


AA847843 


Hs<62711 


Homo sapiens, clone }MAGE:3351295, mRNA 


2.08 


1.81 


453976 


BE463830 


Hs.163714 


ESTs 


3.02 


131.00 


454024 


AA993527 


Hs.293907 


hypothetical protein FLJ23403 


1.00 


131.00 


454034 


NMJJ00691 


Hs.575 


aldehyde dehydrogenase 3 family, member 


1.23 


1.02 


454042 


T19228 


Hs. 172572 


hypothetical protein FLJ20093 


30.63 


171.00 


454059 


NMJH33154 


Hs.37048 


statherin 


1.00 


1.00 


454066 


X00*356 


Hs.37058 


calcitonln/calcitonin-related polypeptid 


1.01 


1.45 


454098 


W27953 


Hs.292911 


ESTs, Highly similar to S60712 band-6-pr 


1.26 


1.11 


454241 


BE144666 




gb:CM2-HT0176-041099-017-c02 HT0176 Homo 


6.33 


5.04 


454417 


AI244459 


Hs. 11 0826 


trinucleotide repeat containing 9 


4.30 


7.82 


454439 


AW819152 


Hs. 154320 


DKFZP56601 646 protein 


1.00 


1.00 


455175 


AW993247 




gb:RC2-BN0033-180200-014-h09 BN0033 Homo 


13.75 


103.00 


455601 


AI368680 


Hs.816 


SRY (sex determining region Y)-box 2 


206.11 


1.00 


456237 


AA203682 




gb:zx52e07.r1 Soaresjetaljiver.s pleen_ 


1.00 


1.00 


456321 


NM 001327 


Hs.87225 


cancer/testis antigen 


1.14 


1.10 


456475 


NM 0D0144 


Hs.95998 


Friedreich ataxia 


1.00 


48.00 


456508 


AA502764 


Hs 123469 


ESTs, Weakly similar to AF208855 1 BM-01 


162.25 


189.00 


456534 


X91195 


Hs 10Q623 


phospholipase C, beta 3, neighbor pseudo 


2.12 


1.80 


456736 


AW248217 


Hs 1619 


achaete-scute complex (Drosophila) homo! 


1.15 


1.94 


456759 


BE259150 


Hs 127792 


delta (Drosophila)-like 3 


1.00 


1.00 


456990 


NM 004504 


Hs 171545 


H1V-1 Rev binding protein 


16.42 


84.00 


457200 


U33749 


Hs 197764 


thyroid transcription factor 1 


0.57 


1.76 


457234 


AW968360 


Hs.14355 


Homo sapiens cDNA FLJ 13207 fis, clone NT 


2.71 


4.15 


457465 


AW301344 


Hs.122908 


DNA replication factor 


46.37 


47.00 


457489 


AI693815 


Hs.127179 


cryptic gene 


1.12 


1.35 


457646 


AA725650 


Hs.1 12948 


ESTs 


1.55 


2.51 


457733 


AW974812 


Hs.291971 


ESTs 


1.00 


55.00 


457819 


AA057484 


Hs.35406 


ESTs, Highly similar to unnamed protein 


4.36 


3.18 


458092 


BE545684 


Hs.343566 


KIAA0251 protein 


1.00 


1.32 


458098 


BE550224 




metailothionein 1E (functional) 


1.00 


22.00 


458207 


T28472 


Hs.7655 


U2 small nuclear ribonucleoprotetn auxil 


2.06 


1.88 


458242 


BE299588 


Hs.28465 


Homo sapiens cDNA: FLJ21869 fis, clone H 


1.00 


1.00 


458247 


R14439 


Hs.209194 


ESTs 


7.00 


9.85 


458679 


AW975460 


Hs.142913 


ESTs 


1.00 


3.00 


458778 


AW451034 


Hs.326525 


arylsulfatase D 


1.31 


2.01 


458933 


AI638429 


Hs.24763 


RAN binding protein 1 


1.98 


1.71 


459352 


AW810383 


Hs.206828 


ESTs 


12.60 


63.00 


459670 


F01020 


Hs.172004 


titin 


1.00 


1.00 


459702 


A1204995 




gb:an03c03.x1 Stratagene schizo brain S1 


1.00 


237.00 



TABLE 9B 

Pkey: Unique Eos probeset identifier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey 
407746 



CAT Number 
10125J 



408070 


1036688J 


408660 


107294J 


409522 


113735 1 


409866 


11 56522 J 


410032 


1170435J 


411089 


123172J 


411152 


1234028J 


412537 


1304.1 



Accession 

AK001962 R69415 BE464605 AA418699 AA053293 AA149075 AA058396 AW338226 AW272659 AA454607 AI139535 AW469852 AI275461 
AW271982 AA730033 AA576507 AA991217 AA782067 AI985851 AA805864 AA505598 AW469857 R69546 AA988279 AW001647 N63320 
D82661 T27343 AA306950 AA360989 R58778 
AW148852BE350895 

AA525775 AA056342 AI538978 AW975281 AA664986 

AA075382AA075431 

AW502152 H41202H29772 

BE065985 BE065944 BE066008 BE066083 BE066093 

AA456454AA713730AA091294AA584921 N86077 AW836781 AA601031 AA579876AA551106 AA633188 AW905577 A1955808 AI679386 
AI679895 AA514764 AA454562 AI082382 AA595822 AA551351 AA586369 AA666384 AA1 88934 AA666398 AA551297 AA565188 
BE069199 AW936012 AW877466 AW819782 AW935798 AW835546 AW936042 BE069121 AW835625 AW877536 AW935885 BE069202 
AW820019AW935937BE160180AW935946 BE069101 BE069125AW877527BE160316 8E160398 AW935794 AW835701 AW935784 
AL031778X59711 NM 002505 M59079 AI870439 AI494259 AW664010AA405063AA436132 BE174516AA412691 AI400314AA436024 
T29403 BE079412 BE079428 N90322 AI631202 AI141758 M016793 A1167566 AI862075 AI375230 AI208445 AW235763 AL044113 AA382556 
AW953918 AA927051 AA889823 BE003094 AW390155 AW360805 AW360823 AW360810 AA425472 AI694282 AL0441 14 AI684577 AI809865 
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412811 132943J 



413690 1383256J 
414883 15024.1 



417324 
418574 



419936 
421582 



422128 
423034 
423816 
424200 
424999 
426966 
426991 
427260 
428023 



429220 
429978 



430439 
430935 
431089 
431322 
432407 



156454J 
166714J 
17690.1 



418712 1784125.1 
419443 184788.1 
419502 18535.1 



189181.1 
2041.1 



21 1994 J 

2241 22 J 

23234J 

236595.1 

245835.1 

273896.1 

27415.1 

276598.1 

28589.2 



301384.1 
31 150 J 



31808 1 

325772.1 

327825.1 

331543.1 

34624.1 



434414 38585.1 



436608 42361.3 



PCT/US02/12476 

AI478773 AI160445 AI674630 N69088 AW665529 N49278 AI129239 AI457890 AI621264 AW297152 AI268215 AA907787 AI286170 AI017982 
AI963541 AI469807 A1969353 BE552356 N66509 AA736741 AA382555 AW07581 1 AW292026 

H06382 AW957730 AA352014 R13591 AA1 21 201 D60420 BE263253 BE047862 Z41 952 AI424991 AI693507 AI8631 08 AA599060 A1091 1 48 
M598689 R39887 M8 13462 A W0 16452 H06383 R41807 AI364268 AA620528 Af241 940 AW089 149 AW090733 AW088875 Z38240 
AA121202 R17734 
BE157489 BE157560 

AA926960 AA926959 W76521 W24270 W2 1526 AA037 172 BE267636 H83186 AA469909 N86396 AA001348 BE535736 AA081745BE566245 
AA082436 H72525 H77575 N49786 W80565 H78746 BE569085 W04339 R981 27 T55938 BE279271 AW960304 T29812 M476873 BE297387 
AA292753 M177048 NM_0Q1826 X54941 BE314366 M908783 AI719075 BE2701 72 BE269819 AA889955 AI204630 W25243 AI935150 
AA872039 W72395 T99630 AI422691 H98460 N3142B BE255916 H03265 AI857576 M776920 AA910644 M459522 AA293140 AW514667 
R75953 AW662396 AA662522 AI865147 AI4231 53 AW262230 AA584410 AA583187 AW024595 AW069734 A1828996 AA282997 AA876046 
AW613002 AA527373 AW972459 AI831360 AA621337 AA100926 AA77241 8 AA594628 AI033892 W95096 AI034317 AA398727 AI085031 
N9521 0 AI459432 AI041437 AA932124 AA627684 AA935829 AI004827 AI423513 AI094597 H42079 R54703 A1630359 AA617681 AA978045 
AA643280W44561 AI991988 AI537692 AI090262 AA740817 AI3121 04 AI911822 AA416871 AI185409 AA1 29784 AA701 623 AI075239 
AI139549 AA633648 AI339996 AI336880 AA399239 AI078708 AI085351 AI362835 AI34661 8 Al 146955 AI989380 AI348243 N92892 AA765850 
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438091 44964J 



439000 
439285 

439780 
441128 

443068 
443947 
447636 



449305 
451105 
451320' 



451807 
452410 



454241 
455175 
456237 
458098 



46771 6_1 
47065J 

47673.1 
51021.2 

558874J 
5861 60_1 
7301J 



448993 79225.1 
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86576J 



8865.1 
9163.1 



1067807.1 
1257335.1 
168730.1 
47395.1 
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AI813854 AI21451 8 A1635262 AI139455 AI707807 A1698085 AW884528 AI024768 AI004723 AW087420 AI565133 N94964 AI268939 
AW51 3280 AI061 1 26 AI43581 8 AI8591 06 AI360506 AI024767 AA51 301 9 AA757598 X561 96 M902959 AI334784 A1860794 AA01 0207 
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TABLE 9C 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication entitled The DNA 

sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from whfch exons were predicted. 
Imposition: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


NLposition 


400512 


9796593 


Minus 


1439-1615 


400517 


9796686 


Minus 


49996-50346 


400560 


9843598 


Plus 


94182-94323,97056-97243,101095-101236,102824-103005 


400664 


8118496 


Plus 


13558-13721,13942-14090,14554-14679 


400665 


8118496 


Plus 


16879-17023 


400666 


8118496 


Plus 


17982-18115,20297-20456 


400749 


7331445 


Minus 


9162-9293 


400763 


8131616 


Minus 


35537-35784 


401027 


7230983 


Minus 


70407-70554,71060-71160 


401093 


8516137 


Minus 


22335-23166 


401203 


9743387 


Minus 


172961-173056,173868-173928 


401212 


9858408 


Plus 


87839-88028 


401411 


7799787 


Minus 


144144-144329 


401435 


8217934 


Minus 


54508-55233 


401464 


6682291 


Minus 


170688-170834 


401714 


6715702 


Plus 


96484-96681 


401747 


9789672 


Minus 


118596-118816,119119-119244,119609-119761,120422-120990,130161-130381,130468-130593,131097-131258,131866- 








131932,132451-132575,133580-134011 


401760 


9929699 


Plus 


831 26-83250,85320-85540,9471 9-95287 


401780 


7249190 


Minus 


28397-28617,28920-29045,29135-29296,29411-29567,29705-29787,30224-30573 


401781 


7249190 


Minus 


83215-83435,83531-83656,83740-83901,84237-84393,84955-85037,86290-86814 


401785 


7249190 


Minus 


165776-165996,166189-166314,166408-166569,167112-167268,167387-167469,168634-168942 


401797 


6730720 


Plus 


6973-7118 


401961 


4581193 


Minus 


124054-124209 


401985 


2580474 


Plus 


61542-61750 


401994 


4153858 


Minus 


42904-43124,43211-43336,44607-44763,4519945281,46337-46732 


402075 


8117407 


Plus 


121907-122035,122804-122921,124019-124161,124455-124610,125672-126076 


402260 


3399665 


Minus 


113765-113910,115653-115765,116808-116940 


402265 


3287673 


Plus 


21059-21168 


402297 


6598824 


Plus 


35279-35405,35573-35659 


402408 


9796239 


Minus 


110326-110491 
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409490 


Q7Q633Q 


Plus 


199750-199919 


409574 


R07710R 
OU/ / IUO 


Minus 


39990-39509 

09&yU~Ot70U£ 


4O9R09 
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Odof ioq 


iVtitlUS 


53949-53439 


409Q94 
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Mini it; 
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Mini ic 
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yuHou-yoojo 


4033R1 

HUuOO 1 


Q43R967 
3H0Q£0/ 


Mini ic 


96009.9617ft 


40^478 


QQCQORft 

yyoo^oo 


rlUo 
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XOOO*OuUl,0 1 5fO*OOO^ t «JUOO-H I 1/ 
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QCRQQ7Q 


Mint ic 


93RKR 94349 


A(\V?\ 5 
HUOf ID 




PIlIC 

riuo 


R519R.R59Q9 
OO 1 £.0-00 eve 
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QCCQC70 


(win us 


995757-995Q3Q 
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HuHU/D 
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HUH I U 1 
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QU( oy^io 
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ivnnub 
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HUH 1 HU 


9ft 4359ft 


riua 


37761 -^ft 147 

Ol / u l-OD IH/ 


404165 


9998489 


Minns 


69025-69128 


AW 85 

HUH lOO 


45795R4 

HOI £JOH 


Mimic 


129171-129327 


404210 


5006946 


Plus 


169926-170121 


404953 


93R7909 


Miniffe 

IVtlMUO 


55675-56055 


4049R7 


9396514 


Plus 


53134-53981 


404998 


9Q44963 


Mmnc 
IVIIHUo 


7 3591 -7379 3 


404347 


yaou < yo 


Plus 


744Q3 7489Q 


HUHHHU 


759R051 


nus 


80d?0-R1 5R1 

OUHOU-O J DO 1 


HUH/ 1£ l 


0R5R64R 

y OO DO HO 


Mint ic 


1 73763-1 749Q4 

1 / Of OO" J / Hi3H 


404794 
HUHl OH 


4R 96439 


Plus 


10161Q-101R9R 


4048 54 
HvHOOH 


/ iHOHtU 


Plus 


14960-14537 


404R77 


15109R4 

1 O 1 siOH 


Pine 


1095-9107 


404Q97 
HUHytJ 


7349009 
f OH£UU£ 


P/tic 
rlUS 


oooyu-oyooj 


404QQ5 


6007fiQ0 


Pius 


379QQ-3B145 38652-38998 39727-39872 40557-40674 42351-42450 


40544Q 


7599407 
/O^tHiJ/ 


Plus 


49936-49570 

He <CO O-HZO / V 


40555R 

HUDDuD 


5005005 




3591 9-36065 


405579 
HUOOl £ 


oouuoy 1 


Pius 


R5930-R5Q3R 


40554ft 


40 14350 

H«7 l HOOU 




741 QRQ 


405676 


4557087 


Plus 


73195-73917 


405770 


2735037 


Plus 


61057-62075 


405932 


7767812 


Minus 


123525-123713 


406137 


9166422 


Minus 


30487-31058 


406360 


9256107 


Minus 


7513-7673 


406399 


9256288 


Minus 


63448-63554 


406467 


9795551 


Plus 


182212-182958 



TABLE 10A: Potential Therapeutic, Diagnostic and Prognostic targets for Therapy of Lung Cancer and Non-malignant Lung Disease 

Table 2A shows about 307 genes up-regulated in non-malignant lung disease relative to lung tumors and normal body tissues and/or down-regulated in lung tumors relative to 
normal lung and non-malignant tung disease. These genes were selected from about 59680 probesets on the Eos/AnVmetrix Hu03 Genechip array. 

Table 108 show the accession numbers for those Pkey's lacking UnigeneiD's for table 10A For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
"Accession" column. 

Table 10C show the genomic positioning for those Pkey's lacking Unigene ID's and accession numbers in table 10A. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed. 



Pkey: j Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1: Average of iung tumors {including squamous cell carcinomas, adenocarcinomas, small cell carcinomas, granulomatous and carcinoid tumors) divided by the 
average of norma) lung samples 

R2: Average of non-malignant lung disease samples (including bronchitis, emphysema, fibrosis, atelectasis, asthma) divided by the average of normal lung samples 



Pkey 


ExAccn 


UnigenelD 


Unigene Title 


R1 


R2 


404394 






ENSP00000241075:TRRAP PROTEIN. 


0.79 


3,10 


404916 






Target Exon 


1.00 


159.00 


405257 






Target Exon 


1.00 


422.00 


407228 


M25079 


Hs.155376 


hemoglobin, beta 


0.47 


2.33 


407568 


AA740964 


Hs.62699 


ESTs 


1.00 


123.00 


408562 


A1436323 


Hs.31141 


Homo sapiens mRNA for KIAA1568 protein, 


1.00 


230.00 


409031 


AA376836 


Hs.76728 


ESTs 


1.00 


128.00 


410434 


AF051152 


Hs.63668 


toll-like receptor 2 


39.65 


149.00 


410467 


AF102546 


Hs.63931 


dachshund (Drosophtla) homotog 


1.00 


109.00 


410808 


T40326 


Hs.167793 


ESTs 


1.14 


13,14 


412351 


AL135960 


Hs73828 


T-cell acute lymphocytic leukemia 1 
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hypothetical protein FU22Q29 
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ESTs 
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ESTs 
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Hs.75819 


glycoprotein M6A 
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oxidised low density lipoprotein {lectin 
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ESTs 
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tyrosine kinase with immunoglobulin and 


0.67 


1.65 
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H00747 


Hs.29792 


ESTs, Weakly similar to I38022 hypotheti 


0.29 


2.64 
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U20350 


Hs.78913 


chemokine (C-X3-C) receptor 1 


1.00 


145.00 
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CD83 antigen (activated B lymphocytes, i 15.32 237.00 

complement component 4-binding protein, 0.64 4.00 

endothelin receptor type B 0.01 3.90 

nuclear receptor subfamily 4, group A, m 36.30 357.00 

chordin-Iike 1.00 179.00 

adenosine deaminase, RNA-specific, B1 (h 0.02 6.00 

protein tyrosine phosphatase, non-recept 1 .00 11 3.00 

ESTs, Weakly similar to S41044 chromosom 0.44 1.90 

acid phosphatase 5, tartrate resistant 0.96 2.04 

Kallmann syndrome 1 sequence 0.62 2.74 

TEK tyrosine kinase, endothelial (venous 0.03 6.90 

neurotrimin 1.48 5.13 

hypothetical protein FLJ21276 37.55 336.00 

Homo sapiens cDNA: FLJ231 23 fis, clone L 0.80 3.65 

ESTs 1.65 8.07 

ESTs 2.99 25.82 

Homo sapiens mRNA; cDNA DKFZp586N0121 (f 0.46 1.95 

ESTs, Moderately similar to ALU5JHUMAN A 1.00 156.00 

ESTs 0.03 7.44 

syntaxin 1 1 0.14 3.62 

protein tyrosine phosphatase, receptor t 0.01 3. 16 

ESTs 0.75 141.75 

ESTs 1.00 167.00 

calcitonin receptor-like Q.43 3.01 

matrix metalloproteinase 19 0.37 19.45 

endothelial differentiation, sphingolipi 0.14 3.35 

transient receptor potential channel 6 1 .00 94.00 

fibroblast growth factor receptor 4 0.68 1 .42 

solute carrier family 14 (urea transport 0.03 3,74 

ESTs 1.00 141.00 

growth differentiation factor 10 1.00 117.00 

colony stimulating factor 3 (granulocyte 0.75 2.20 

hypothetical protein FLJ 1 1 259 0.76 2.25 

ESTs 0.01 3.62 

ESTs 1.00 138.00 

angiopoietin-like 1 1.00 132.00 

membrane-spanning 4-domains, subfamily A 1.00 1 57.00 

multimerin 1.00 157.00 

ESTs, Highly similar to AF1 75283 1 zinc 0.34 2.24 

ESTs 0.00 1.15 

ESTs, Weakly similar to BCHUIA S-100 pro 0.01 2.06 

lymphoid nuclear protein (LAF-4) mRNA 1 .00 267.00 

ESTs 1.00 105.00 

ESTs 1.05 3.15 

triggering receptor expressed on myeloid 0.83 1 .94 

gb:nv54h12.r1 NCI_CGAP_Ew1 Homo sapiens 1.00 218.00 

ESTs 1.00 133.00 

activator of CREM in testis 0.67 122.67 

ecotropic viral integration site 2B 1 .00 142.00 

ESTs 1.00 147,00 

ESTs 1.00 141.00 

Homo sapiens cDNA: FU23123 fis, clone L 0.71 3.66 

ESTs 23.32 370.00 

ESTs 0.77 8.50 

ESTs 3.43 16.36 

ESTs 1.00 167.00 

glutamate receptor, ionotropic, AMPA 1 1.00 151.00 

small inducible cytokine subfamily A (Cy 46.47 1 53.00 

c-fos induced growth factor (vascular en 0.01 3.08 

ESTs 0.60 141.00 

ESTs 0.18 2.39 

Rho guanine exchange factor (GEF) 15 0.1 0 2. 16 

Homo sapiens mRNA; cDNA DKFZp564B2062 (f 0.01 2.53 

ESTs 0.46 2.64 

ESTs 18.05 296.00 

ESTs 1.00 141.00 

periaxin 0.56 1.38 

ESTs 0.88 4.33 

hypothetical protein FLJ23191 0.52 2.08 

retinoic acid receptor, alpha 0.79 2.01 

Wnt inhibitory factor-1 0.35 2.03 

serum deprivation response (phosphatidyl 0.13 2.25 

ESTs 1.00 116.00 

ESTs 1.00 192.00 

ESTs 0.20 2.98 

Eos Control 0.40 2.40 

NM_016369*:Homo sapiens claudin 18 (CLDN 0.53 1.77 

zinc finger protein 145 (Kruppel-like, e 0.01 3.18 

poly(A)-binding protein, nuclear 1 0.56 1.74 

surfactant, pulmonary-associated protein 0.64 1.50 

surfactant, pulmonary-associated protein 0.33 1.16 

solute earner family 21 (prostaglandin 0.53 1.55 

carbonic anhydrase IV 0.20 1 .28 

solute carrier family 34 (sodium phospha 0.78 1.90 

N-acylsphingosine amidohydrolase (acid c 0.59 1.54 
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423738 AB002134 Hs.132195 

425211 M18667 Hs.1867 

425438 T62216 Hs.270840 

5 426828 NNL000020 Hs.172670 

427019 AA001732 Hs.173233 

428043 T92248 Hs.2240 

430280 AA361258 Hs.237868 

431433 X65018 Hs.253495 

10 431723 AW058350 Hs.16762 

432985 T92363 Hs.178703 

441835 AB036432 Hs.184 

442275 AW449467 Hs.54795 

443709 AI082692 Hs.1 34662 

15 444325 AW152618 Hs.16757 

450954 A1904740 Hs.25691 

451558 NM_001089 Hs.26630 

453310 X70697 Hs.553 

456855 AF035528 Hs.153863 

20 444342 NM_014398 Hs.10887 
400754 
401045 
401083 
402474 

25 402808 
403021 
403438 
403687 
403764 

30 404277 
404288 

404518 AI815601 

405106 

405381 
35 406387 

406646 M33600 

406714 AI219304 Hs.266959 

406753 M505665 Hs.217493 

406973 M34996 Hs. 198253 

40 407248 U82275 Hs.94498 

407510 U96191 

407731 NMJHJ0066 Hs.38069 

407830 NM_001086 Hs.587 

408045 AW138959 Hs.245123 

45 408074 R20723 

408374 AW025430 Hs.155591 

409064 AA062954 Hs.141883 

409083 AF050083 Hs.673 

409153 W03754 Hs.50813 

50 409203 AA780473 Hs.687 

409238 AL049990 Hs.51515 

409389 AB007979 Hs.301281 

409718 D86640 Hs.56045 

410798 BE178622 Hs.16291 

55 411020 NM.006770 Hs.67726 

411667 BE160198 

412000 AW576555 Hs.15780 

412358 BE047490 Hs.24172 

412420 AL035668 Hs.73853 

60 412564 X83703 Hs.31432 

412869 AA290712 Hs.82407 

412870 N22788 Hs.82407 
413529 U11874 Hs.846 
413533 BE146973 

65 413689 BE157286 Hs.20631 

413724 M131466 Hs.23767 

413800 AI129238 Hs.192235 

413802 AW964490 Hs.32241 

413829 NMJJ01872 Hs.75572 

70 414376 BE393856 Hs.66915 

414577 M0555A8 Hs.72116 

414700 H63202 Hs.38163 

415078 M311223 Hs.283091 

„ 415120 N64464 Hs.34950 

75 415323 BE269352 Hs.949 

415335 AA847758 Hs.111030 

415582 W92445 Hs.165195 

416030 H15261 Hs.21948 

41 6427 BE244050 Hs.79307 

80 416464 NMJ)00132 Hs.79345 

416585' X54162 Hs.79386 

416847 L43821 Hs.80261 

417148 AA359896 Hs.293885 

417370 T28651 Hs.82030 

85 417673 T87281 Hs.16355 



calcium channel, voltage-dependent, alph 0.59 1.55 

airway trypsin-like protease 10.14 51.00 

progastricsin (pepsinogen C) 0.35 1.62 

ESTs 0.23 9.45 

activin A receptor type ll-like 1 0.03 1 .71 

hypothetical protein FLJ 10970 0.01 1.49 

uteroglobin 0.42 1.26 

interleukin 7 receptor 0.46 2.43 

surfactant, pulmonary-associated protein 0.57 1.59 

Homo sapiens mRNA; cDNA DKFZp564B2062 (f 0.29 1.80 

ESTs 0.32 2.27 

advanced glycosylation end product-speci 0.31 1.51 

ESTs 0.55 178 

ESTs O.OO 3.02 

ESTs 0.32 2.49 

receptor (calcitonin) activity modifying 0.46 1.74 

ATP-binding cassette, sub-family A (ABC1 0.52 1 .87 

solute carrier family 6 (neurotransmitte 0.00 3.30 

MAD (mothers against decapentaplegic, Dr 0.01 2.31 

similar to lysosome-associated membrane 0.66 2.20 

Target Exon 1.00 297.00 

C1 1 001 883*:gil6753278|ref INP_033938.1 1 c 1 .00 1 09.00 

NMJ)16582*:Homo sapiens peptide transpor 0.89 1.39 

NM_004079:Homo sapiens cathepsin S (CTSS 1 .45 4.47 

ENSP00000235229:SEMB. 1 .00 1 .87 

C21 000030:gi|9955960Iret]NP_063957.1 1 AT 1.00 149.00 

NM_03141 9*:Homo sapiens molecule possess 1 .06 2.96 

NM_007037*:Homo sapiens a disintegrin-ii 0.04 4.89 

NM_005463:Homo sapiens heterogeneous nuc 1 .00 225.00 

NMJJ191 1 1*:Homo sapiens major histocompa 0.97 1.93 

NM_002944*;Homo sapiens v-ros avian UR2 1 .00 68.00 

CD83 antigen (activated B lymphocytes, i 0.02 1 .83 

C1 1001 637*:gi|5032241 |refjNP_005732. 1 1 2 1 .00 235.00 

Target Exon 1.00 93.00 

Target Exon 1.37 6.02 

major histocompatibility complex, class 0.86 2.46 

hemoglobin, gamma G 0.01 3.19 

annexinA2 1.00 147.00 

major histocompatibility comp]ex, class 1 .03 2.04 

leukocyte immunoglobulin-like receptor, 1.00 64.00 

gb:Human trophoblast hypoxia-regulated f 1 .00 90.00 

complement component 8, beta polypeptide 1 .00 67.00 

arylacetamide deacetylase (esterase) 1.00 102.00 

ESTs ' 1.00 70.00 

ESTs 1.00 112.00 

forkheadboxFI 0.07 10.17 

ESTs 0.39 2.31 

interleukin 1 2A (natural killer cell sti 1 .00 95.00 

hypothetical protein FLJ20022 0.01 4.55 

cytochrome P450, subfamily IVB, polypept 0.01 3.72 

Homo sapiens mRNA; cDNA DKFZp564G1 1 2 (fr 1 .00 79.00 

Homo sapiens mRNA, chromosome 1 specific 0.14 27.35 

src homology three (SH3) and cysteine ri 1 .00 11 3.00 

gb:PM3-HT0605-270200-001-a02 HT0605 Homo 0.64 2.47 

macrophage receptor with collagenous str 0.55 2.40 

gb:QV1-HT0413-010200-059-h03 HT0413 Homo 1.00 111.00 

ATP-binding cassette, sub-family A (ABC1 1 .00 95.00 

ESTs 1.00 87.00 

bone morphogenetic protein 2 1.43 8.07 

cardiac ankyrin repeat protein 0.02 3.07 

CXCchemokineligand16 0.93 1.72 

CXC chemokine ligand 16 0.97 1.51 

interleukin 8 receptor, beta 0.02 2.42 

gb:QV4-HT0222-011 199-01 9-e05 HT0222 Homo 0.65 1.50 

zinc finger protein, subfamily 1 A, 5 (Pe 20.87 232.00 

hypothetical protein FLJ12666 1 .00 80.00 

ESTs 1.00 85.00 

ESTs, Weakly similar to S65657 alpha-1 C- 1 .00 21 3.00 

carboxypeptidase B2 (plasma) 0.02 3.93 

ESTs, Weakly similar to 16.7Kd protein [ 1.00 115.00 

hypothetical protein FLJ20992 similar to 0.49 1.94 

ESTs 0.03 3.75 

found in inflammatory zone 3 0.86 1 .95 

ESTs 1.00 120.00 

neutrophil cytosolic factor 2 (65kD, chr 0.60 2.48 

ESTs 1.00 95.00 

Homo sapiens cDNA FLJ 14237 fis, clone NT 1 .00 136.00 

ESTs 0.02 8.07 

Rac/Cdc42 guanine exchange factor (GEF) 1.00 73.00 

coagulation factor VIII, procoagulant co 0.70 3.36 

leiomodin 1 (smooth muscle) 0.06 6.56 

enhancer of filamentation 1 (cas-like do 0.70 3.66 

hypothetical protein FLJ14902 1.00 114.00 

tryptophanyl-tRNA synthetase 0.85 1.30 

ESTs 0.15 15.54 
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cystatin E7M 
ESTs 

small nuclear ribonucleoprotein D1 polyp 
cytochrome b-245, beta polypeptide (chro 
arachidonate 5-lipoxygenase 
wingless-type MMTV integration site fami 
solute carrier family 1 (neuronal/epithe 
hypothetical protein 
interleukin 6 (interferon, beta 2} 
cadherin 5, type 2, VE-cadherin (vascula 
ESTs, Moderately similar to ZN91JHUMAN Z 
ESTs 

Homo sapiens cDNA FLJ 14265 fis, clone PL 
Homo sapiens, clone IMAGE:4054868, mRNA 
annexin A3 

ESTs, Moderately similar to S29539 ribos 
granulysin 

early growth response 2 {Krox-20 (Drosop 

ESTs, Moderately similar to ALU4JHUMAN A 

osteoglycin (osteoinductive factor, mime 

ESTs, Moderately similar to AF1 61 511 1 H 

transcription factor EC 

heparan sulfate (glucosamine) 3-O-sulfot 

GTP-binding protein 

hypothetical protein FLJ11004 

vasoactive intestinal peptide receptor 1 

prostaglandin D2 synthase, hematopoietic 

DKFZP586D0824 protein 

Sushi domain (SCR repeat) containing 

ESTs 

glycoprotein A33 (transmembrane) 
ESTs 

UNCI 3 (C. e)egans)-like 
Bruton agammaglobulinemia tyrosine kinas 
Homo sapiens mRNA; cDNA DKFZp586B0220 (f 
toll-like receptor 7 

colony stimulating factor 3 receptor (gr 
secretin receptor 

ESTs, Moderately similar to A53959 throm 

hypothetical protein FLJ21940 

ESTs 

ESTs 

ESTs 

KIAA0001 gene product; putative 6-protei 

gb:nc67f04.s1 NCLCGAP_Pr1 Homo sapiens 

adrenergic, beta-2- ( receptor, surface 

chromosome 1 open reading frame 21 

ESTs 

ESTs 

ESTs 

indolethylamine N-methyltransferase 
Rho GTPase activating protein 6 
ESTs 

arrestin, beta 1 

macrophage scavenger receptor 1 
CLST 11240 protein 
CDW52 antigen (CAMPATH-1 antigen) 
phosphatidylinositol glycan, class A (pa 
matrilin 3 

angiotensin receptor 2 

semaphorin sem2 

ESTs 

ESTs 

ESTs 

ESTs, Weakly similar to transformation-r 
Homo sapiens beta-1 adrenergic receptor 
Homo sapiens cDNA: FLJ21326 fis, clone C 
ESTs 

hypothetical protein FLJ 12910 

SWi/SNF related, matrix associated, acti 

ESTs 

ESTs 

ESTs 

SEC24 (S. cerevisiae) related gene fami) 

ESTs 

ESTs 

Homo sapiens cDNA: FLJ22182 fis, clone H 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

hypothetical protein FLJ12666 
ferritin, light polypeptide 
ESTs 
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Pkey; Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey 
408074 

411667 
413533 

423387 



423696 

430212 
436532 
453531 
454741 



CAT Number 
103684J 

1253334J 
1375344J 

22779J 



23112J 

314437J 
421802.1 
97026J 
1232559 J 



Accession 

R20723 AA263003 AA333976 AA334725 AA334151 AW965490 AA310513 Ai810530 D31302 AW134897 AA830127 AA046953 AI668930 
C06094AW104534 

BE1601 98 AW935898 T1 1 520 AW935930 AW856073 AW861034 

BE146973 BE146972 BE147042 BE147018 BE146783 BE147020 BE146781 BE147019 BE146766 BE147021 BE146952 BE146767 BE147044 
BE146797 BE1 46776 BE146985 BE146793 BE146768 BE1 46771 BE146954 BE146760 BE147048 BE147025 BE147030 
AJ012074 U11087 L13288 X75299 L20295 AW630780 H14880 T28037 AI872991 R72136 AW449839 T81622 T79697 T29519 R94105 T83923 
R73300 A1797007 R73390 AA961010 H74168 A1689932 BE045543 AI808418 AI608912 AI806573 AW884084 AW872978 AW872985 AA565655 
AI022915 R50647 R73210 H45098 R46451 AW1 66269 T71 132 Al 264547 R521 46 Al 304920 R73391 AW884059 AW884085 H73241 T60038 
T79612 R73145 R50549 Al 094557 AI668793 R72302 Al 564366 W01 956 AA41 8962 W32571 R72840 H45409 R72085 R46356 R46758 
AA508805 AA41 8798 T83751 R94072 T1 61 82 AA928785 AA903896 

Z92546 AA330586 AI570568 AW341487 AI827050 AW298668 AI792189 AI015693 AI733599 AI572251 AI672488 AW193262 A1244716 

AI864375 AI2061 00 AA91 2444 AI269365 A1640254 AW772466 AI867336 AA627604 H1 6914 AA358477 AA338009 

AA4691 53 AI71 8503 AA469225 

AA721522AW975443T93070 

AA417940AA036735T07025 

BE154396 AW817959 BE154393 



TABLE 10C 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication entitled The DNA 

sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
NLposition: Indicates nucleotide positions of predicted exons. 
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TABLE 11 A: Genes Distinguishing Adenocarcinoma from Other Lung Diseases and Norma! Lung 

Table 11A shows about 84 genes upregulated In lung adenocarcinomas relative to other lung tumors, non-malignant lung disease, and normal lung. These genes were selected 
from about 59680 probesets on the Eos/Affymetrix Hu03 Genechip array. 

Table 1 1 B show the accession numbers for those Pkey's lacking UnigenelD's for table 1 1 A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Toots (DoubteTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
"Accession" column. 

Table 1 1 C show the genomic positioning for those Pkey's lacking Unigene ID's and accession numbers in table 1 1 A. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed. 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 ; Average of lung tumors (including squamous cell carcinomas, adenocarcinomas, small cell carcinomas, granulomatous and carcinoid tumors) divided by the 
average of normal lung samples 

R2; Average of non-malignant lung disease samples (including bronchitis, emphysema, fibrosis, atelectasis, asthma) divided by the average of normal lung samples 
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TABLE 11B 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT Number Accession 

410399 11995 1 BE068889 BE068882 AF044311 AF017256 NM 003087 AF037207 AF010126 AA633976 AA872836 BE298825 BE299889 AI016464 AI684600 

AI936527 AA804675 AA394097 AI139933 AA946606 BE171313 AA722407 AA293803 AI468480 AA056035 AA055968 AW796957 AI637713 
AA41 0737 H49348 AA486472 AA41 1 094 AA235594 AA402624 AA443638 AW452137 AA421 708 AW26521 1 AJ493266 AA3651 32 AW966044 

419502 18535J AU076704 T74854 T74860 T72098 T73265 T73873 T69180 T74658 T58786 T60385 T73410 T68781 T67845 T67593 T73952 T67864 T60630 

T68367 T68401 T53959 T72360 T72099 T60377 T58961 T7171 2 T72821 T64738 T74645 T72037 T68688 T72063 T73258 T72826 T64242 
T68220 T74673 T71800 T68355 T61227 T62738 T69317 T53850 T64692 T73768 T73962 T73382 T68914 T70975 T73400 T60631 T73277 
T73203 T70498 T61409 T58925 NM 000508 M64982 T68301 T73729 T69445 T60424 T67922 T67736 T68716 T67755 T74765 T73819 T58719 
T74756 T60477 T74863 T61 109 T68329 T58850 T71857 T73425 T53736 T68607 T58898 T64309 T72031 T72079 T64305 T71908 T681 07 
T71916 T73787 T56035 T64425 T71870 T60476 T61376 T67820 T71895 T41006 T69441 T68170 T74617 T71958 T69440 T61875 R06796 
H48353 T71914 T53939 T64121 AA693996 T72525 T67779 T68078 AA01 1465 AA345378 AV654847 AV654272 AV656001 A1064740 T82897 
N33594 AA344542 AW805054 AI207457 T61743 AA026737 H94389 AA382695 AA918409 T68044 S82092 T39959 AI017721 AA312395 
AA31291 9 T40156 H66239 AV652989 H38728 R98521 AV655200 R95790 W03250 W0091 3 AA3441 36 AV660126 R97923 AA343596 
AW470774 AV651256 N54417 AA812862AW182929AI111192 H61463 H72060 AA344503 H38639 AI277511 AV661108 AI207625 T47810 
AA235252 T27853 T47778 R95746 H70620 AA701463 AW827166 R98475 C20925 AV657287 T71959 T71313 T73920 T73333 T61618 T69293 
T69283 T73931 T721 78 T72456 AV645639 AV653476 T72957 T72300 T58906 T71457 T70494 T72956 T70495 T68267 T74407 T85778 
AA344726 T27854 T74485 T74101 T73868 T71518 T72304 AA343853 T73909 T68070 T72065 H72149 T73493 T73495 AV645993 R02293 
T70475 T64751 AA344441 AA343657 AA345732 AA344328 AH 10639 AA344603 AF063513 T64696 T68516 T72223 T60507 T67633 R29500 
T72517 R02292 T60599 T69206 T70452 T74677 R29366 T61 277 T74914 T60352 R29675 T74843 AV645792 AA344408 T691 97 T72057 
T69368 T69358 T68258 AV650429 T73341 T61702 T74598 T40095 K02272 T40106 AA343045 AA341908 AA341907 AA342807 AA341964 
T53747 T72042 T62764 AI064899 AA343060 T67832 T72440 T71770 T68091 T69108 T72449 T69167 T71289 T68251 AV654844 T64375 
AA345234 T67598 AA01 1 41 4 T68036 H48262 AI207557 T6821 9 W86031 T69081 T64232 R931 96 T621 36 AV650539 H67459 T72978 
AA344583T60362 H58121 T95711 T72803T68055T71715 R29036T72793 T69122T64595T62888T69139T68291 T64652T67971 T46862 
AA693592 AI248502 R29454 T64764 T57001 T73052 T71429 T51176 T58866 AV655414 H90426 AA342489 T73666 T67848 T72512 T53835 
T67837 T73317 T74273 T69420 T68245 T74380 T67862 T74474 T56068 

421582 2041 1 AI910275 X00474 X52003 X05030 NM_003225 AA314326 AA308400 AA506787 AA314825 A1571948 AA507595 AA614579 AA587613 R83818 

AA568312 AA614409 AA307578 AI925552 AW950155 AI910083 M12075 BE074052 AW004668 AA578674 AA582084 BE074053 BE074126 
BE074140 AA514776 AA588034 BE074051 BE074068 AW009769 AW050690 AA858276 R55389 AI001 051 AW050700 AW750216 AA614539 
BE074045 A1307407 AW602303 BE073575 A1202532 AA524242 AI970839 AI909751 BE076078 AI909749 R55292 

437866 44433 2 AA156781 AW293839 U 52054 AA024963 AA778446 BE073977 AW444904 AW602574 BE164040 BE164012 BE163972 BE163974 BE163992 

AA837481 AW468444 BE185091 AW468002 AA687333 AA81 1 830 AA58 1806 AI866686 AI572124 AA043777 AA040926 D20160AI536733 
AA812489 AW874142 AI471883 W84421 AA156850 

451807 8865J W52854AL1 17600 BE208116 BE208432 BE206239 BE082291 AW953423 AA351619 BE180648 B E1 40560 W60080 AA865478 N90291 

AW450652 AW44951 9 AA993634 AI806539 AA351618 AW449522 A1827626 AA904788 AA380381 AA886045 AA774409 BE003229 241756 



TABLE 11C 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication entitled The DNA 

sequence of human chromosome 22." Dunham I. et a!., Nature (1999) 402:489-495. 

Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 

Pkey Ref Strand NLposition 

403329 8516120 Plus 96450-96598 

406399 9256288 Minus 63448-63554 
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TABLE 12A: Genes Distinguishing Squamous Cell Carcinoma from Other Lung Diseases and Normal Lung 

Table 12A shows about 72 genes upregulated in squamous cell carcinomas of the lung relative to other lung tumors, non-malignant lung disease, and normal lung. These genes 
were selected from about 59680 probesets on the Eos/Affymetrix Hu03 Genechip array. 

Table 12B show the accession numbers for those Pkey's lacking UnigenelD's for table 12A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
"Accession" column. 

Table 12C show the genomic positioning for those Pkey's lacking Unigene ID's and accession numbers in table 12A. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed. 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1: Average of lung tumors (including squamous cell carcinomas, adenocarcinomas, small cell carcinomas, granulomatous and carcinoid tumors) divided by the 
average of normal lung samples 

R2: Average of non-malignant lung disease samples {including bronchitis, emphysema, fibrosis, atelectasis, asthma) divided by the average of normal lung samples 
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Hs.293317 


ESTs, Weakly similar to GGC1JHUMAN G ANT 
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1.00 


428645 


AA431400 


Hs.98729 


ESTs, Weakly similar to 2017205A dihydro 


1.00 


16.00 


428748 


AW593206 


Hs.98785 


Ksp37 protein 


1.00 


87.00 


429259 


AA420450 


Hs.292911 


ESTs, Highly similar to S60712 band-6-pr 


2,01 


1.18 


429538 


BE182592 


Hs.1 1261 


small proline-rich protein 2A 


4.43 


2.90 


429903 


AL134197 


Hs.93597 


cyclin-dependent kinase 5, regulatory su 


11.80 


1.00 


430486 


BE062109 


Hs.241551 


chloride channel, calcium activated, fam 


12.28 


41.00 


430890 


X54232 


Hs.2699 


glypican 1 


1.58 


1.40 


431009 


BE149762 


Hs.48956 


gap junction protein, beta 6 (connexin 3 


60.25 


28.00 


431846 


BE019924 


Hs.271580 


uroplakin 1B 


4.49 


2.51 


433091 


Y12642 


Hs.3185 


lymphocyte antigen 6 complex, locus D 


1.20 


1.09 


434360 


AW015415 


Hs.1 27780 


ESTs 


40.98 


27.00 


434880 


U02388 


Hs.101 


cytochrome P450, subfamily IVF, polypept 


1.00 


1.00 


435505 


AF200492 


Hs.211238 


interleukin-1 homolog 1 


1.00 


38.00 


435793 


AB037734 


Hs.4993 


KIAA1313 protein 


23.68 


42.00 


436511 


AA721252 


Hs.291502 


ESTs 


16.76 


14.00 


438403 


AA806607 


Hs.292206 


ESTs 


1.00 


1.00 


439285 


AL133916 




hypothetical protein FLJ20093 


46.23 


139.00 


439606 


W79123 


Hs.58561 


G protein-coupled receptor 87 


33.61 


1.00 


439670 


AF088076 


Hs.59507 


ESTs, Weakly similar to AC004858 3 U1 sm 


1.00 


1.00 


439706 


AW872527 


Hs.59761 


ESTs, Weakly similar to DAP1JHUMAN DEATH 


86.55 


11.00 


440325 


NM 003812 


Hs.7164 


a disintegrin and metalloproteinase doma 


62.88 


147.00 


441525 


AW241867 


Hs.1 27728 


ESTs 


1.53 


1.42 


443162 


T49951 


Hs.9029 


DKFZP434G032 protein 


31.11 


38.00 


444378 


R41339 


Hs.12569 


ESTs 


1.00 


1.00 
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446292 


AF081497 


Hs.279682 


Rh type C glycoprotein 


447078 


AW885727 


Hs.9914 


ESTs 


447342 


AM 99268 


Hs.19322 


Homo sapiens, Similar to RIKEN cDNA 2010 


449003 


X76342 


Hs.389 


alcohol dehydrogenase 7 (class IV), mu o 


449101 


AA205847 


Hs.23016 


G protein-coupled receptor 


450832 


AW970602 


Hs.1 05421 


ESTs 


452240 


A1591147 


Hs.61232 


ESTs 


453317 


NM 002277 


Hs.41696 


keratin, hair, acidic, 1 


453830 


AA534296 


Hs.20953 


ESTs 


454098 


W27953 


Hs.292911 


ESTs, Highly similar to S60712 band-6-pr 


455601 
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Hs.816 


SRY (sex determining region Y)-box 2 
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1.26 
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27.00 


25.17 


36.00 


13.42 


1.00 
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TABLE 12B 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT Number Accession 

439285 47065 1 AL1 33916 N791 13 AF086101 N76721 AW950828 AA364013 AW955684 AI346341 AI867454 N54784 AI655270 AI421279 AW014882 

AA775552 N62351 N59253 AA626243 AI341 407 BE 175639 AA456968 AI35891 8 AA457077 



TABLE 12C 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (G!) numbers. "Dunham I. 

sequence of human chromosome 22." Dunham I. et a!., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
Imposition: indicates nucleotide positions of predicted exons. 



et al." refers to the publication entitled 'The DNA 



Pkey 


Ref 


Strand 


400666 


8118496 


Plus 


401780 


7249190 


Minus 


401781 


7249190 


Minus 


401785 


7249190 


Minus 


401994 


4153858 


Minus 


402075 


8117407 


Plus 


404996 


6007890 


Plus 



NLpositlon 

17982-18115,20297-20456 

28397-28617,28920-29045,29135-29296,29411-29567,29705-29787,30224-30573 

83215-83435,83531-83656,83740-83901,84237-84393,84955-85037,86290-86814 

165776-165996,166189-166314,166408-166569,167112-167268,167387-167469,168634-168942 

42904-43124,43211-43336,44607-44763,45199-45281,46337-46732 

121907-122035,122804-122921,124019-124161,124455-124610,125672-126076 

37999-38145,38652-38998,39727-39872,40557-40674,42351-42450 
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Table 13A shows about 23 genes upregulated in non-malignant lung disease relative to lung tumors and normal lung. These genes were selected from about 59680 probesets on 
the Eos/Affymetrix Hu03 Genechip array. 

Table 13B show the accession numbers for those Pkey's lacking UnigenelD's for table 13A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
"Accession" column. 

Table 13Cshow the genomic positioning for those Pkey's lacking Unigene ID's and accession numbers in table 13A. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 : Average of lung tumors {including squamous cell carcinomas, adenocarcinomas, small cell carcinomas, granulomatous and carcinoid tumors) divided by the 
average of normal lung samples 

R2: Average of non-malignant lung disease samples (including bronchitis, emphysema, fibrosis, atelectasis, asthma) divided by the average of normal lung samples 



Pkey 


ExAccn 


UnigenelD 


Unigene Title 


R1 


R2 


408562 


AI436323 


Hs.31141 


Homo sapiens mRNA for KIAA1568 protein, 


1.00 


230.00 


409031 


AA376836 


Hs.76728 


ESTs 


1.00 


128.00 


412372 


R65998 


Hs.285243 


hypothetical protein FLJ22029 


1.00 


173.00 


415910 


U20350 


Hs.78913 


chemokine (C-X3-C) receptor 1 


1.00 


145.00 


417511 


AL049176 


Hs.82223 


chordin-like 


1.00 


179.00 


418819 


AA228776 


Hs.191721 


ESTs 


100 


140.00 


422060 


R20893 


Hs.325823 


ESTs, Moderately similar to ALU5_HUMAN A 


1.00 


156.00 


424585 


AA464840 


Hs.131987 


ESTs 


1.00 


167.00 


426753 


T89832 


Hs. 170278 


ESTs 


1.00 


141.00 


429496 


AA453800 


Hs.1 92793 


ESTs 


1.00 


138.00 


430719 


AA488988 


Hs.293796 


ESTs 


1.00 


133.00 


431089 


BE041395 




ESTs, Weakly similar to unknown protein 


23.32 


941.00 


431385 


BE1 78536 


Hs.11090 


membrane-spanning 4-domains, subfamily A 


1.00 


157.00 


431728 


NM__007351 


Hs.268107 


multimerin 


1.00 


157.00 


436532 


AA721522 




gb:nv54h12.r1 NCLCGAP_Ew1 Homo sapiens 


1.00 


218.00 


437960 


AI669586 


Hs.222194 


ESTs 


1.00 


147.00 


438202 


AW1 69287 


Hs.22588 


ESTs 


1.00 


141.00 


441499 


AW298235 


Hs.101689 


ESTs 


1.00 


167.00 


444513 


AL120214 


Hs.7117 


glutamate receptor, ionotropic, AMPA 1 


1.00 


151.00 


448253 


H25899 


Hs.201591 


ESTs 


1.00 


141.00 


453636 


R67837 


Hs.169872 


ESTs 


1.00 


116.00 


458332 


AI000341 


Hs.220491 


ESTs 


1.00 


192.00 


459587 


AA031956 




gb:zk1 5e04.s1 Soares_pregnant_uterus_NbH 


1.00 


154.00 



TABLE 13B 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT Number Accession 

431089 327825 1 B E04 1 395 AA49 1826 AA62 1946 AA7 15980 AA6661 02 
436532 421 802 J AA721 522 AW975443 T93070 



TABLE 13C 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank \dentifier (Gl) numbers. "Dunham I. el al" refers to the publication entitled The DNA 

sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 

Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 

Pkey Ref Strand NLposition 

402075 8117407 Plus 121907-122035,122804-122921,124019-124161,124455-124610,125672-126076 



180 



WO 02/086443 PCT/US02/12476 

TABLE 14A: Preferred Utility and Subcellular Localization for Potential Lung Disease Targets 

Table HA shows the subcellular localization and preferred utility for the genes appearing in Tables 9A and 10A. mAb symbolizes monoclonal antibody, diag symbolizes 
diagnostic, s.m. symbolizes small mofecule, and CTL symbolizes cytotoxic lymphocytic ligand. These genes were selected from 59680 probesets on the Eos/Afrymetrix Hu03 
Genechip array. 

Table 14B show the accession numbers for those Pkey's lacking UnigenelD's for table 14A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESJs and mRNAs. These sequences were cluslered based on sequence 
similarity using Clustering and Alignment Tools (DoubieTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
"Accession" column. 

Table 14C show the genomic positioning for those Pkey's lacking Unigene ID's and accession numbers in table 14A. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed. 



Pkey: Unique Eos probeset identifier number 
ExAccn: Exemplar Accession number, Genbank accession number 
UnigenelD: Unigene number 
Unigene Title: Unigene gene title 
Pref. Utility; Preferred Utility 



Pred.Loc: 


Predicted subcellular localization 
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ExAccn 


UnigenelD 


Unigene Title 


Pref Utility 


Pred. Loc 


400289 


X07820 


Hs.2258 


matrix metaWoprotemase 10 (stromelysin 


mAb & diag & s.m. 


extracellular 


400303 


AA242758 


Hs.79136 


L1V-1 protein, estrogen regulated 


mAb 


plasma membrane 


402075 






ENSP00000251056*:Plasma membrane calcium 


mAb & diag 


secreted 


407811 


AW1 90902 


Hs.40098 


cysteine knot superfamily 1, BMP antagon 


diag 


secreted 


408243 


Y00787 


Hs.624 


interleukin 8 


diag 


secreted 


408790 


AW580227 


Hs.47860 


neurotrophic tyrosine kinase, receptor, 


mAb & s.m. 


plasma membrane 


408908 


BE296227 


Hs.250822 


serine/threonine kinase 15 


s.m. 


cytoplasm 


409041 


AB033025 


Hs.50081 


Hypothetical protein, XP_051860 (KIAA119 


CTL & diag 


secreted 


409103 


AF251237 


Hs. 11 2208 


XAGE-1 protein 


CTL 


nuclear 


409420 


Z15008 


Hs.54451 


laminin, gamma 2 {nicein (100kD), kalini 


diag 


secreted 


409632 


W74001 


Hs.55279 


serine (or cysteine) proteinase inhiblto 


diag 


secreted 


409757 


NMJ)01898 


Hs.123114 


cystatin SN 


diag 


extracellular 


409893 


AW247090 


Hs.57101 


minichromosome maintenance deficient (S. 


CTL 


nuclear 


409956 


AW1 03364 


Hs.727 


inhibin, beta A (activin A, activin AB a 


diag 


extracellular 


410001 


AB041036 


Hs.57771 


kallikrein 1 1 


diag 


extracellular 


410407 


X66839 


Hs.63287 


carbonic anhydrase IX 


mAb & s.m. 


plasma membrane 


410418 


D31382 


Hs.63325 


transmembrane protease, serine 4 


mAb & diag & s.m. 


plasma membrane 


412140 


AA219691 


Hs.73625 


RAB6 interacting, kinesin-like (rabkines 


s.m. 




412719 


AW016610 


Hs.816 


ESTs 


s.m. 


nuclear 


414774 


X02419 


Hs.77274 


plasminogen activator, urokinase 


diag 


extracellular 


414883 


AA926960 




CDC28 protein kinase 1 


s.m. 




415138 


C18356 


Hs.295944 


tissue factor pathway inhibitor 2 


CTL & diag 


extracellular 


415669 


NM_005025 


Hs.78589 


serine (or cysteine) proteinase inhibito 


mAb & diag & s.m. 


secreted 


415817 


U88967 


Hs.78867 


protein tyrosine phosphatase, receptor-! 


mAb & s.m. 


plasma membrane 


416658 


U03272 


Hs.79432 


fibrillin 2 (congenital contractural ara 


diag 


extracellular 


417034 


NM_006183 


Hs.80962 


neurotensin 


diag 


extracellular 


417079 


U65590 


Hs.81134 


interleukin 1 receptor antagonist 


diag 


extracellular 


417308 


H60720 


Hs.81892 


KIAA0101 gene product 


s.m. 


mitochondrial 


417389 


BE260964 


Hs.82045 


midkine (neurite growth-promoting factor 


mAb & diag 


secreted 


417433 


BE270266 


Hs.82128 


5T4 oncofetal trophoblast glycoprotein 


mAb 


plasma membrane 


417933 


X02308 


Hs.82962 


thymidylate synthetase 


s.m. 


endoplasmic reticulum 


418478 


U38945 


Hs.1174 


cyclin-dependent kinase inhibitor 2A (me 


s.m. 


cytoplasm 


418506 


AA084248 


Hs.85339 


G protein-coupled receptor 39 


mAb & s.m. 


plasma membrane 


418678 


NM_001327 


Hs.1 67379 


cancer/testis antigen (NY-ESO-1) 


CTL 


cytoplasmic 


419121 


AAd/4o/<£ 


HS. 030^0 


parathyroid hormone-like hormone 


diag 


secreted 


419171 


NM_002846 


Hs.89655 


protein tyrosine phosphatase, receptor t 


mAb & s.m. 


plasma membrane 


419183 


U60669 


Hs.89663 


cytochrome P450, subfamily XXIV (vitamin 


CTL & s.m. 


mitochondrial 


419216 


AU076718 


Hs.1 64021 


small inducible cytokine subfamily B (Cy 


diag 


secreted 


419235 


AW470411 


Hs.288433 


neurotrimin 


mAb & diag 


plasma membrane 


419452 


U33635 


Hs.90572 


PTK7 protein tyrosine kinase 7 


mAb & s.m. 


plasma membrane 


419556 


U29615 


Hs.91093 


chitinase 1 (chitotriostdase) 


mAb & diag 


extracellular* 


420610 


AI683183 


Hs.99348 


distal-less homeo box 5 


CTL 


nuclear 


421110 


AJ250717 


Hs.1 355 


cathepsin E 


sm & diag 


extracellular 


421379 


Y15221 


Hs.103982 


small inducible cytokine subfamily B (Cy 


diag 


secreted 


421474 


U76362 


Hs.1 04637 


solute carrier family 1 (glutamate trans 


mAb & s.m. 


plasma membrane 


421552 


AF026692 


Hs.105700 


secreted frizzled-related protein 4 


diag 


secreted 


421753 


BE314828 


Hs.107911 


ATP-binding cassette, sub-family B (MDR/ 


mAb & s.m. 


plasma membrane 


421817 


AF146074 


Hs.108660 


ATP-binding cassette, sub-family C (CFTR 


mAb & s.m. 


plasma membrane 


422109 


S73265 


Hs.1473 


gastrin-releasing peptide 


diag 


secreted 


422158 


L10343 


Hs.1 12341 


protease inhibitor 3, skin-derived (SKAL 


diag 


secreted 


422282 


AF019225 


Hs.1 14309 


apolipoprotein L 


diag 


secreted 


422283 


AW411307 


Hs.114311 


CDC45 (cell division cycle 45, S.cerevis 


s.m. 


nuclear 


422424 


Al 186431 


Hs.296638 


prostate differentiation factor 


diag 


extracellular 


422765 


AW409701 


Hs.1578 


bacutovirat IAP repeat-containing 5 (sur 


s.m. 


cytoplasm 


422809 


AK001379 


Hs.121028 


hypothetical protein FLJ 10549 


s.m. 


nuclear 


422867 


L32137 


Hs.1 584 


cartilage oltgomeric matrix protein (pse 


diag 


extracellular 


422956 


BE545072 


Hs.1 22579 


ECT2 protein (Epithelial cell transform! 


CTL & s.m. 




423634 


AW959908 


Hs.1690 


heparin-binding growth factor binding pr 


diag 




423673 


BE003054 


Hs.1695 


matrix metalloproteinase 12 (macrophage 


mAb & diag & s.m. 


secreted 


423961 


D13666 


Hs.136348 


periostin (OSF-2os) 


mAb & diag 


extracellular 


424046 


AF027866 


Hs.1 38202 


serine (or cysteine) proteinase inhibito 


diag 


secreted 


424381 


AA285249 


Hs.146329 


protein kinase Chk2 


s.m. 


nuclear 
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WO 02/086443 



424502 


AF242388 


Hs. 149585 


lengsin 


s.m. 


cytoplasmic 


424503 


mm 00220*5 


Hs 149609 


infpnrin alnha 5 ffihronpntin rpnpntnr 


mAb & s.m. 


plasma membrane 


424687 


105070 


Hs 151738 

1 lo. I J I | \JU 


matrix metalloproteinase 9 (gelatinase B 


diag 


extracellular 


425247 


MM 005940 


H<; 155*324 
no. i ijoott 


matriv mpfallnnrntpina^p 1 1 fctrnmplv^in 

1 1 loll IA 1 1 IwtOlll/UI ULCII IQtfO 1 1 \wU \Jl 1 1CI jrOll J 


mAb & diag & s.m. 


secreted 


425322 


1 IR^fi^O 


Hs 155R17 


p rots in kinase, DNA-activated, catalytic 


s.m. 


cytoplasmic 


425650 


NM 001944 


Hs.1925 


desmoQlein 3 (pemphigus vuigaris antigen 


mAb 


plasma membrane 


425734 


AF056209 


Hs. 159396 


peptidylglycine alpha-amidating monooxyg 


s.m. 


plasma membrane 


425776 


U25128 


Hs. 159499 


parathyroid hormone receptor 2 


mAb & diag 


425852 


AK001504 


Hs.159651 


death receptor 6, TNF superfamily member 


mAb & s.m. 


plasma membrane 


426215 


AW963419 


Hs. 155223 


stanniocalcin 2 


mAb & diag 


secreted 


426427 


M86699 


Hs. 169840 


TTK protein kinase 


CTL & s.m. 


nuclear 


426514 


BE616633 


Hs.170195 


bone morphogenetic protein 7 (osteogenic 


mAb & diag 


secreted 


427335 


AA448542 


Hs.251677 


G antigen 7B 


CTL 


cytoplasmic 


427747 


AW411425 


Hs.1 80655 


serine/threonine kinase 12 


s.m. 


cytoplasmic 


428242 


H55709 


Hs 2250 


leukemia inhibitory factor (cholinergic 


diag 


extracellular 


428330 


L22524 


Hs.2256 


matrix metalloproteinase 7 (matrilysin, 


mAb & diag & s.m. 


428450 


NM 014791 


Hs. 184339 


KIAA0175 gene product 


s.m. 


nuclear 


428479 


Y00272 


Hs 334562 


cell division cycle 2, G1 to S and G2 to 


s.m. 


nuclear 


428484 


AF1 04032 


Hs 184601 


solute carrier family 7 (cationic amino 


mAb & s.m. 


plasma membrane 


428664 


AK001666 

nlww IVwy 


Hs 189095 

1 Mi 1 UwW\?W 


similar to SALL1 (sal {Drosophila)-like 


CTL & s.m. 


nuclear 


428698 


AA852773 


Hs 334838 


KIAA1866 protein 
Ksp37 protein 


mAb 




428748 


AW593206 


Hs.98785 


diag 


extracellular 


428758 


AA433988 


Hs.98502 


CA1 25 antigen; mucin 16 


diag 


mitochodria* 


428969 


AF1 20274 


Hs. 194689 


artemin 


diag 


extracellular 


42921 1 


AF052693 


Hs. 198249 


gap junction protein, beta 5 (connexin 3 


mAb & s.m. 


plasma membrane 


429263 


AA019004 


Hs.1 98396 


ATP-binding cassette, sub-family A (ABC1 


mAb & s.m. 


plasma membrane 


429547 


AW009166 


Hs.99376 


ESTs 


diag 


secreted 


429610 


AB024937 


Hs 211092 


LUNX protein; PLUNC (palate lung and nas 


mAb & diag 


secreted 


429903 


AL134197 


Hs 93597 


cyclin-dependent kinase 5, regulatory su 


s.m. 


plasma membrane 


430486 


BE062109 


Hs.241551 


chloride channel, calcium activated, fam 


mAb & s.m. 


431462 


AW583672 


Hs.256311 


granin-like neuroendocrine peptide precu 


diag 


extracellular 


431515 


NM 012152 


Hs 258583 


endothelial differentiation, lysophospha 


mAb & s.m. 


plasma membrane 


431846 


BE019924 


Hs 271580 


u ro pi akin 1 B 


mAb & diag 


plasma membrane 


4*31958 


X63629 


Hs.2877 


cadherin 3, type 1, P-cadherin (placenta 


mAb & diag 


plasma membrane 


432201 


AI53R611 


Hs 298241 


Transmpmhranp nrntpa^fl sprins 3 

1 1 alio 1 1 LCI 1 IUJ CJI 1C piULCagC) OCtJMC V/ 


mAb & diag & s.m. 


plasma membrane 




nr C I / O 1 0 


Hs 27QQ05 


rlone HO0310 PRO0310n1 


s.m. 


nuclear 


4^5505 


AF200492 


Hs 211238 


interleukin-1 homolog 1 


diag 


secreted 


436481' 


AA^7Q5Q7 


Hs 51 QQ 


HSPC150 protein similar to ubiquitin-con 


s.m. 


cytoplasm 


437016 


AU076916 


Hs 5398 


guanine monphosphate synthetase 


s.m. 


437044 


AL 035864 


Hs 69517 


differentially expressed in Fanconi's an 


CTL 


ER 


437789 


AI581344 


Hs.127812 


ESTs, Weakly similar to T17330 hypotheti 


CTL 


nuclear 


437852 


BE001836 


Hs.256897 


ESTs, Weakly similar to dJ365012.1 [H.sa 


mAb & s.m. 


plasma membrane 


439223 


AW238299 


Hs.250618 


UL16 binding protein 2 

ESTs, Moderately similar to GFR3_HUMAN G 


mAb 


plasma membrane 


439477 


W69813 


Hs.58042 


mAb & s.m. 




439606 


W79123 


Hs.58561 


G protein-coupled receptor 87 


mAb & s.m. 


plasma membrane 




RF24R502 


Hs 9598 


sema domain, immunoglobulin domain {lg)» 


mAb & s.m. 


plasma membrane 


440006 


AK000517 


Hs.6844 


NALP2 protein; PYRIN-Containing APAF1-Ii 


s.m. 


nuclear 


441362 


BE614410 


Hs 23044 


RAD51 {S. cerevisiae) homolog (E coli Re 


s.m. 




442117 


AW664964 


Hs 1 28899 


ESTs; hypothetical protein for IMAGE:447 


mAb & s.m. 


plasma membrane 


443247 


BE6 14387 


Hs 333893 

1 IOiWWWUWW 


c-Myc target JP01 


CTL 


extracellular* 


443426 


AF098158 


Hs.9329 


chromosome 20 open reading frame 1 


CTL 




443859 


NM_0 13409 


Hs.9914 


foliistatin 


diag 


extracellular 


444006 


BE395085 


Hs 10086 


type I transmembrane protein Fn14 


mAb 


plasma membrane 


444371 


RF540274 


Hs.239 


fnrkhpaH hnx M1 

IUI IMIOaU UUA JVJ 1 


s.m. 


nuclear 


444381 


BE387335 

ULuU f WWW 


Hs 283713 


ESTs, Weakly similar to S64054 hypotheti 


diag 


secreted 


444781 


NM 014400 


Hs 11950 

J IO. | i wwU 


GPI-anchored metastasis-associated prote 


mAb & diag 


plasma membrane 


445537 


A I245R71 


Hs 12R44 


PHF-likp-Hnmnin mnltinlp 6 

L-V3I ImG^JVI 1 ION lj lllUIUpIC \J 


mAb & diag 


secreted 


446619 


AU076643 


Hs.313 


secreted phosphoprotein 1 (osteopontin, 


diag 


secreted 


446921 


AB012113 


Hs 16530 


small inducible cytokine subfamily A (Cy 


diag 


extracellular 


447033 


AI357412 


Hs.1 57601 


ESTs 


CTL & diag 


secreted 


447342 


AI199268 


Hs.1 9322 


Homo sapiens, Similar to RIKEN cDNA 2010 


CTL 


plasma membrane 


448243 


AW369771 


Hs.52620 


integrin, beta 8 


mAb & s.m. - 


448844 


AI581519 


Hs.1 771 64 


ESTs 


mAb & s.m. 




449048 


Z45051 


Hs 22920 


similar to S68401 (cattle) glucose indue 


mAb 


plasma membrane 


449722 


BE280074 


Hs 23960 


cyclin B1 


s.m. 


cytoplasm 


450001 


MM 001044 


Hs.406 


solute carrier family 6 (neurotransmitte 


mAb & s.m. 


plasma membrane 


450375 


AA009647 




a disintegrin and metalloproteinase doma 


mAb & diag & s.m. 


plasma membrane 


450701 


H39960 


Hs 288467 


hypothetical protein XP_098151 (leucine- 


mAb & diag 


plasma membrane 


450983 


AA305384 


Hs. 25740 


ER01 (S. cerevisiae)-like 


diag 


secreted 


451668 


Z43948 


Hs!326444 


cartilage acidic protein 1 


mAb & diag 


plasma membrane 


452281 


T93500 


Hs.28792 


Homo sapiens cDNA FLJ1 1041 fis, clone PL 


diag 




452401 


NMJX)7115 


Hs.29352 


tumor necrosis factor, alpha-induced pro 


diag 


extracellular 


452747 


BE153855 


Hs.61460 


Ig superfamily receptor LNIR 


mAb 


plasma membrane 


452838 


U65011 


Hs.30743 


preferentially expressed antigen in mela 


CTL 


nuclear 


453968 


AA847843 


Hs.62711 


High mobility group (nonhistone chromoso 


CTL & s.m. 


nuclear 


457489 


AI693815 


Hs.127179 


cryptic gene 


diag 


secreted 



TABLE 14B 



Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT Number Accession 
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WO 02/086443 

414883 15024J 



450375 83327J 



PCT/US02/12476 

AA926960 AA926959 W76521 W24270 W21526AA037172 BE267636 H83186 AA469909 N86396 M001348 BE535736AA081745 BE566245 
AA082436 H72525 H77575 N49786 W80565 H78746 BE569085 W04339 R981 27 T55938 BE279271 AW960304 T29812 AA476873 BE297387 
AA292753 AA177048 NM_001826 X54941 BE314366 AA908783 AI719075 BE270172 BE269819 AA889955 AI204630 W25243 AI935150 
AA872039 W72395 T99630 AI422691 H98460 N31423 BE255916 H03265 AI857576 M776920 AA910644 AA459522 AA293140 AW514667 
R75953 AW662396 AA662522 AI865147 A1423153 AW262230 AA584410 AA583187 AW024595 AW069734 AI828996 M282997 AA876046 
AW61 3002 AA527373 AW972459 AI831360 AA621 337 AA1 00926 AA77241 8 AA594628 AI033892 W95096 AI034317 AA398727 AI085031 
N95210 AI459432 AI041437 AA932124 AA627684 AA935829 AI004827 AI423513 AI094597 H42079 R54703 AI630359 AA617681 M978045 
AA643280W44561 AI991 988 Al 537692 AI090262 M74081 7 AI31 2104 AI91 1822 M41 6871 Al 1 85409 AA1 29784 AA70 1623 Al 075239 
AI139549 AA633648 AI339996 A1336880 AA399239 AI078708 AI085351 AI362835 AI34661 8 AI146955 AI989380 AI348243 N92892 AA765850 
AI494230A1278887AA962596AI492600W80435AA001979 R97424A1 12901 5 N241 27 AA1 57451 AA235549 AA459292 AA0371 14 AA1 29785 
A1494211 AW059601 AW886710 R92790 N59755 A1361128 AW5B9407 H47725 H97534 H48076 H48450T99631 AW300758 H03431 R76789 
AA954344 H77576 R96823 AI457100 N92845 N49682 H42038 BE220698 BE220715 H99552 M701624 N74173 R54704 H79520 H72923 
H03266 BE261919 AA769633 AA480310 AA507454 AA9 10586 AI203723 AW1 04725 W25611 W25071 T88980 H03513 T77589 R99156 
W95095 R97470AA702275T77551 AA911952H82956 N83673 AA283672 

AA009647 AA1 31254 AA374293 AW954405 H0441 0 AW606284 AA1 51 1 66 BE1 57467 BE1 57601 H04384 W46291 AW663674 H04021 H01 532 
AA190993 H03231 H59605 H01642 AA852876 AA1 13758 AA626915 M746952 AI161014 AA099554 R69067 



TABLE 14C 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (G!) numbers. "Dunham I, et al." refers to the publication entitled "The DNA 

sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 

Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 

Pkey Ref Strand NLposition 

402075 8117407 Plus 121907-122035,122804-122921,124019-124161,124455-124610,125672-126076 



183 



WO 02/086443 



PCT/US02/12476 



TABLE 15A: Information for all sequences in Table 16 

Table 15A shows the Seq ID No, Pkey, ExAccn, UnigenelD, and Unigene Title for ail of the sequences in Table 16. 

Table 15B show the accession numbers for those Pkey's lacking UnigenelD's for table 15A. For each probeset we have listed the gene cluster number from which the 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence 
similarity using Clustering and Alignment Tools {DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 
"Accession" column. 

Table 15C show the genomic positioning for those Pkey's lacking Unigene ID's and accession numbers in table 15A. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also listed. 



Seq ID No: Sequence ID number 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 



Seq ID No: 

Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq !D No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 



1&2 
3&4 
5&6 
7&8 
9&10 
11 & 12 
13&14 
15&16 
17&18 
19&20 
21 & 22 
23&24 
25&26 
27&28 
29&30 
31&32 
33&34 
35&36 
37&38 
39&40 
41&42 
43&44 
45&46 
47&48 
49 

50&51 
52&53 
54&55 
56& 57 
58&59 
60&61 
62&63 
64&6S 
66& 67 
68& 69 
70&71 
72&73 
74&7S 
76&77 
78&79 
80&81 
82 & 83 
84&85 
86&87 
88&89 
90&91 
92&93 
94&95 
96&97 
98&99 
100 & 101 
102 & 103 
104 & 105 
106 & 107 
108 & 109 
110 & 1 11 
112&113 
114&115 
116 

117&118 
119 & 120 
121 & 122 
123 & 124 
125 & 126 



Pkey 

410407 
412719 
417034 
430486 
407788 
407788 
407788 
407788 
439285 
413753 
120486 
425650 
412140 
423673 
452838 
418663 
418663 
409632 
429610 
406690 
431846 
418830 
424098 
443648 
311034 
408522 
422158 
435505 
417366 
431958 
441020 
423217 
429538 
448733 
444371 
444371 
444371 
422168 
422168 
429259 
426440 
437044 
423662 
428484 
429211 
417389 
423634 
417515 
441362 
425322 
449003 
431009 
409103 
417542 
428471 
418004 
414761 
418203 
447343 
437016 
449230 
446989 
457819 
424687 



ExAccn 



AW016610 

NM__006183 

BE062109 

BE514982 

BE514982 

BE514982 

BE514982 

AL133916 

U17760 

AW368377 

NMJ)01944 

AA219691 

BE003054 

U65011 

AK001100 

AK001100 

W74001 

AB024937 

M29540 

BE019924 

BE513731 

AF077374 

AI085377 

BE567130 

AI541214 

L10343 

AF200492 

BE185289 

X63629 

W79283 

NM_000094 

BE182592 

NMJD05629 

BE540274 

BE540274 

BE540274 

M586894 

AA586894 

M420450 

BE382756 

AL035864 

AK001035 

AF1 04032 

AF052693 

BE260964 

AW959908 

L24203 

BE614410 

U63630 

X76342 

BE149762 

AF251237 

J04129 

X57348 

U37519 

AU077228 

X54942 

M256641 

AU076916 

BE613348 

AK001898 

AA057484 

J05070 



UnigenelD 

Hs.63287 

Hs.816 

Hs.80962 

Hs.241551 

Hs.38991 

Hs.38991 

Hs.38991 

Hs.38991 

Hs.75517 

Hs.137569 

Hs.1 925 

Hs.73625 

Hs.1695 

Hs.30743 

Hs.41690 

Hs.41690 

Hs.55279 

Hs.211092 

Hs.220529 

Hs.271580 

Hs.88959 

Hs.1 39322 

Hs.143610 

Hs.311389 

Hs.46320 

Hs.1 12341 

Hs.211238 

Hs.1 076 

Hs.2877 

Hs.35962 

Hs.1 640 

Hs.1 1261 

Hs.1 87958 

Hs.239 

Hs.239 

Hs.239 

Hs.1 12408 

Hs.1 12408 

Hs.292911 

Hs.1 69902 

Hs.69517 

Hs.1 30881 

Hs.1 84601 

Hs.1 98249 

Hs.82045 

Hs.1 690 

Hs.82237 

Hs.23044 

Hs.1 55637 

Hs.389 

Hs.48956 

Hs.1 12208 

Hs.82269 

Hs.184510 

Hs.87539 

Hs.77256 

Hs.83758 

Hs.236894 

Hs.5398 

Hs.211579 

Hs.1 6740 

Hs.35406 

Hs.151738 



Unigene Title 

carbonic anhydrase IX 
ESTs 

neurotensin 

chloride channel, calcium activated, fam 
S100 calcium-binding protein A2 
S100 calcium-binding protein A2 
S100 calcium-binding protein A2 
S100 calcium-binding protein A2 
hypothetical protein FLJ20093 
laminin, beta 3 {nicein (125kD), kalintn 
tumor protein 63 kDa with strong homolog 
desmoglein 3 (pemphigus vulgaris antigen 
RAB6 interacting, kinesin-like (rabkines 
matrix metalloproteinase 12 (macrophage 
preferentially expressed antigen in mela 
desmocoliin 3 
desmocollin 3 
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Seq ID No: 500 & 501 413063 

Seq ID No: 502 & 503 433800 

Seq ID No: 504 & 505 452401 

Seq ID No: 506 & 507 452401 

Seq ID No: 508 & 509 450001 

Seq ID No: 510 & 511 410407 

Seq ID No: 512 & 513 309931 

Seq ID No: 514 & 515 412719 

Seq ID No: 516 & 517 417034 

Seq ID No: 518 & 519 430486 

Seq ID No: 520 & 521 413753 

Seq ID No: 522 & 523 425650 

Seq ID No: 524 & 525 423673 

Seq ID No: 526 & 527 418663 

Seq ID No: 528 & 529 418663 

Seq ID No: 530 & 531 429610 

Seq ID No: 532 & 533 406690 

Seq ID No: 534 & 535 431846 

Seq ID No: 536 & 537 422158 

Seq ID No: 538 & 539 431958 

Seq ID No: 540 & 541 437044 

Seq ID No: 542 & 543 428484 

Seq ID No: 544 & 545 429211 

Seq ID No: 546 & 547 417389 

Seq ID No: 548 & 549 431009 

Seq ID No: 550 & 551 417542 

Seq ID No: 552 & 553 449230 

Seq ID No: 554 & 555 410555 

Seq ID No: 556 & 557 410555 

Seq ID No: 558 & 559 424687 

Seq ID No: 560 & 561 418462 

Seq ID No: 562 & 563 410274 

Seq ID No: 564 & 565 439606 

Seq ID No: 566 & 567 404877 

Seq ID No: 568 & 569 444781 

Seq ID No: 570 & 571 418543 

Seq ID No: 572 & 573 415817 

Seq ID No: 574 & 575 415817 

Seq ID No: 576 & 577 415817 

Seq ID No: 578 & 579 415817 

Seq ID No: 580 & 581 415817 

Seq ID No: 582 & 583 415817 

Seq ID No: 584 & 585 421817 

Seq ID No: 586 & 587 418678 

Seq ID No: 588 & 589 418678 

Seq ID No: 590 & 591 409420 

Seq ID No: 592 & 593 332180 

Seq ID No: 594 & 595 408790 

Seq ID No: 596 & 597 408790 

Seq ID No: 598 & 599 439223 

Seq ID No: 600 & 601 409757 

Seq ID No: 602 & 603 428969 

Seq ID No: 604 & 605 428969 

Seq ID No: 606 & 607 428969 

Seq ID No: 608 & 609 428969 

Seq ID No: 61 0& 611 450701 

Seq ID No: 612 & 613 450701 

Seq ID No: 614 & 615 414774 

Seq ID No: 616 & 617 407944 

Seq ID No: 618 & 619 407944 

Seq ID No: 620 & 621 457489 

Seq ID No: 622 & 623 429547 

Seq ID No: 624 & 625 407242 

Seq ID No: 626 & 627 407242 

Seq ID No: 628 & 629 407242 

Seq ID No: 630 & 631 444006 
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ESTs, Weakly similar to dJ36501 2.1 [H.sa 
ENSP00000251056*:Plasma membrane calcium 
cathepsin E 

cartilage acidic protein 1 
cartilage acidic protein 1 
cartilage acidic protein 1 
apolipoprotein L 

death receptor 6, TNF superfamily member 
sema domain, immunoglobulin domain (Ig), 
serine/threonine kinase 12 
Predicted cation efflux pump 
C15000305:gi|3806122jgb|AAC69198.1|(AF0 
C1 5000305:gi|38061 22(gb|AAC691 98. 1 ) (AF0 
similar to lysosome-associated membrane 
small inducible cytokine subfamily B (Cy 
interleukin 1 receptor antagonist 
glypican 1 
aquaporin 4 
KIAA0877 protein 

chitinase 3-Iike 1 (cartilage glycoprote 
lung type-l cell membrane-associated gly 
tumor necrosis factor, alpha-induced pro 
tumor necrosis factor, alpha-Induced pro 
solute carrier family 6 {neurotransmitte 
carbonic anhydrase IX 

gb:hd13d01,x1 Soares_NFL_TJ3BCLS1 Homos 
ESTs 

neurotensin 

chloride channel, calcium activated, fam 
laminin, beta 3 (nicein (1 25kD), kalinin 
desmoglein 3 (pemphigus vulgaris antigen 
matrix metalloproteinase 12 (macrophage 
desmocollin 3 
desmocoliin 3 

LUNX protein; PLUNC {palate lung and nas 
carcinoembryonic antigen-related cell ad 
uroplakin 1B 

protease inhibitor 3, skin-derived (SKAL 
cadherin 3, type 1, P-cadherin (placenta 
differentially expressed in Fanconi's an 
solute carder family 7 (cationic amino 
gap junction protein, beta 5 (connexin 3 
midkine (neurite growth-promoting factor 
gap junction protein, beta 6 (connexin 3 
progestagen-associated endometrial prate 
melanoma cell adhesion molecule 
a disintegrin and metalloproteinase doma 
a disintegrin and metalloproteinase doma 
matrix metalloproteinase 9 (gelatinase B 
integrin, beta 4 
hypoxia-inducible protein 2 • 
G protein-coupled receptor 87 
NM_005365:Homo sapiens melanoma antigen, 
GPI-anchcred metastasis-associated prote 
hyaluronan synthase 3 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
protein tyrosine phosphatase, receptor-t 
ATP-binding cassette, sub-family C (CFTR 
cancer/testis antigen (NY-ESO-1) 
cancer/testis antigen (NY-ESO-1) 
)amlnin, gamma 2 (nicein (100kD), kalini 
claudin 1 

neurotrophic tyrosine kinase, receptor, 

neurotrophic tyrosine kinase, receptor, 

UL16 binding protein 2 

cystatin SN 

artemin 

artemin 

artemin 

artemin 

hypothetical protein XP.098151 (leucine- 
hypothetical protein XP_098151 (leucine- 
plasminogen activator, urokinase 
desmocollin 2 
desmocollin 2 
cryptic gene 
ESTs 

gb:Human nonspecific crossreacting antig 
gb:Human nonspecific crossreacting antig 
gb:Human nonspecific crossreacting antig 
type I transmembrane protein Fn14 
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Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 
Seq ID No: 



632 & 633 
634 & 635 
636 & 637 
638 & 639 
640 & 641 
642 & 643 
644 & 645 
646 & 647 
648 & 649 
650 & 651 
652 & 653 
654 & 655 
656 & 657 
658 & 659 
660 & 661 
662 & 663 
664 & 665 
666 & 667 
668 & 669 
670 & 671 
672 & 673 
674 & 675 
676 & 677 
678 & 679 
680 & 681 
682 & 683 
684 & 685 
686 & 687 
688 & 689 



429597 
422109 
419235 
449048 
419216 
431462 
448243 
426427 
445537 
422278 
428450 
446619 
453392 
426514 
425776 
425776 
431515 
419452 
432653 
432653 
432653 
432653 
410001 
426501 
408369 
445413 
422424 
428330 
420610 



NM.003816 

S73265 

AW470411 

Z45051 

AU076718 

AW583672 

AW369771 

M86699 

AJ245671 

AF072873 

NM_014791 

AU076643 

U23752 

BE616633 

U25128 

U25128 

NM_012152 

U33635 

N62096 

N62096 

N62096 

N62096 

AB041036 

AW043782 

R38438 

AA151342 

AI186431 

L22524 

AI683183 



Hs.2442 a disintegrin and metaltoproteinase doma 

Hs.1473 gastrin-releasing peptide 

Hs.288433 neurotrimin 

Hs.22920 similar to S68401 (cattle) glucose indue 

Hs.164021 small inducible cytokine subfamily B (Cy 

Hs.25631 1 granin-Iike neuroendocrine peptide precu 

Hs.52620 integrin, beta 8 

Hs,1 69840 TTK protein kinase 

Hs.12844 EGF-like-domain, multiple 6 

Hs. 114218 frizzled (Drosophila) homolog 6 

Hs.184339 KIM0175 gene product 

Hs.313 secreted phosphoprotein 1 (osteopontin, 

Hs.32964 SRY (sex determining region Y)-box 1 1 

Hs.170195 bone morphogenetic protein 7 (osteogenic 

Hs. 1 59499 parathyroid hormone receptor 2 

Hs. 1 59499 parathyroid hormone receptor 2 

Hs.258583 endothelial differentiation, lysophospha 

Hs.90572 PTK7 protein tyrosine kinase 7 

Hs.2931 85 ESTs, Weakly similar to JC7328 amino aci 

Hs,2931 85 ESTs, Weakly similar to JC7328 amino aci 

Hs.293185 ESTs, Weakly similar to JC7328 amino aci 

Hs.2931 85 ESTs, Weakly similar to JC7328 amino aci 

Hs.57771 kallikrein11 

Hs.293616 ESTs 

Hs.1 82575 solute carrier family 1 5 (H??? transport 

Hs.12677 CG)-147prolein 

Hs.296638 prostate differentiation factor 

Hs.2256 matrix metalioproteinase 7 (matrilysin, 

Hs.99348 distal-less homeo box 5 



TABLE 15B 



Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey 
309931 
330493 
439285 

450375 

451320 



CAT Number 
AW341683 
33264_5 
47065.1 

83327J 

86576J 



Accession 

M27826 R78416 AA307645 AW957879 AW95780Q AA633529 H03662 

AL133916 N79113 AF086101 N76721 AW950828 AA364013 AW955684 AI346341 AI867454 N54784 AI655270 A1421279 AW014882 
AA775552 N62351 N59253 M626243 AI341407 BE175639 M456968 AI358918 AA457077 

AA009647 AA131254 AA374293 AW954405 H04410 AW606284 AA151166 BE157467 BE157601 H04384 W46291 AW663674 H04021 H01532 
AA1 90993 H03231 H59605 H01642 M852876 AA1 13758 AA626915 M746952 Al 161 01 4 AA099554 R69067 
AW1 1 8072 AI631 982 T1 5734 AA2241 95 AI701458 W201 98 F26326 AA890570 N90552 AW071 907 AI671 352 AI375892 T0351 7 R88265 
AI124088 AA224388 AI084316 AI354686 T33652 A1140719 A172021 1 T03490 AI372637 T15415 AW205836 AA630384 T03515 T33230 
AA017131 AA443303 T33623 A1222556 T33511 T33785 AI419606 D55612 



TABLE 15C 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication entitled "The DNA 

sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 

Strand: Indicates DNA strand from which exons were predicted. 
NLposition: Indicates nucleotide positions of predicted exons. 

Pkey Ref Strand NLposition 

402075 8117407 Plus 121907-122035,122804-122921,124019-124161,124455-124610,125672-126076 

403329 8516120 Plus 96450-96598 

403478 9958258 Plus 116458-116564 

404440 7528051 Plus 80430-81 581 

404877 1519284 Pius 1095-2107 

405770 2735037 Plus 61057-62075 

405932 7767812 Minus 123525-123713 



188 



WO 02/086443 

Table 16 



PCT/US02/12476 



Seq ID NO: 1 DNA sequence 
Nucleic Acid Accession #: NM_001216 
5 Coding sequence: 43.. 1422 

1 11 21 31 41 51 

I 1 I I I I 

GCCCGTACAC ACCGTGTGCT GGGACACCCC ACAGTCAGCC GCATGGCTCC CCTGTGCCCC 60 

10 AGCCCCTGGC TCCCTCTGTT GATCCCGGCC CCTGCTCCAG GCCTCACTGT GCAACTGCTG 120 

CTGTCACTGC TGCTTCTGAT GCCTGTCCAT CCCCAGAGGT TGCCCCGGAT GCAGGAGGAT 180 

TCCCCCTTGG GAGGAGGCTC TTCTGGGGAA GATGACCCAC TGGGCGAGGA GGATCTGCCC 240 

AGTGAAGAGG ATTCACCCAG AGAGGAGGAT CCACCCGGAG AGGAGGATCT ACCTGGAGAG 300 

GAGGATCTAC CTGGAGAGGA GGATCTACCT GAAGTTAAGC CTAAATCAGA AGAAGAGGGC 360 

15 TCCCTGAAGT TAGAGGATCT ACCTACTGTT GAGGCTCCTG GAGATCCTCA AGAACCCCAG 420 

AATAATGCCC ACAGGGACAA AGAAGGGGAT GACCAGAGT C ATTGGCGCTA TGGAGGCGAC 480 

CCGCCCTGGC CCCGGGTGTC CCCAGCCTGC GCGGGCCGCT TCCAGTCCCC GGTGGATATC 540 

CGCCCCCAGC TCGCCGCCTT CTGCCCGGCC CTGCGCCCCC TGGAACTCCT GGGCTTCCAG 600 

CTCCCGCCGC TCCCAGAACT GCGCCTGCGC AACAATGGCC ACAGTGTGCA ACTGACCCTG 660 

20 CCTCCTGGGC TAGAGATGGC TCTGGGTCCC GGGCGGGAGT ACCGGGCTCT GCAGCTGCAT 72 0 

CTGCACTGGG GGGCTG CAGG TCGTCCGGGC TCGGAGCACA CTGTGGAAGG CCACCGTTTC 780 

CCTGCCGAGA TCCACGTGGT TCACCTCAGC ACCGCCTTTG CCAGAGTTGA CGAGGCCTTG 840 

GGGCGCCCGG GAGGCCTGGC CGTGTTGGCC GCCTTTCTGG AGGAGGGCCC GGAAGAAAAC 900 

AGTGCCTATG AGCAGTTGCT GTCTCGCTTG GAAGAAATCG CTGAGGAAGG CTCAGAGACT 960 

25 CAGGT CCC AG GACTGGACAT ATCTGCACTC CTGCCCTCTG ACTTCAGCCG CTACTTCCAA 1020 

TATGAGGGGT CTCTGACTAC ACCGCCCTGT GCCCAGGGTG TCATCTGGAC TGTGTTTAAC 1080 

CAGACAGTGA TGCTGAGTGC TAAGCAGCTC CACACCCTCT CTGACACCCT GTGGGGACCT 1140 

GGTGACTCTC GGCTACAGCT GAACTTCCGA GCGACGCAGC CTTTGAATGG GCGAGTGATT 120 0 

GAGGCCTCCT TCCCTGCTGG AGTGGACAGC AGTCCTCGGG CTGCTGAGCC AGTCCAGCTG 12 60 

30 AATTCCTGCC TGGCTGCTGG TGACATCCTA GCCCTGGTTT TTGGCCTGCT TTTTGCTGTC 132 0 

ACCAGCGTCG CGTTCCTTGT GCAGATGAGA AGGCAGCACA GAAGGGGAAC CAAAGG GGGT 1380 

GTGAGCTACC GCCCAGCAGA GGTAGCCGAG ACTGGAGCCT AGAGGCTGGA TCTTGGAGAA 1440 

TGTGAGAAGC CAGCCAGAGG CATCTGAGGG GGAGCCGGTA ACTGTCCTGT CCTGCTCATT 1500 
ATGCCACTTC CTTTTAACTG CCAAGAAATT TTTTAAAATA AATATTTATA AT 



35 



Seq ID NO: 2 Protein sequence: 
Protein Accession #: NP_001207 



1 11 21 31 41 51 

40 | | | l l I 

MAPLCPSPWL PLLIPAPAPG LTVQLLLSLL LLMPVHPQRL PRMQEDSPLG GGSSGEDDPL 60 

GEEDLPSEED SPREEDPPGE EDLPGEEDLP GEEDLPEVKP KSEEEGSLKL EDLPTVEAPG 120 

DPQEPQNNAH RDKEGDDQSH WRYGGDPPWP RVSPACAGRF QSPVDIRPQL AAFCPALRPL 180 

ELLGFQLPPL PELRLRNNGH SVQLTLPPGL EMALGPGREY RALQLHLHWG AAGRPGSEHT 240 

45 VEGHRFPAEI HWHLSTAFA RVDEALGRPG GLAVLAAFLE EGPEENSAYE QLLSRLEEIA 300 

EEGSETQVPG LDISALLPSD FSRYFQYEGS LTTPPCAQGV IWTVFNQTVM LSAKQLHTLiS 360 

DTLWGPGDSR LQLNFRATQP LNGRVIEASF PAGVDSSPRA AEPVQLNSCL AAGDILALVF 42 0 
GLLFAVTSVA FLVQMRRQHR RGTKGGVSYR PAEVAETGA 

50 Seq ID NO: 3 DNA sequence 

Nucleic Acid Accession #: BC013923 
Coding sequence: 43 8-1391 

1 11 21 31 41 51 

55 | | I I I I 

AGCGGGGTTG TCTATTAACT TGTTCAAAAA GTATCAGGAG TTGTCAAGGC AGAGAAGAGA 60 

GTGTTTGCAA AAGGGGGAAA GTAGTTTGCT GCCTCTTTAA GACTAGGACT GAGAGAAAGA 120 

AGAGGAGAGA GAAAGAAAGG GAGAGAAGTT TGAGCCCCAG GCTTAAGCCT TTCCAAAAAA 180 

TAATAATAAC AATC AT CGGC GGCGGCAGGA TCGGCCAGAG GAGGAGGGAA GCGCTTTTTT 240 

60 TGATCCTGAT TCCAGTTTGC CTCTCTCTTT TTTTCCCCCA AATTATT CTT CGCCTGATTT 300 

TCCTCGCGGA GCCCTGCGCT CCCGACACCC CCGCCCGCCT CCCCTCCTCC TCTCCCCCCG 3 60 

CCCGCGGGCC CCCCAAAGTC CCGGCCGGGC CGAGGGT CGG CGGCCGCCGG CGGGCCGGGC 420 

CCGCGCACAG CGCCCGCATG TACAACATGA TGGAGACGGA GCTGAAGCCG CCGGGCCCGC 480 

AGCAAACTTC GGGGGGCGGC GGCGGCAACT CCACCGCGGC GGCGGCCGGC GGCAACCAGA 540 

65 AAAACAGCCC GGACCGCGTC AAGCGGCCCA TGAATGCCTT CATGGTGTGG TCCCGCGGGC 60 0 

AGCGGCGCAA GATGGCCCAG GAGAACCCCA AGATGCACAA CTCGGAGATC AGCAAGCGCC 660 

TGGGCGCCGA GTGGAAACTT TTGTCGGAGA CGGAGAAGCG GCCGTTCATC GACGAGGCTA 72 0 

AGCGGCTGCG AGCGCTGCAC ATGAAGGAGC ACCCGGATTA TAAATACCGG CCCCGGCGGA 780 

AAACCAAGAC GCTCATGAAG AAGGATAAGT ACACGCTGCC CGGCGGGCTG CTGGCCCCCG 840 

70 GCGGCAATAG CATGGCGAGC GGGGT CGGGG TGGGCGCCGG CCTGGGCGCG GGCGTGAACC 900 

AGCGCATGGA CAGTTACGCG CACATGAACG GCTGGAGCAA CGGCAGCTAC AGCATGATGC 960 

AGGACCAGCT GGGCTACCCG CAGCACCCGG GCCTCAATGC GCACGGCGCA GCGCAGATGC 1020 

AGCCCATGCA CCGCTACGAC GTGAGCGCCC TGCAGTACAA CTCCATGACC AGCTCGCAGA 1080 

CCTACATGAA CGGCTCGCCC ACCTACAGCA TGTCCTACTC GCAGCAGGGC ACCCCTGGCA 1140 

75 TGGCTCTTGG CTCCATGGGT TCGGTGGTCA AGTCCGAGGC CAGCTCCAGC CCCCCTGTGG 1200 

TTACCTCTTC CTCCCACTCC AGGGCGCCCT GCCAGGCCGG GGACCTCCGG GACATGATCA 1260 

GCATGTATCT CCCCGGCGCC GAGGTG CCGG AACCCGCCGC CCCCAGCAGA CTTCACATGT 132 0 

CCCAGCACTA CCAGAGCGGC CCGGTGCCCG GCACGGCCAT TAACGGCACA CTGCCCCTCT 1380 

CACACATGTG AGGGCCGGAC AGCGAACTGG AGGGGGGAGA AATTTTCAAA GAAAAACGAG 1440 

80 GGAAATGGGA GGGGTGCAAA AGAGGAGAGT AAGAAACAGC ATGGAGAAAA CCCGGTACGC 1500 

TCAAAAAAAA AAAAAAAAAA AAAATCCCAT CACCCACAGC AAATGACAGC TGCAAAAGAG 1560 

AACACCAAT C CCATCCACAC TCACGCAAAA ACCGCGATGC CGACAAGAAA ACTTTTATGA 162 0 

GAGAGATCCT GGACTTCTTT TKGGGGGACT ATTTTTGTAC AGAGAAAACC TGGGGAGGGT 1680 

GGGGAGGGCG GGGGAATGGA CCTTGTATAG ATCTGGAGGA AAGAAAGCTA CGAAAAACTT 1740 

85 TTTAAAAGTT CTAGT GGTAC GGTAGGAGCT TTGCAGGAAG TTTGCAAAAG TCTTTACCAA 1800 

TAATATTTAG AGCTAGTCTC CAAGCGACGA AAAAAATGTT TTAATATTTG CAAGCAACTT 1860 

TTGTACAGTA TTTATCGAGA TAAACATGGC AATCAAAATG TCCATTGTTT ATAAGCTGAG 1920 
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AATTTGCCAA TATTTTTCAA GGAGAGGCTT CTTGCTGAAT TTTGATTCTG CAGCTGAAAT 1980 

TTAGGACAGT TGCAAACGTG AAAAGAAGAA AATTATTCAA AT TTGGACAT TTTAATTGTT 2 040 

TAAAAATTGT ACAAAAGGAA AAAATTAGAA TAAGTACTGG CGAACCATCT CTGTGGTCTT 2100 

GTTTAAAAAG GGCAAAAGTT TTAGACTGTA CTAAATTTTA TAACTTACTG TTAAAAGCAA 2160 

AAATGGCCAT GCAGGTTGAC ACCGTTGGTA ATTTATAATA GCTTTTGTTC GATCCCAACT 2220 

TTCCATTTTG TTCAGATAAA AAAAACCATG AAATTACTGT GTTTGAAATA TTTTCTTATG 22 80 

GTTTGTAATA TTTCTGTAAA TTTATTGTGA TATTTTAAGG TTTTCCCCCC TTTATTTTCC 2 340 

GTAGTTGTAT TTTAAAAGAT TCGGCTCTGT ATTATTTGAA TCAGTCTGCC GAGAATCCAT 2400 

GTATATATTT GAACTAATAT CATCCTTATA ACAGGTACAT TTTCAACTTA AGTTTTTACT 24 60 

CCATTATGCA CAGTTTGAGA TAAATAAATT TTTGAAATAT GGACACTGAA AAAAAAAAAA 2520 

AAAAAAACAA AACAAAAAAA CAAAAAACAA AAACAGAAAA AACAAAAAAA AAAACAAAAC 2580 

CACAACACAA AAACAAAAAA AAAAAAAAGA AACAAACACA CAACACAACA CAACACAAAA 2 640 
CCACAACACA AACAACAACA CACAGAGGG 

Seq ID NO: 4 Protein sequence: 
Protein Accession #:CAA83435.1 



PCT/US02/12476 



20 



25 



MYNMMETELK 
QENPKMHNSE 
KKDKYTLPGG 
PQHPGLNAHG 
GSWKSEASS 
GPVPGTAING 



11 

i 

PPGPQQTSGG 
ISKRLGAEWK 
LLAPGGNSMA 
AAQMQPMHRY 
SPPWTSSSH 
TLPLSHM 



21 
i 

GGGNS TAAAA 
IiLSETEKRPF 
SGVGVGAGLG 
DVSALQYNSM 
SRAPCQAGDL 



31 



41 
I 



GGNQKNSPDR VKRPMNAFMV 
IDEAKRLRAL HMKEHPDYKY 
AGVNQRMDSY AHMNGWSNGS 
TSSQTYMNGS PTYSMSYSQQ 
RDMISMYLPG AEVPEPAAPS 



51 
I 

WSRGQRRKMA 
RPRRKTKTLM 
YSMMQDQLGY 
GTPGMALGSM 
RLHMSQHYQS 



60 
120 
180 
240 
300 



30 
35 
40 
45 



Seq ID NO: 5 DNA sequence 
Nucleic Acid Accession #: U91618 
Coding sequence: 2 9-541 



CGGACTTGGC 
CATGCTACTC 
AGCATTAGAA 
TCCCTCTTGG 
AGCTGAGGAA 
TGCTTTAGAT 
TCACAGCAGG 
TGACAAAAAT 
GCTGTATGAG 
AGAGAATAAA 
ATTATATTTG 
ATTGAATGTG 
TCTTCAAAAA 



11 

I 

TTGTTAGAAG 
CTGGCTTTCA 
GCAGATTTCT 
AAGATGACTC 
ACAGGAGAAG 
GGCTTTAGCT 
GCTTTTCAAC 
GGAAAGGAAG 
AATAAACCCA 
TCATTTATTT 
TGTGAAAATG 
TTTTTCTGCA 
AAAAAAAAAA 



21 
I 

GCTGAAAGAT 
GCTCCTGGAG 
TGACCAATAT 
TGCTAAATGT 
TTCATGAAGA 
TGGAAGCAAT 
ACTGGGAGTT 
AAGT CAT AAA 
GAAGACCCTA 
ACATGTGATT 
TGACAAACAC 
CTAATAGAAA 
AAATGGGGCC 



31 

[ 

GATGGCAGGA 
TCTGTGCTCA 
GCATACATCA 
TTGCAGTCTT 
GGAGCTTGTT 
GTTGACAATA 
AATCCAGGAA 
GAGAAAAATT 
CATACTCAAA 
GTGATTCATC 
ACTTATCTGT 
T T AG ACT AAG 
GCAATT 



41 
I 

ATGAAAATCC 
GATT CAGAAG 
AAGATTAGTA 
GTAAATAATT 
GCAAGAAGGA 
TACCAGCTCC 
GATATTCTTG 
CCTTATATTC 
AGAGATTCTT 
ATCCCTTAAT 
CTCTTCTACA 
TGTTTTCAAA 



51 

i 

AGCTTGTATG 
AGGAAATGAA 
AAGCACATGT 
TGAACAGCCC 
AACTTCCTAC 
ACAAAATCTG 
ATACTGGAAA 
TGAAACGGCA 
ACTATTACTG 
TAAATATCAA 
ATTGTGGTTT 
TAAATCTAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



Seq ID NO: 6 Protein sequence: 
50 Protein Accession #: AAB50564 

1 11 21 31 41 51 

i I i I I I 

MMAGMKIQLV CMLLLAFSSW SLCSDSEEEM KALEADFLTN MHTSKISKAH VPSWKMTLLN 60 

55 VCSLVNNLNS PAEETGEVHE EELVARRKLP TALDGFSLEA MLTIYQLHKI CHSRAFQHWE 12 0 
LIQEDILDTG NDKNGKEEVI KRKIPYILKR QLYENKPRRP YILKRDSYYY 

Seq ID NO: 7 DNA sequence 
Nucleic Acid Accession #: NM_006536.2 
60 Coding sequence: 10 9-2 940 

1 11 21 31 41 51 

I I I I I I 

ACCTAAAACC TTGCAAGTTC AGGAAGAAAC CATCTGCAT C CAT AT TG AAA ACCTGACACA 60 

65 ATGTATGCAG CAGGCTCAGT GTGAGTGAAC TGGAGGCTTC TCTACAACAT GACCCAAAGG 120 

AGCATTGCAG GTCCTATTTG CAACCTGAAG TTTGTGACTC TCCTGGTTGC CTTAAGTTCA 180 

GAACTCCCAT TCCTGGGAGC TGGAGTACAG CTTCAAGACA ATGGGTATAA TGGATTGCTC 240 

ATTGCAATTA ATCCTCAGGT ACCTGAGAAT CAGAACCTCA TCTCAAACAT TAAGGAAATG 300 

ATAACTGAAG CTTCATTTTA CCTATTTAAT GCTACCAAGA GAAGAGTATT TTTCAGAAAT 3 60 

70 ATAAAGATTT TAATACCTGC CACATG G AAA GCTAATAATA ACAGCAAAAT AAAACAAGAA 420 

TCATATGAAA AGGCAAATGT CATAGTGACT GACTGGTATG GGGCACATGG AGATGATCCA 480 

TACACCCTAC AATACAGAGG GTGTGGAAAA GAGGGAAAAT ACATTCATTT CACACCTAAT 540 

TTCCTACTGA ATGATAACTT AACAGCTGGC TACGGATCAC GAGGCCGAGT GTTTGTCCAT 600 

GAATGGGCCC ACCTCCGTTG GGGTGTGTTC GATGAGTATA ACAATGACAA ACCTTTCTAC 660 

75 ATAAATGGGC AAAATCAAAT TAAAGTGACA AGGTGTTCAT CTGACATCAC AGGCATTTTT 720 

GTGTGTGAAA AAGGTCCTTG CCCCCAAGAA AACTGTATTA TTAGTAAGCT TTTTAAAGAA 780 

GGATGCACCT TTATCTACAA TAGCACCCAA AATGCAACTG CAT C AAT AAT GTTCATGCAA 840 

AGTTTATCTT CTGTGGTTGA ATTTTGTAAT GCAAGTACCC ACAACCAAGA AGCACCAAAC 900 

CTACAGAACC AGATGTGCAG CCTCAGAAGT GCATGGGATG T AAT CACAG A CTCTGCTGAC 960 

80 TTTCACCACA GCTTTCCCAT GAATGGGACT GAGCTTCCAC CTCCTCCCAC ATTCTCGCTT 1020 

GTACAGGCTG GTGACAAAGT GGTCTGTTTA GTGCTGGATG TGTCCAGCAA GATGGCAGAG 1080 

GCTGACAGAC TCCTTCAACT ACAACAAGCC GCAGAATTTT ATTTGATGCA GATTGTTGAA 1140 

ATTCATACCT TCGTGGGCAT TGCCAGTTTC GACAGCAAAG GAGAGATCAG AGCCCAGCTA 1200 

CACCAAATTA ACAGCAATGA TGATCGAAAG TTGCTGGTTT CATATCTGCC CACCACTGTA 1260 

85 TCAGCTAAAA CAGAC AT CAG CATTTGTTCA GGGCTTAAGA AAGGATTTGA GGTGGTTGAA 1320 

AAACTGAATG GAAAAGCTTA TGGCTCTGTG ATGATATTAG TGACCAGCGG AGATGATAAG 1380 

CTTCTTGGCA ATTGCTTACC CACTGTGCTC AGCAGTGGTT CAACAATTCA CTCCATTGCC 1440 



190 



WO 02/086443 PCT/US02/12476 

CTGGGTTCAT CTGCAGCCCC AAATCTGGAG GAATTATCAC GTCTTACAGG AGGTTTAAAG 1500 

TTCTTTGTTC CAGATATATC AAACT CCAAT AGCATGATTG ATGCTTTCAG TAGAATTTCC 1560 

TCTGGAACTG GAGACATTTT CCAGCAACAT ATTCAGCTTG AAAGTACAGG TGAAAATGTC 1620 

AAACCTCACC AT CAATTGAA AAACACAGTG ACTGTGGATA ATACTGTGGG CAACGACACT 1680 

5 ATGTTTCTAG TTACGTGGCA GGCCAGTGGT CCTCCTGAGA TTATATTATT TGAT CCTGAT 1740 

GGACGAAAAT ACTACACAAA TAATTTTATC ACCAATCTAA CTTTTCGGAC AGCTAGTCTT 1800 

TGGATTCCAG GAACAGCTAA GCCTGGGCAC TGGACTTACA CCCTGAACAA TACCCATCAT 1860 

TCTCTGCAAG CCCTGAAAGT GACAGTGACC TCTCGCGCCT CCAACTCAGC TGTGCCCCCA 1920 

GCCACTGTGG AAGCCTTTGT GGAAAGAGAC AGCCTCCATT TTCCTCATCC TGTGATGATT 1980 

10 TATGCCAATG TGAAACAGGG ATTTTATCCC AT T CTTAATG CCACTGTCAC TGCCACAGTT 2040 

GAGCCAGAGA CTGGAGATCC TGTTACGCTG AGACTCCTTG ATGATGGAGC AGGTGCTGAT 2100 

GTTATAAAAA ATGATGGAAT TTACTCGAGG TATTTTTTCT CCTTTGCTGC AAATGGTAGA 2160 

TATAGCTTGA AAGTGCATGT CAATCACTCT CCCAGCATAA GCACCCCAAC CCACTCTATT 2220 

CCAGGGAGTC ATGCTATGTA TGTACCAGGT TACACAGCAA ACGGTAATAT TCAGATGAAT 22 80 

15 GCTCCAAGGA AATCAGTAGG CAGAAATGAG GAGGAGCGAA AGTGGGGCTT TAGCCGAGTC 2340 

AGCTCAGGAG GCTCCTTTTC AGTGCTGGGA GTTCCAGCTG GCCCCCACCC TGATGTGTTT 2400 

CCACCATGCA AAATTATTGA CCTGGAAGCT GTAAAAGTAG AAGAGGAATT GACCCTATCT 24 60 

TGGACAGCAC CTGGAGAAGA CTTTGATCAG GGCCAGGCTA CAAGCTATGA AATAAGAATG 2520 

AGTAAAAGTC TACAGAATAT CCAAGATGAC TTTAACAATG CTATTTTAGT AAATACATCA 2580 

20 AAGCGAAATC CTCAGCAAGC TGGCATCAGG GAGATATTTA CGTTCTCACC CCAGATTTCC 2 640 

ACGAATGGAC CTGAACATCA GCCAAATGGA GAAACACATG AAAGCCACAG AATTTATGTT 2 700 

GCAATACGAG CAATGGATAG GAACTCCTTA CAGTCTGCTG TATCTAACAT TGCCCAGGCG 2760 

CCTCTGTTTA TTCCCCCCAA TTCTGATCCT GTACCTGGCA GAG ATTAT CT TATATTGAAA 2820 

GGAQTTTTAA CAGCAATGGG TTTGATAGGA ATCATTTGCC TTATTATAGT TGTGACACAT 2880 

25 CATACTTTAA GCAGGAAAAA GAGAGCAGAC AAGAAAGAGA ATGGAACAAA ATTATTATAA 2 940 

ATAAATATCC AAAGTGT CTT CCTTCTTAGA TATAAGACCC ATGGCCTTCG ACTACAAAAA 3 000 

CATACTAACA AAGT C AAATT AACATCAAAA CTGTATTAAA ATGCATTGAG TTTTTGTACA 3060 

ATACAGATAA GATTTTTACA TGGTAGATCA ACAATTCTTT TTGGGGGTAG ATTAGAAAAC 3120 

CCTTACACTT TGGCTATGAA CAAATAATAA AAATTATTCT TTAAAGTAAT GTCTTTAAAG 3180 

30 GCAAAGGGAA GGGTAAAGTC GGACCAGTGT CAAGGAAAGT TTGTTTTATT GAGGTGGAAA 3240 

AATAGCCCCA AG CAG AG AAA AGGAGGGTAG GTCTGCATTA TAACTGTCTG TGTGAAGCAA 3300 

TCATTTAGTT ACTTTGATTA ATTTTTCTTT TCTCCTTATC TGTGCAGTAC AGGTTGCTTG 3360 

TTTACATGAA GATCATGCTA TATTTTATAT ATGTAGCCCC TAATGCAAAG CTCTTTACCT 3420 

CTTGCTATTT TGTTATATAT ATTTCAGATG ACATCTCCCT GCTAATGCTC AGAGATCTTT 3480 

35 TTTCACTGTA AGAGGTAACC TTTAACAATA TGGGTATTAC CTTTGTCTCT T CAT AC CGGT 3540 

TTTATGACAA AGGTCTATTG AATTTATTTG TNTGTAAGTT TCTACTCCCA TCAAAGCAGC 3600 

TTTCTAAGTT TATTGCCTTG GGTTATTATG GAATGATAGT TATAGCCCCN TATAATGCCT 3660 

TACCTAGGAA A 

40 Seq ID NO: 8 Protein sequence: 

Protein Accession #: NP_006527.1 

1 11 21 31 41 51 

4j MTQRSIAGPI CNLKFVTLLV AIiSSELPFLG AGVQLQDNGY NGLLIAINPQ VPENQNLISN 60 

IKEMITEASF YLFNATKRRV FFRNIKILIP ATWKANNNSK IKQESYEKAN VIVTDWYGAH 120 

GDDPYTLQYR GCGKEGKYIH FTPNFLLNDN LTAGYGSRGR VFVHEWAHLR WGVFDEYNND 180 

KPFYINGQNQ IKVTRCSSDI TGI FVCEKGP CPQENCIISK LFKEGCTFIY NSTQNATAS I 240 

MFMQSLSSW EFCNASTHNQ EAPNLQNQMC SLRSAWDVIT DSADFHHSFP MNGTELPPPP 300 

50 TFSLVQAGDK WCLVLDVSS KMAEADRLLQ LQQAAEFYLM QIVEIHTFVG IASFDSKGEI 360 

RAQLHQINSN DDRKLLVSYL PTTVS AKTD I SICSGLKKGF EWEKLNGKA YGSVMILVTS 42 0 

GDDKLLGNCL PTVLSSGSTI HSIALGSSAA PNLEELSRLT GGLKFFVPDI SNSNSMIDAF 480 

SRISSGTGDI FQQHIQLEST GENVKPHHQL KNTVTVDNTV GNDTMFLVTW QASGPPEIIL 540 

FDPDGRKYYT NNFITNLTFR TASLWIPGTA KPGHWTYTLN NTHHSLQALK VTVTSRASNS 600 

55 AVPPATVEAF VERDSLHFPH PVMIYANVKQ GFYPILNATV TATVEPETGD PVTLRLLDDG 660 

AGADV I KNDG IYSRYFFSFA ANGRYSLKVH VNHSPSISTP AHS I PGSHAM YVPGYTANGN 72 0 

IQMNAPRKSV GRNEEERKWG FSRVSSGGSF SVLGVPAGPH PDVFPPCKII DLEAVKVEEE 780 

LTLSWTAPGE DFDQGQATSY EIRMSKSLQN IQDDFNNAIL VNTSKRNPQQ AGIREIFTFS 840 

PQISTNGPEH QPNGETHESH RIYVAIRAMD RNSLQSAVSN IAQAPLFIPP NSDPVPARDY 900 
60 LILKGVLTAM GLIGIICLII WTHHTIiSRK KRADKKENGT KLL. 



Seq ID NO: 9 DNA sequence 
Nucleic Acid Accession #: Eos sequence 
65 Coding sequence: 336-632 

1 11 21 31 41 51 

I I I I I I 

CTCCCCTCAC CCCGGTCCAG GATGCCCAGT CCCCACGACA CCTCCCACTT CCCACTGTGG 60 

70 CCTGGGTGGG CTCAGGGGCT GCCCTTGACC TGGCCTAGAG CCCTCCCCCA GCTGGTGGTG 120 

GAGCTGGCAC TCTCTGGGAG GGAGGGGGCT GGGAGGGAAT GAGTGGGAAT GGCAAGAGGC 180 

CAGGGTTTGG TGGGATCAGG TTGAGGCAGG TTTGGTTTCC TTAAAATGCC AAGTTGGGGG 240 

CCAGTGGGGC CCACATATAA ATCCTCACCC TGGGAGCCTG GCTGCCTTGC TCTCCTTCCT 300 

GGGTCTGTCT CTGCCACCTG GTCTGCCACA GATCCATGAT GTGCAGTTCT CTGGAGCAGG 360 

75 CGCTGGCTGT GCTGGTCACT ACCTTCCACA AGTACTCCTG CCAAGAGGGC GACAAGTTCA 420 

AGCTGAGTAA GGGGGAAATG AAGGAACTTC TGCACAAGGA GCTGCCCAGC TTTGTGGGGG 480 

AGAAAGTGGA TGAGGAGGGG CTGAAGAAGC TGATGGGCAG CCTGGATGAG AACAGTGACC 540 

AG CAGGTGG A CTTCCAGGAG TATGCTGTTT TCCTGGCACT CATCACTGTC ATGTGCAATG 600 

ACTTCTTCCA GGGCTGCCCA GACCGACCCT GAAGCAGAAC TCTTGACTTC CTGCCATGGA 660 

80 TCTCTTGGGC CCAGGACTGT TGATGCCTTT GAGTTTTGTA TTCAATAAAC TTTTTTTGTC 720 

TGTTGATAAT ATTTTAATTG CTCAGTGATG TTCCATAACC CGGCTGGCTC AGCTGGAGTG 780 

CTGGGAGATG AGGGCCTCCT GGATCCTGCT CCCTTCTGGG CTCTGACTCT CCTGGAAATC 840 

TCTCCAAGGC CAGAGCTATG CTTTAGGTCT CAATTTTGGA AT T T CAAACA CCAGCAAAAA 900 

ATTGGAAATC GAGATAGGTT GCTGACTTTT ATTTTGTCAA ATAAAGATAT TAAAAAAGGC 960 

85 AAATACCA 

Seq ID NO: 10 Protein sequence: 



191 



WO 02/086443 

Protein Accession #: NP_005969.1 

1 11 21 31 



41 
I 



51 



MMCSSLEQAL AVLVTTFHKY SCQEGDKFKL SKGEMKELLH KELPSFVGEK VDEEGLKKLM 
GSLDENSDQQ VDFQEYAVFL ALITVMCNDF FQGCPDRP 



PCT/US02/12476 



60 



10 
15 
20 
25 
30 
35 
40 



Seq ID NO: 11 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 33 6-62 6 



CTCCCCTCAC 
CCTGGGTGGG 
GAGCTGGCAC 
CAGGGTTTGG 
CCAGTGGGGC 
GGGTCTGTCT 
CGCTGGCTGT 
AGCTGAGTAA 
ATTCCAGAGA 
GAGACTTGAG 
GGAGAAAGTG 
CCAGCAGGTG 
TGACTTCTTC 
GATCTCTTGG 
TCTGTTGATA 
TGCTGGGAGA 
TCTCTCCAAG 
AAATTGGAAA 
GCAAATACCA 



11 

I 

CCCGGTCCAG 
CTCAGGGGCT 
TCTCTGGGAG 
TGGGATCAGG 
CCACATATAA 
CTGCCACCTG 
GCTGGTCACT 
GGGGGAAATG 
ACCATGTGCT 
AAACCAGAGC 
GATGAGGAGG 
GACTTCCAGG 
CAGGGCTGCC 
GCCCAGGACT 
ATATTTTAAT 
TGAGGGCCTC 
GCCAGAGCTA 
TCGAGATAGG 



21 

I 

GATGCCCAGT 
GCCCTTGACC 
GGAGGGGGCT 
TTGAGGCAGG 
ATCCTCACCC 
GTCTGCCACA 
ACCTTCCACA 
AAGGAACTTC 
GTGAGGGCCT 
CCAGAAGGGA 
GGCTGAAGAA 
AGTATGCTGT 
CAGACCGACC 
GTTGATGCCT 
TGCTCAGTGA 
CTGGATCCTG 
TGCTTTAGGT 
TTGCTGACTT 



31 

I 

CCCCACGACA 
TGGCCTAGAG 
GGGAGGGAAT 
TTTGGTTTCC 
TGGGAGCCTG 
GATCCATGAT 
AGTACTCCTG 
TGCACAAGGA 
TCCGAGTCCA 
AAAGTGATTG 
GCTGATGGGC 
TTTCCTGGCA 
CTGAAGCAGA 
TTGAGTTTTG 
TGTTCCATAA 
CTCCCTTCTG 
CTCAATTTTG 
TTATTTTGTC 



41 
I 

CCTCCCACTT 
CCCTCCCCCA 
GAGTGGGAAT 
TTAAAATGCC 
GCTGCCTTGC 
GTGCAGTTCT 
CCAAGAGGGC 
GCTGCCCAGC 
TCTGTTTA^T 
TCCCAAGATC 
AGCCTGGATG 
CTCATCACTG 
ACTCTTGACT 
TATTCAATAA 
CCCGGCTGGC 
GGCTCTGACT 
GAATTTCAAA 
AAATAAAGAT 



51 
I 

CCCACTGTGG 
GCTGGTGGTG 
GGCAAGAGGC 
AAGTTGGGGG 
TCTCCTTCCT 
CTGGAGCAGG 
GACAAGTTCA 
TTTGTGGGGC 
CCTGTCATTG 
ACACAGCACT 
AGAACAGTGA 
TCATGTGCAA 
TCCTGCCATG 
ACTTTTTTTG 
TCAGCTGGAG 
CTCCTGGAAA 
CACCAGCAAA 
ATTAAAAAAG 



Seq ID NO: 12 Protein sequence: 
Protein Accession # : Eos sequence 



1 11 21 31 41 51 

1 I I I I I 

MMCSSLEQAL AVLVTTFHKY SCQEGDKFKL SKGEMKELLH KELPSFVGHS RE P CAVRAFR 
VHLFNPVIGD LRNQSPEGKS DCPKITQHWR KWMRRG 



60 
12 0 
180 
240 
300 
360 
42 0 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



60 



45 



50 



55 



60 



65 



70 



75 



80 



85 



Seq ID NO: 13 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 58-354 



GTGAGCTCAC 
ATGTGCAGTT 
TGCCAAGAGG 
GAGCTGCCCA 
AGCCTGGATG 
CTCATCACTG 
ACTCTTGACT 
TATTCAATAA 
CCCGGCTGGC 
GGCTCTGACT 
GAATTTCAAA 
AAATAAAGAT 



11 

I 

CATGTGGGGG 
CTCTGGAGCA 
GCGACAAGTT 
GCTTTGTGGG 
AGAACAGTGA 
TCATGTGCAA 
TCCTGCCATG 
ACTTTTTTTG 
TCAGCTGGAG 
CTCCTGGAAA 
CACCAGCAAA 
ATTAAAAAAG 



21 
I 

TGAGGCTGAG 
GGCGCTGGCT 
CAAGCTGAGT 
GGAGAAAGTG 
CCAGCAGGTG 
TGACTTCTTC 
GATCTCTTGG 
TCTGTTGATA 
TGCTGGGAGA 
TCTCTCCAAG 
AAATTGGAAA 
GCAAATACCA 



31 
I 

AGAAAACAAG 
GTGCTGGTCA 
AAGGGGGAAA 
GATGAGGAGG 
GACTTCCAGG 
CAGGGCTGCC 
GCCCAGGACT 
ATATTTTAAT 
TGAGGGCCTC 
GCCAGAGCTA 
TCGAGATAGG 



41 

I 

TACACAGCCA 
CTACCTTCCA 
TGAAGGAACT 
GGCTGAAGAA 
AGTATGCTGT 
CAGACCGACC 
GTTGATGCCT 
TGCTCAGTGA 
CTGGATCCTG 
TGCTTTAGGT 
TTGCTGACTT 



51 

I 

CAGATCCATG 
CAAGTACTCC 
TCTGCACAAG 
GCTGATGGGC 
TTTCCTGGCA 
CTGAAGCAGA 
TTGAGTTTTG 
TGTTCCATAA 
CTCCCTTCTG 
CTCAATTTTG 
TTATTTTGTC 



Seq ID NO: 14 Protein sequence: 
Protein Accession #: NP_005969.1 

1 11 21 31 41 51 

) I I I 1 I 

MMCSSLEQAL AVLVTTFHKY SCQEGDKFKL SKGEMKELLH KELPSFVGEK VDEEGLKKLM 
GSLDENSDQQ VDFQEYAVFL ALITVMCNDF FQGCPDRP 



Seq ID NO: 15 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 62-358 



GGAGGGTGTG 
CATGATGTGC 
CTCCTGCCAA 
CAAGGAGCTG 
GGGCAGCCTG 
GGCACTCATC 
CAGAACTCTT 
TTTGTATTCA 
ATAACCCGGC 
TCTGGGCTCT 
TTTGGAATTT 



11 

i 

CCGCTGAGTC 
AGTTCTCTGG 
GAGGGCGACA 
CCCAGCTTTG 
GATGAGAACA 
ACTGTCATGT 
GACTTCCTGC 
ATAAACTTTT 
TGGCTCAGCT 
GACTCTCCTG 
CAAACACCAG 



21 
I 

ACTGCCTGGG 
AGCAGGCGCT 
AGTTCAAGCT 
TGGGGGAGAA 
GTGACCAGCA 
GCAATGACTT 
CATGGATCTC 
TTTGTCTGTT 
GGAGTGCTGG 
GAAATCTCTC 
CAAAAAATTG 



31 
I 

CATCTGGGCC 
GGCTGTGCTG 
GAGTAAGGGG 
AGTGGATGAG 
GGTGGACTTC 
CTTCCAGGGC 
TTGGGCCCAG 
GATAATATTT 
GAGATGAGGG 
CAAGGCCAGA 
GAAATCGAGA 



41 
I 

TGGAACCTCG 
GTCACTACCT 
GAAATGAAGG 
GAGGGGCTGA 
CAGGAGTATG 
TGCCCAGACC 
GACTGTTGAT 
TAATTGCTCA 
CCTCCTGGAT 
GCTATGCTTT 
TAGGTTGCTG 



51 

I 

GCCACAGATC 
TCCACAAGTA 
AACTTCTGCA 
AGAAGCTGAT 
CTGTTTTCCT 
GACCCTGAAG 
GCCTTTGAGT 
GTGATGTTCC 
CCTGCTCCCT 
AGGTCTCAAT 
ACTTTTATTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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TGTCAAATAA AGATATTAAA AAAGGCAAAT ACCA 

Seq ID NO: 16 Protein sequence: 
Protein Accession #: NP_005969.1 

1 11 21 31 41 51 

i i I I 1 I 

MMCSSLEQAL AVLVTTFHKY SCQEGDKFKL SKGEMKELLH KELPSFVGEK VDEEGLKKLM 60 
GSLDENSDQQ VDFQEYAVFD ALITVMCNDF FQGCPDRP 



Seq ID NO: 17 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 939-2372 

1 11 21 31 41 51 

l l I I I I 

AAGACGGATT CTCAGACAAG GCTTGCAAAT GCCCCGCAGC CATCATT TAA CTGCACCCGC 60 

AGAATAGTTA CGGTTTGTCA CCCGACCCTC CCGGATCGCC TAATTTGTCC CTAGTGAGAC 120 

CCCGAGGCTC TGCCCGCGCC TGGCTTCTTC GTAGCTGGAT GCATATCGTG CTCCGGGCAG 180 

CGCGGGCGCA GGGCACGCGT TCGCGCACAC CCTAGCACAC ATGAACACGC GCAAGAGCTG 240 

AACCAAGCAC GGTTTCCATT TCAAAAAGGG AGACAGCCTC TACCGCGATT GTAGAAGAGA 300 

CTGTGGTGTG AATTAGGGAC CGGGAGGCGT CGAACGGAGG AACGGTTCAT CTTAGAGACT 360 

AATTTTCTGG AGTTTCTGCC CCTGCTCTGC GTCAGCCCTC ACGTCACTTC GCCAGCAGTA 420 

GCAGAGGCGG CGGCGGCGGC TCCCGGAATT GGGTTGGAGC AGGAGCCTCG CTGGCTGCTT 480 

CGCTCGCGCT CTACGCGCTC AGTCCCCGGC GGTAGCAGGA GCCTGGACCC AGGCGCCGCC 540 

GGCGGGCGTG AGGCGCCGGA GCCCGGCCTC GAGGTGCATA CCGGACCCCC ATTCGCATCT 600 

AACAAGGAAT CTGCGCCCCA GAGAGTCCCG GGAGCGCCGC CGGTCGGTGC CCGGCGCGCC 660 

GGGCCATGCA GCGACGGCCG CCGCGGAGCT CCGAGCAGCG GTAGCGCCCC CCTGTAAAGC 720 

GGTTCGCTAT GCCGGGGCCA CTGTGAACCC TGCCGCCTGC CGGAACACT C TTCGCTCCGG 780 

ACCAGCTCAG CCTCTGATAA GCTGGACTCG GCACGCCCGC AACAAGCACC GAGGAGT TAA 840 

GAGAGCCGCA AGCGCAGGGA AGGCCTCCCC GCACGGGTGG GGGAAAGCGG CCGGTGCAGC 900 

GCGGGGACAG GCACTCGGGC TGGCACTGGC TGCTAGGGAT GTCGTCCTGG ATAAGGTGGC 960 

ATGGACCCGC CATGGCGCGG CTCTGGGGCT TCTGCTGGCT GGTTGTGGGC TTCTGGAGGG 1020 

CCGCTTTCGC CTGTCCCACG TCCTGCAAAT GCAGTGCCTC TCGGATCTGG TGCAGCGACC 1080 

CTTCTCCTGG CATCGTGGCA TTTCCGAGAT TGGAGCCTAA CAGTGTAGAT CCTGAGAACA 1140 

TCACCGAAAT TTTCATCGCA AACCAGAAAA GGTTAGAAAT CATCAACGAA GATGATGTTG 1200 

AAGCTTATGT GGGACTGAGA AATCTGACAA TTGTGGATTC TGGATTAAAA TTTGTGGCTC 1260 

ATAAAGCATT TCTGAAAAAC AGCAACCTGC AGCACATCAA TTTTACCCGA AACAAACTGA 132 0 

CGAGTTTGTC TAGGAAACAT TTCCGTCACC TTGACTTGTC TGAACTGATC CTGGTGGGCA 13 80 

ATCCATTTAC ATGCTCCTGT GACATTATGT GGATCAAGAC TCTCCAAGAG GCTAAATCCA 1440 

GTCCAGACAC TCAGGATTTG TACTGCCTGA ATGAAAGCAG CAAGAATATT CCCCTGGCAA 1500 

ACCTGCAGAT ACCCAATTGT GGTTTGCCAT CTGCAAATCT GGCCGCACCT AACCTCACTG 1560 

TGGAGGAAGG AAAGTCTATC ACATTATCCT GTAGTGTGGC AGGTGATCCG GTTCCTAATA 1620 

TGTATTGGGA TGTTGGTAAC CTGGTTTCCA AACATATGAA TGAAACAAGC CACACACAGG 1680 

GCTCCTTAAG GATAACTAAC ATTTCATCCG ATGACAGTGG GAAGCAGATC TCTTGTGTGG 1740 

CGGAAAATCT TGTAGGAGAA GATCAAGATT CTGTCAACCT CACTGTGCAT TTTGCACCAA 1800 

CTAT CACATT TCTCGAATCT CCAACCTCAG ACCACCACTG GTGCATT CCA TTCACTGTGA 1860 

AAGGCAACCC CAAACCAGCG CTTCAGTGGT TCTATAACGG GGCAATATTG AATGAGTCCA 1920 

AATACATCTG TACTAAAATA CATGTTACCA ATCACACGGA GTACCACGGC TGCCTCCAGC 1980 

TGGATAATCC CACT CACATG AACAATGGGG ACTACACTCT AATAGCCAAG AATGAGTATG 2040 

GGAAGGATGA GAAACAGATT TCTGCTCACT TCATGGGCTG GCCTGGAATT GACGATGGTG 2100 

CAAACCCAAA TTATCCTGAT GTAATTTATG AAGATTATGG AACTGCAGCG AATGACAT CG 2160 

GGGACACCAC GAACAGAAGT AATGAAATCC CTTCCACAGA CGTCACTGAT AAAACCGGTC 222 0 

GGGAACATCT CTCGGTCTAT GCTGTGGTGG TGATTGCGTC TGTGGTGGGA TTTTGCCTTT 22 80 

TGGTAATGCT GTTTCTGCTT AAGTTGGCAA GACACTCCAA GTTTGGCATG AAAGGTTTTG 2340 

TTTTGTTTCA TAAGATCCCA CTGGATGGGT AGCTGAAATA AAGGAAAAGA CAGAGAAAGG 2400 

GGCTGTGGTG CTTGTTGGTT GATGCTGCCA TGTAAGCTGG ACTCCTGGGA CTGCTGTTGG 2460 

CTTATCCCGG GAAGTGCTGC TTATCTGGGG TTTTCTGGTA GATGTGGGCG GTGTTTGGAG 2520 

GCTGTACTAT ATGAAGCCTG CATATACTGT GAGCTGTGAT TGGGGAACAC CAATGCAGAG 2580 

GTAACTCTCA GGCAGCTAAG CAGCACCTCA AGAAAACATG TTAAATTAAT GCTTCTCTTC 2640 

TTACAGTAGT TCAAATACAA AACTGAAATG AAATCCCATT GGATTGTACT TCTCTTCTGA 2 700 

AAAGTGTGCT TTTTGACCCT ACTGGACATT TATTGACTTA ATTGCTTCTG TTTATTAAAA 2760 

TTGACCTGCA AAGTTAAAAA AAAATTAAAG TTGAGAACAG GTATAAGTGC ACACTGAATA 2820 

GTCTAATCTA CATGTAACAC ATATTTTAGT GTGATTTTCT ATACTCTAAT CAGCACTGAA 2880 

TTCAGAGGGT TTGACTTTTT CATCTATAAC ACAGTGACTA AAAGAGTTAA GGGTATATAT 2940 

ACCATCACTT TGGGACTTGG TAGTATTATT AAAAGGT TAT TTCCTTCACT GTCAATAAAA 30 00 

GTCCAAATGT TTAGCTTAGG TCTGAGAGTC AAACAATGTT AAGGATTGTC TTAAAGTTCC 3060 

T TAG C C AG CA AAACAAAACA AAACAAAACA AACAAATGAA AAACGTTTAA AAAGAAGAAG 3120 

AAGAAAAAAA ACAAGAACAA GCAGCAACAG CTGTTTTGTT GGGGCTATAG ATTTAAGTTA 3180 

GGCATAGTCA ATTTCAGAAT AACTAAGAGT GGAATATATG CATATGGTGA AATTATAACC 3240 

TTGCCCTTTT TTATTTGCCC TCTGCGATCC ACCTGCTTTT TAGAAGTCTG CCGAGTGAGA 3300 

AGGCCACAGT ATCT CATGCT GTTTGCATTA CAGAACTGCA GCTTTTCTAC TCTGAAAAGG 3360 

CCTGGGAGCA GAATGGCTGG CCTGCTGTGA GCAGGAGAGG AGATTCTAAG AAGGATAGTC 3420 

CCCCCTACAA CATACTGTCA TACTGCTGGG TTTTCATGGG TAGGAAAGCT TGTCCTGACC 3480 

CCAGCAGCAA AGAGGTGGCA GGTCGCTAAT GAATATATGC TTTATAATGT CCTTCTTCAT 3540 

TGCTGAGAGG GCAGCCTTAG AGCTGTGGAT TTCTGCATCC CCCCTGAGTC TGACCCATGG 3600 

ACACCTGTTT CATTCACTTT AGCATCACAG TGACCTTTGT ATGCTCTGTT CAGTCTGTGT 3 660 

CAGGCAGTAT GCTTGTCCTG AAGAGAGGTT TGGCTATCCC CACCCCACCC CACCCCACCC 3720 

TGTTCCTTTT TTATCAGGAG GACTTCAGAG CCAGGCCTGC AGCATTTTGT TTGAAAACAC 3780 

AATCAGCTCT GACAGTTAGA CATGCACACA GACGCCATAG CTGGATTGGA AACATTGATG 3840 

TTTTAAAAAT TTATTTTTTT TGGAAATAGT TGCACAAATG CTGCAATTTA GCTTTAAGGT 3900 

T CTAT AG ATT TTTAACTAGT CCAACACAGT CAGAAACATT GTTTTGAATC CTCTGTAAAC 3960 

CAAGGCATTA AT CT TAATAA ACCAGGATCC ATTTAGGTAC CACTTGATAT AAAAAGGATA 4020 

TCCATAATGA ATATTTTATA CTGCATCCTT TACATTAGCC ACTAAATACG TTATTGCTTG 4080 

ATGAAGACCT TT CACAGAAT CCTATGGATT GCAGCATTTC ACTTGGCTAC TTCATACCCA 4140 



193 



WO 02/086443 

TGCCTTAAAG AGGGGCAGTT TCTCAAAAGC AG AAACATG C CGCCAGTTCT CAAGTTTTCC 4200 

TCCTAACTCC ATTTGAATGT AAGGGCAGCT GGCCCCCAAT GTGGGGAGGT CCGAACATTT 4260 

TCTGAATTCC CATTTTCTTG TTCGCGGCTA AATGACAGTT TCTGTCATTA CTTAGATTCC 432 0 

GATCTTTCCC AAAGGTGTTG ATTTACAAAG AGGCCAGCTA AT AG CAG AAA TCATGACCCT 4380 

GAAAGAGAGA TGAAATTCAA GCTGTGAGCC AGGCAGGAGC TCAGTATGGC AAAGGTTCTT 4440 

GAGAATCAGC CATTTGGTAC AAAAAAGATT TTTAAAGCTT TTATGTTATA CCATGGAGCC 4500 

ATAGAAAGGC TATGGATTGT TTAAGAACTA TTTTAAAGTG TTCCAGACCC AAAAAGGAAA 4560 

AATAAAAAAA AAGGAATATT TGTACCCAAC AGCTAGAAGG ATTGCAAGGT AGATTTTTGT 4620 

TTTAAAATGG AGAGAAGTGG ACAGATAAGG CCATTTAATA TATCAAAGAT CAGTTGACAT 4680 
CTCCTAGGGA ATGATGAAAA CAGCAGGCTA T 

Seq ID NO: 18 Protein sequence: 
Protein Accession #: CAA53571 

1 11 21 31 41 51 

I I I I I I 

MSSWIRWHGP AMARLWGFCW LWGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP 60 

NSVDPENITE IFIANQKRLE IINEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI . 120 

NFTRNKLTSL SRKHFRHLDL SELILVGNPF TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180 

SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 

NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 3 00 

WCIPFTVKGN PKPALQWFYN GAI LNESKY I CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360 

LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNEIPST 420 
DVTDKTGREH LSVYAWVIA SWGFCLLVM LFLLKLARHS KFGMKGFVLF HKIPLDG 

Seq ID NO: 19 DNA sequence 

Nucleic Acid Accession #: NM__000228 

Coding sequence: 82-3600 

1 11 21 31 41 51 

l l l ] I I 

GCTTTCAGGC GATCTGGAGA AAGAACGGCA GAACACACAG CAAGGAAAGG TCCTTTCTGG 60 

GGATCACCCC ATTGGCTGAA GATGAGACCA TTCTTCCTCT TGTGTTTTGC CCTGCCTGGC 120 

CTCCTGCATG CCCAACAAGC CTGCTCCCGT GGGGCCTGCT ATCCACCTGT TGGGGACCTG 180 

CTTGTTGGGA GGACCCGGTT TCTCCGAGCT TCATCTACCT GTGGACTGAC CAAGCCTGAG 240 

ACCTACTGCA CCCAGTATGG CGAGTGGCAG ATGAAATGCT GCAAGTGTGA CTCCAGGCAG 300 

CCTCACAACT ACTACAGTCA CCGAGTAGAG AATGTGGCTT CATCCTCCGG CCCCATGCGC 360 

TGGTGGCAGT CCCAGAATGA TGTGAACCCT GTCTCTCTGC AGCTGGACCT GGACAGGAGA 420 

TTCCAGCTTC AAGAAGTCAT GATGGAGTTC CAGGGGCCCA TGCCCGCCGG CATGCTGATT 480 

GAGCGCTCCT CAGACTTCGG TAAGACCTGG CGAGTGTACC AGTACCTGGC TGCCGACTGC 540 

ACCTCCACCT TCCCTCGGGT CCGCCAGGGT CGGCCTCAGA GCTGGCAGGA TGTTCGGTGC 600 

CAGTCCCTGC CTCAGAGGCC TAATGCACGC CTAAATGGGG GGAAGGTCCA ACTTAACCTT 660 

ATGGATTTAG TGTCTGGGAT TCCAGCAACT CAAAGTCAAA AAATT CAAGA GGTGGGGGAG 720 

ATCACAAACT TGAGAGT CAA TTTCACCAGG CTGGCCCCTG TGCCCCAAAG GGGCTACCAC 780 

CCTCCCAGCG CCTACTATGC TGTGTCCCAG CTCCGTCTGC AGGGGAGCTG CTTCTGTCAC 840 

GGCCATGCTG ATCGCTGCGC ACCCAAGCCT GGGGCCTCTG CAGGCCCCTC CACCGCTGTG 900 

CAGGTCCACG ATGTCTGTGT CTGCCAGCAC AACACTGCCG GCCCAAATTG TGAGCGCTGT 960 

GCACCCTTCT ACAACAACCG GCCCTGGAGA CCGGCGGAGG GCCAGGACGC CCATGAATGC 102 0 

C AAAGGTG CG ACTGCAATGG GCACT CAGAG ACATGTCACT TTGACCCCGC TGTGTTTGCC 1080 

GCCAGCCAGG GGGCATATGG AGGTGTGTGT GACAATTGCC GGGACCACAC CGAAGGCAAG 1140 

AACTGTGAGC GGTGTCAGCT GCACTATTTC CGGAACCGGC GCCCGGGAGC TTCCATTCAG 1200 

GAGACCTGCA TCTCCTGCGA GTGTGATCCG GATGGGGCAG TGCCAGGGGC TCCCTGTGAC 1260 

CCAGTGACCG GGCAGTGTGT GTGCAAGGAG CATGTGCAGG GAGAGCGCTG TGACCTATGC 1320 

AAGCCGGGCT TCACTGGACT CACCTACGCC AACCCGCAGG GCTGCCACCG CTGTGACTGC 1380 

AACATCCTGG GGTCCCGGAG GGACATGCCG TGTGACGAGG AGAGTGGGCG CTGCCTTTGT 1440 

CTGCCCAACG TGGTGGGTCC CAAATGTGAC CAGTGTGCTC CCTACCACTG GAAGCTGGCC 1500 

AGTGGCCAGG GCTGTGAACC GTGTGCCTGC GACCCGCACA ACTCCCCTCA GCCCACAGTG 1560 

CAACCAGTTC ACAGGGCAGT GCCCTGTCGG GAAGGCTTTG GTGGCCTGAT GTGCAGCGCT 1620 

GCAGCCATCC GCCAGTGTCC AGACCGGACC TATGGAGACG TGGCCACAGG ATGCCGAGCC 1680 

TGTGACTGTG ATTTCCGGGG AACAGAGGGC CCGGGCTGCG ACAAGGC AT C AGGCCGCTGC 1740 

CTCTGCCGCC CTGGCTTGAC CGGGCCCCGC TGTGACCAGT GCCAGCGAGG CTACTGCAAT 1800 

CGCTACCCGG TGTGCGTGGC CTGCCACCCT TGCTTCCAGA CCTATGATGC GGACCTCCGG 1860 

GAGCAGGCCC TGCGCTTTGG TAGACTCCGC AATGCCACCG CCAGCCTGTG GTCAGGGCCT 192 0 

GGGCTGGAGG ACCGTGGCCT GGCCTCCCGG AT CCTAGATG CAAAGAGTAA GATTGAGCAG 1980 

ATCCGAGCAG TTCTCAGCAG CCCCGCAGTC ACAGAGCAGG AGGTGGCTCA GGTGGCCAGT 2 040 

GCCATCCTCT CCCTCAGGCG AACTCTCCAG GGCCTGCAGC TGGATCTGCC CCTGGAGGAG 210 0 

GAGACGTTGT CCCTTCCGAG AGACCTGGAG AGTCTTGACA GAAGCTTCAA TGGTCTCCTT 2160 

ACTATGTATC AGAGGAAGAG GGAGCAGTTT GAAAAAATAA GCAGTGCTGA TCCTTCAGGA 222 0 

GCCTT CCGGA TGCTGAGCAC AGCCTACGAG CAGTCAGCCC AGGCTGCTCA GCAGGTCTCC 2280 

GACAGCTCGC GCCTTTTGGA CCAGCTCAGG GACAGCCGGA GAGAGGCAGA GAGGCTGGTG 2340 

CGGCAGGCGG GAGGAGGAGG AGGCACCGGC AGCCCCAAGC TTGTGGCCCT GAGGCTGGAG 2400 

ATGTCTTCGT TGCCTGACCT GACACCCACC TTCAACAAGC TCTGTGGCAA CTCCAGGCAG 2460 

ATGGCTTGCA CCCCAATATC ATGCCCTGGT GAGCTATGTC CCCAAGACAA TGGCACAGCC 2520 

TGTGGCTCCC GCTGCAGGGG TGTCCTTCCC AGGGCCGGTG GGGCCTTCTT GATGGCGGGG 2580 

CAGGTGGCTG AGCAGCTGCG GGGCTTCAAT GCCCAGCTCC AGCGGACCAG GCAGATGATT 2640 

AGGGCAGCCG AGGAATCTGC CTCACAGATT CAATCCAGTG CCCAGCGCTT GGAGACCCAG 2700 

GTGAGCGCCA GCCGCTCCCA GATGGAGGAA GATGTCAGAC GCACACGGCT CCTAATCCAG 2760 

CAGGTCCGGG ACTTCCTAAC AGACCCCGAC ACTGATGCAG CCACTATCCA GGAGGTCAGC 2820 

• GAGGCCGTGC TGGCCCTGTG GCTGCCCACA GACTCAGCTA CTGTTCTGCA GAAGATGAAT 2880 

GAGATCCAGG CCATTGCAGC CAGGCTCCCC AACGTGGACT TGGTGCTGTC CCAGACCAAG 2 940 

CAGGACATTG CGCGTGCCCG CCGGTTGCAG GCTGAGGCTG AGGAAGCCAG GAGCCGAGCC 3000 

CATGCAGTGG AGGGCCAGGT GGAAGATGTG GTTGGGAACC TGCGGCAGGG GACAGTGGCA 3 060 

CTGCAGGAAG CTCAGGACAC CATGCAAGGC ACCAGCCGCT CCCTTCGGCT TATCCAGGAC 312 0 

AGGGTTGCTG AGGTTCAGCA GGTACTGCGG C CAG CAG AAA AGCTGGTGAC AAGCATGACC 3180 

AAGCAGCTGG GTGACTTCTG GACACGGATG GAGGAGCTCC GCCACCAAGC CCGGCAGCAG 3240 

GGGGCAGAGG CAGTCCAGGC CCAGCAGCTT GCGGAAGGTG CCAGCGAGCA GGCATTGAGT 3300 

GCCCAAGAGG GATTTGAGAG AATAAAACAA AAGTATGCTG AGTTGAAGGA CCGGTTGGGT 3360 
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CAGAGTTCCA TGCTGGGTGA GCAGGGTGCC 
GAGCTGTTTG GGGAGACCAT GGAGATGATG 
CTGCGGGGCA GCCAGGCCAT CATGCTGCGC 
GTGGAGCAGA TCCGTGACCA CATCAATGGG 
TGCTACAGCT TCCAGCCCGT TGCCCCACTC 
GATTGGGTTG GAATGCTTTC CATCTCCAGG 
GACCACCCCT GGTGTGTAGC TAGTAAGATT 
GGGACAGTTA CACTTGACAG ACAAAGATGG 
CTCTCAAGTC AAGGAAGCTG GGCTGGGCAG 
GGAAT CCTGG ACCAAGCACA AAAACTTAAC 
AAAATCTTTG G 

Seq ID NO: 20 Protein sequence: 
Protein Accession #: NP 000219 



PCT/US02/12476 



CGGATCCAGA 
GACAGGATGA 
TCGGCGGACC 
CGCGTGCTCT 
ATCTGCCGCC 
AGACTTTCAT 
ACCCTGAGCT 
TGGAGATTGG 
TATCCCCCGC 
AAAAGTGATG 



GTGTGAAGAC 
AAGACATGGA 
TGACAGGACT 
ACTATGCCAC 
TTTGCTTTTG 
GCAGCCTAAA 
GCAGCTGAGC 
CATGCCATTG 
CTTTAGTTCT 
TAAAAATGAA 



11 



21 



31 



41 



AGAGGCAGAG 
GTTGGAGCTG 
GGAGAAGCGT 
CTGCAAGTGA 
GTTGGGGGCA 
GTACAGCCTG 
CTGAGCCAAT 
AAAC TAAGAG 
CCACTGGGGA 
AAGCCAAATA 



51 



MRPFFLLCFA 
EWQMKCCKCD 
MEFQGPMPAG 
NARLNGGKVQ 
VSQLRLQGSC 
PWRPAEGQDA 
HYFRNRRPGA 
TYANPQGCHR 
CACDPHNSPQ 
TEGPGCDKAS 
RLRNATASLW 
TLQGLQLDLP 
AYEQSAQAAQ 
TPTFNKLCGN 
GFNAQLQRTR 
DPDTDAATIQ 
RLQAEAEEAR 
VLRPAEKLVT 
IKQKYAELKD 
MLRSADLTGL 



LPGLLHAQQA 
SRQPHNYYSH 
MLIERSSDFG 
LNLMDLVSGI 
FCHGHADRCA 
HE CQRCDCNG 
SIQETCISCE 
CDCNILGSRR 
PTVQPVHRAV 
GRCLCRPGLT 
SGPGLEDRGL 
LEEETLSLPR 
QVSDSSRLLD 
SRQMACTPIS 
QMIRAAEESA 
EVSEAVLALW 
SRAHAVEGQV 
SMTKQLGDFW 
RLGQSSMLGE 
EKRVEQIRDH 



CSRGACYPPV 
RVENVASSSG 
KTWRVYQYLA 
PATQSQKIQE 
PKPGASAGPS 
HSETCHFD PA 
CDPDGAVPGA 
DMPCDEESGR 
PCREGFGGLM 
GPRCDQCQRG 
ASRILDAKSK 
DLESLDRSFN 
QLRDSRREAE 
CPGELCPQDN 
SQIQSSAQRL 
LPTDSATVLQ 
EDWGNLRQG 
TRMEELRHQA 
QGARIQSVKT 
I NGRVLY YAT 



GDLLVGRTRF 
PMRWWQSQND 
ADCTSTFPRV 
VGE I TNLRVN 
TAVQVHDVCV 
VFAASQGAYG 
PCDPVTGQCV 
CLCLPNWGP 
CSAAAIRQCP 
YCNRYPVCVA 
IEQIRAVLSS 
GIiLTMYQRKR 
RLVRQAGGGG 
GTACGSRCRG 
ETQVSASRSQ 
KMNEIQAIAA 
TVALQEAQDT 
RQQGAEAVQA 
EAEELFGETM 
CK 



LRASSTCGLT 
VNPVSLQLDL 
RQGRPQSWQD 
FTRLAPVPQR 
CQHNTAGPNC 
GVCDNCRDHT 
CKEHVQGERC 
KCDQ CAP YHW 
DRTYGDVATG 
CHPCFQTYDA 
PAVTEQEVAQ 
EQFEKISSAD 
GTGSPKLVAL 
VLPRAGGAFL 
MEEDVRRTRL 
RLPNVDLVLS 
MQGTSRSLRL 
QQLAEGASEQ 
EMMDRMKDME 



KPETYCTQYG 
DRRFQLQEVM 
VRCQSLPQRP 
GYHPPSAYYA 
ERCAPFYNNR 
EGKNCERCQL 
DLCKPGFTGIi 
KLASGQGCEP 
CRACD CDFRG 
DLREQALRFG 
VASAILSLRR 
PSGAFRMLST 
RLEMSSLPDL 
MAGQVAEQLR 
LIQQVRDFLT 
QTKQDIARAR 
IQDRVAEVQQ 
ALSAQEGFER 
LELLRGSQAI 



Seq ID NO: 21 DNA sequence 

Nucleic Acid Accession #: NM_003722 

Coding sequence: 145-1491 



TCGTTGATAT 
ACAGTACTGC 
AAAGAAAGTT 
CCAGAGGT TT 
ATTGACTTGA 
AGCATGGACT 
ACGAACCTGG 
AGTCCCTATA 
CCCAGCTCCA 
CCAGGCCCGC 
TGGACGTATT 
CAGATCAAGG 
AAAAAAGCTG 
GAATTCAA CG 
CATGCCCAGT 
CCACCCCAGG 
TGTGTTGGAG 
GGGCAAGTCC 
AGGAAGGCGG 
GATGGTACGA 
AAACGAAGAT 
GAAATGCTGT 
ATTGAAACGT 
CTTTCAGCCT 
GACGTCTTCT 
TCTATATTTT 
TGTGTGTGCG 
CCCAACTGCT 
TTACAAGAAA 
GAACCACTGT 
GAAAGGGGCA 
AATTCACAGG 
AAAAAAGTTG 
CCCTTTTAAT 
TACTGCTGGG 
TTTGTGAGAA 
GCTGTGTACC 
CATGAAACCC 
CTCATTTTGT 
TGTTTACCAT 
AATTTGCTTA 
CTGATACTGT 
AGACGTGTTA 



11 

I 

CAAAGACAGT 
CCTGACCCTT 
AT T ACCG AT C 
TCCAGCATAT 
ACTTTGTGGA 
GTATCCGCAT 
GGCTCCTGAA 
ACACAGACCA 
CCTTCGATGC 
ACAGTTT CG A 
CCACTGAACT 
TGATGACCCC 
AGCACGTCAC 
AGGGACAGAT 
ATGTAGAAGA 
TTGGCACTGA 
GGATGAACCG 
TGGGCCGACG 
ATGAAGATAG 
AGCGCCCGTT 
CCCCAGATGA 
TGAAGAT C AA 
ACAGGCAACA 
GCTTCAGGAA 
TTAGACATTC 
AAGTGTGTGT 
TGTGTAT CTA 
CAAAGGCACA 
GGATGTTTTC 
GTTTGTCTGT 
TTAAGATGTT 
GAAGCTTTTG 
TTATTGTCTG 
GCTGGTCATG 
CAGCGAGGTG 
CTTGCATTAT 
TGCCTCTGCC 
TGGAAGACCT 
GCTTTTAATA 
TATTCAAAGC 
ATTAGAGCTT 
TCAGTGCATT 
AAATCAGCAC 



21 

I 

TGAAGGAAAT 
ACAT CCAGCG 
CACCATGTCC 
CTGGGATTTT 
TGAACCATCA 
GCAGGACTCG 
CAGCATGGAC 
CGCGCAGAAC 
TCTCTCTCCA 
CGTGTCCTTC 
GAAGAAACTC 
ACCTCCTCAG 
GGAGGTGGTG 
TGCCCCTCCT 
TCCCATCACA 
ATT CACGACA 
CCGTCCAATT 
CTGCTTTGAG 
CATCAGAAAG 
TCGT CAGAAC 
TGAACTGTTA 
AGAGTCCCTG 
GCAACAGCAG 
TGAGCTTGTG 
CAAGCCCCCA 
GTTGTATTTC 
GCCCTCATAA 
AAGCCACTAG 
TGCAGATTTT 
GAGCTTTCTG 
TATTGGAACC 
AGCAGGTCTC 
TG CATAAGT A 
TAATAATATT 
ATCATTACCA 
TTGTGTCCTC 
ACTGTATGTT 
ACTACAAAAA 
GAAAGACAAA 
TCAAAATAGA 
CTATCCCTCA 
TAGCCAGGAG 
TCCTGGACTG 



31 
I 

GAATTTTGAA 
TTTCGTAGAA 
CAGAGCACAC 
CTGGAACAGC 
GAAGATGGTG 
GACCTGAGTG 
CAGCAGATTC 
AGCGTCACGG 
TCACCCGCCA 
CAGCAGTCGA 
TACTGCCAAA 
GGAGCTGTTA 
AAGCGGTGCC 
AGTCATTTGA 
GGAAGACAGA 
GTCTTGTACA 
TTAATCATTG 
GCCCGGATCT 
CAGCAAGTTT 
ACACATGGTA 
TACTTACCAG 
GAACTCATGC 
CAGCACCAGC 
GAGCCCCGGA 
AACCGATCAG 
CATGTGTATA 
ACAGGACTTG 
TGAGAGAATC 
GTATCCTTAG 
TTGTTTCCTG 
CTTTTCTGTC 
AAACTTAAGA 
AGTTGTAGGT 
GCAAGTAGTA 
AAAGTAAT C A 
CCCTCATGTG 
GGCATCTGTT 
AACTGTTGTT 
TCCACCCCAG 
ATTTGAAGCC 
AGCCTACCTA 
ACTTACGTTT 
GAAATTAAAG 



41 
1 

ACTTCACGGT 
ACCCAGCTCA 
AGACAAATGA 
CTATATGTTC 
CGACAAACAA 
ACCCCATGTG 
AGAACGGCTC 
CGCCCTCGCC 
TCCCCTCCAA 
GCACCGCCAA 
TTGCAAAGAC 
TCCGCGCCAT 
CCAACCATGA 
TTCGAGTAGA 
GTGTGCTGGT 
ATTTCATGTG 
TTACTCTGGA 
GTGCTTGCCC 
CGGACAGTAC 
TCCAGATGAC 
TGAGGGGCCG 
AGTACCTTCC 
ACTTACTTCA 
GAGAAACTCC 
TGTACCCATA 
TGTGAGTGTG 
AAGACACTTT 
TTTTGAAGGG 
ACCGGCCATT 
GGAGGGAGGG 
TTCTTCTGTT 
TGTCTT T TT A 
GACTGAGAGA 
AGAAACGAAG 
ACTTTGTGGG 
TAGGTAGAAC 
ATGCTAAAGT 
TGGCCCCCAT 
TAATATTGCC 
CTCTCACAAA 
CCATAAAACC 
TGAGTAAGTG 
ATTGAAAGGG 



51 
I 

GTGCCACCCT 
TTTCTCTTGG 
ATTCCTCAGT 
AGTTCAGCCC 
GATTGAGATT 
GCCACAGTAC 
CTCGTCCACC 
CTACGCACAG 
CACCGACTAC 
GTCGGCCACC 
ATGCCCCATC 
GCCTGTCTAC 
GCTGAGCCGT 
GGGGAACAGC 
ACCTTATGAG 
TAACAGCAGT 
AACCAGAGAT 
AGGAAGAGAC 
AAAGAACGGT 
AT C CAT CAAG 
TGAGACTTAT 
TCAGCACACA 
GAAACATCTC 
AAAACAATCT 
GAGCCCTATC 
TGTGTGTGTA 
GGCT CAGAGA 
ACTCAAACCT 
GGTGGGTGAG 
GTCAGGTGGG 
GTTTTTCTAA 
AGAAAAGGAG 
CTCAGTCAGA 
GTGTCAAGTG 
TGGAGAGTTC 
ATTTCTTAAT 
TTTTCTTGTA 
AG CAGGTG AA 
CTTACGTAGT 
ATCTGTGATT 
AGCCATATTA 
AGATCCAAGC 
TAGACTACTT 



3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
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TTCTTTTTTT TACTCAAAAG TTTAGAGAAT CTCTGTTTCT TTCCATTTTA AAAACATATT 2640 

TTAAGATAAT AGCATAAAGA CTTTAAAAAT GTTCCTCCCC TCCATCTTCC CACACCCAGT 2700 

CACCAGCACT GTATTTTCTG TCACCAAGAC AATGATTTCT TGTTATTGAG GCTGTTGCTT 2760 

TTGTGGATGT GTGATT TTAA TTTTCAATAA ACTTTTGCAT CTTGGTTTAA AAGAAA 

Seq ID NO: 22 Protein sequence: 
Protein Accession #: NP_003713 

1 11 21 31 41 51 

I i I I I I 

MSQSTQTNEF LSPEVFQHIW DPLEQPICSV QPIDLNFVDE PSEDGATNKI EISMDCIRMQ 60 

DSDLSDPMWP QYTNLGLLNS MDQQIQNGSS STSPYNTDHA QNSVTAPSPY AQPSSTFDAL 120 

SPSPAIPSNT DYPGPHSFDV SFQQSSTAKS ATWTYSTELK KLYCQIAKTC PIQIKVMTPP 180 

PQGAVIRAMP VYKKAEHVTE WKRCPNHEL SREFNEGQIA PPSHLIRVEG NSHAQYVEDP 240 

ITGRQSVLVP YEPPQVGTEF TTVLYNFMCN SSCVGGMNRR PILIIVTLET RDGQVLGRRC 300 

FEARICACPG RDRKADEDS I RKQQVSDSTK NGDGTKRPFR QNTHGIQMTS I KKRRS PDDE 3 60 

LLYLPVRGRE TYEMLLKIKE SLELMQYLPQ HTIETYRQQQ QQQHQHLLQK HLLSACFRNE 420 
LVEPRRETPK QSDVFFRHSK PPNRSVYP 



Seq ID NO: 23 DNA sequence 

Nucleic Acid Accession ft: NM_001944.1 

Coding sequence: 84-3083 

1 11 21 31 41 51 

I i I I I I 

TTTTCTTAGA CATTAACTGC AGACGGCTGG CAGGATAGAA GCAGCGGCTC ACTTGGACTT 60 
TTTCACCAGG G AAAT C AG AG ACAATGATGG GGCTCTTCCC CAGAACTACA GGGGCTCTGG 120 
CCATCTTCGT GGTGGT CAT A TTGGTTCATG GAGAATTGCG AATAGAGACT AAAGGT CAAT 180 
ATGATGAAGA AGAGATGACT ATGCAACAAG CTAAAAGAAG GCAAAAACGT GAATGGGTGA 240 
AATTTGCCAA ACCCTGCAGA GAAGGAGAAG AT AAC TCAAA AAGAAACCCA ATTGCCAAGA 300 
TTACTTCAGA TTACCAAGCA ACCCAGAAAA TCACCTACCG AATCTCTGGA GTGGGAATCG 360 
ATCAGCCGCC TTTTGGAATC TTTGTTGTTG ACAAAAACAC TGGAGATATT AACATAACAG 42 0 
CTATAGTCGA CCGGGAGGAA ACTCCAAGCT TCCTGATCAC ATGTCGGGCT CTAAATGCCC 480 
AAGGACTAGA TGTAGAGAAA CCACTTATAC TAACGGTTAA AATTTTGGAT ATTAATGATA 540 
ATCCTCCAGT ATTTT CACAA CAAATTTTCA TGGGTGAAAT TGAAGAAAAT AGTGCCTCAA 600 
ACTCACTGGT GATGATACTA AATGCCACAG ATGCAGATGA ACCAAACCAC TTGAATTCTA 660 
AAATTGCCTT CAAAATTGTC T CT CAGGAAC CAGCAGGCAC ACCCATGTTC CTCCTAAGCA 72 0 
GAAACACTGG GGAAGTCCGT ACTTTGACCA ATTCTCTTGA CCGAGAGCAA GCTAGCAGCT 780 
ATCGTCTGGT TGTGAGTGGT GCAGACAAAG ATGGAGAAGG ACTATCAACT CAATGTGAAT 840 
GTAATATTAA AGTGAAAGAT GTCAACGATA ACTTCCCAAT GTTTAGAGAC TCTCAGTATT 900 
CAGCACGTAT TGAAGAAAAT ATTTTAAGTT CTGAATTACT TCGATTTCAA GTAACAGATT 960 

TGGATGAAGA GTACACAGAT AATTGGCTTG CAGTATATT T CTTTACCTCT GGGAATGAAG 102 0 

GAAATTGGTT TGAAATACAA ACTGATCCTA GAACTAATGA AGGCATCCTG AAAGTGGTGA 10 80 

AGGCTCTAGA TTATGAACAA CTACAAAGCG TGAAACTTAG TATTGCTGTC AAAAACAAAG 1140 

CTGAATTTCA CCAATCAGTT AT CTCT CGAT ACCGAGTTCA GTCAACCCCA GTCACAATTC 12 00 

AGGTAATAAA TGTAAGAGAA GGAATTGCAT TCCGTCCTGC TTCCAAGACA TTTACTGTGC 1260 

AAAAAGG CAT AAGTAGCAAA AAATTGGTGG ATTATATCCT GGGAACATAT CAAGCCATCG 132 0 

ATGAGGACAC TAACAAAGCT GCCTCAAATG TCAAATATGT CATGGGACGT AACGATGGTG 13 80 

GATACCTAAT GATTGATT C A AAAACTGCTG AAATCAAATT TGTCAAAAAT ATGAACCGAG 144 0 

ATTCTACTTT CATAGTTAAC AAAACAATCA CAGCTGAGGT TCTGGCCATA GATGAATACA 150 0 

CGGGTAAAAC TTCTACAGGC ACGGTATATG TTAGAGTACC CGATTTCAAT GACAATTGTC 1560 

CAACAGCTGT CCTCGAAAAA GATGCAGTTT GCAGTTCTTC ACCTTCCGTG GTTGTCTCCG 162 0 

CTAGAACACT GAATAATAGA TACACTGGCC CCTATACATT TGCACTGGAA GATCAACCTG 1680 

TAAAGTTGCC TGCCGTATGG AGT AT CACAA CCCTCAATGC TACCTCGGCC CTCCTCAGAG 1740 

CCCAGGAACA GATACCTCCT GGAGTATACC ACATCTCCCT GGTACTTACA G AC AGT C AG A 180 0 

ACAATCGGTG TGAGATGCCA CGCAGCTTGA CACTGGAAGT CTGTCAGTGT GACAACAGGG 18 60 

GCATCTGTGG AACTTCTTAC CCAACCACAA GCCCTGGGAC CAGGTATGGC AGGCCGCACT 1920 

CAGGGAGGCT GGGGCCTGCC GCCATCGGCC TGCTGCTCCT TGGTCTCCTG CTGCTGCTGT 1980 

TGGCCCCCCT TCTGCTGTTG ACCTGTGACT GTGGGGCAGG TTCTACTGGG GGAGTGACAG 2 040 

GTGGTTTTAT CCCAGTTCCT GATGGCTCAG AAGGAACAAT TCATCAGTGG GGAATTGAAG 2100 

GAGCCCATCC TGAAGACAAG G AAAT CACAA ATATTTGTGT GCCTCCTGTA ACAGCCAATG 2160 

GAGCCGATTT CATGGAAAGT TCTGAAGTTT GTACAAATAC GTATGCCAGA GGCACAGCGG 2220 

TGGAAGGCAC TTCAGGAATG GAAATGACCA CTAAGCTTGG AGCAGCCACT GAATCTGGAG 2280 

GTGCTGCAGG CTTTGCAACA GGGACAGTGT CAGGAGCTGC TTCAGGATTC GGAGCAGCCA 2340 

CTGGAGTTGG CATCTGTTCC TCAGGGCAGT CTGGAACCAT GAGAACAAGG CATTCCACTG 2400 

GAGGAACCAA TAAGGACTAC GCTGATGGGG CGATAAGCAT GAATTTTCTG GACTCCTACT 2460 

TTTCTCAGAA AGCATTTGCC TGTGCGGAGG AAGACGATGG CCAGGAAGCA AATGACTGCT 2520 

TGTTG AT CT A TGATAATGAA GGCGCAGATG CCACTGGTTC TCCTGTGGGC TCCGTGGGTT 2580 

GTTGCAGTTT TATTGCTGAT GACCTGGATG ACAGCTTCTT GGACTCACTT GGACCCAAAT 2 640 

TTAAAAAACT TGCAGAGATA AGCCTTGGTG TTGATGGTGA AGGCAAAGAA GTTCAGCCAC 2 700 

CCTCTAAAGA CAGCGGTTAT GGGATTGAAT CCTGTGGCCA TCCCATAGAA GT CCAGCAGA 2760 

CAGGATTTGT TAAGTGCCAG ACTTTGTCAG GAAGT CAAGG AGCTTCTGCT TTGTCCGCCT 282 0 

CTGGGTCTGT CCAGCCAGCT GTTTCCATCC CTGACCCTCT GCAGCATGGT AACTATTTAG 2 880 

TAACGGAGAC TTACTCGGCT TCTGGTTCCC TCGTGCAACC TTCCACTGCA GGCTTTGATC 2 940 

CACTTCTCAC ACAAAATGTG ATAGTGACAG AAAGGGTGAT CTGTCCCATT TCCAGTGTTC 3 000 

CTGGCAACCT AGCTGGCCCA ACGCAGCTAC GAGGGTCACA TACTATGCTC TGTACAGAGG 3 060 

ATCCTTGCTC CCGTCTAATA TGACCAGAAT GAGCTGGAAT ACCACACTGA CCAAATCTGG 3120 

ATCTTTGGAC TAAAGTATTC AAAATAGCAT AGCAAAGCTC ACTGTATTGG GCTAATAATT 318 0 

TGGCACTTAT TAGCTTCTCT CATAAACTGA TCACGATTAT AAATTAAATG TTTGGGTTCA 3240 

TACCCCAAAA GCAATATGTT GTCACTCCTA ATTCTCAAGT ACTATTCAAA TTGTAGTAAA 3300 
TCTTAAAGTT TTTCAAAACC CTAAAATCAT ATTCGC 

Seq ID NO: 24 Protein sequence: 
Protein Accession #: NP_0 01935.1 

1 11 21 31 41 51 



196 



WO 02/086443 



PCT/US02/12476 



10 
15 

20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



MMGLFPRTTG 
GEDNSKRNPI 
PSFLITCRAL 
ATDADEPNHL 
DKDGEGLSTQ 
WLAVYFFTSG 
SRYRVQSTPV 
SNVKYVMGRN 
VYVRVPD FND 
ITTLNATSAL 
TTSPGTRYGR 
GSEGTIHQWG 
MTTKLGAATE 
DGAISMNFLD 
LDDSFLDSLG 
LSGSQGASAL 
VTERVICPIS 



ALAIFWVIL 
AKITSDYQAT 
NAQGLDVEKP 
NSKIAFKIVS 
CECNIKVKDV 
NEGNWFEIQT 
TIQVINVREG 
DGGYLMIDSK 
NCPTAVLEKD 
LRAQEQIPPG 
PHSGRLGPAA 
IEGAHPEDKE 
SGGAAGFATG 
SYFSQKAFAC 
PKFKKLAEIS 
SASGSVQPAV 
SVPGNLAGPT 



VHGELRIETK 
QKITYRISGV 
LILTVKILDI 
QEPAGTPMFL 
NDNFPMFRDS 
DPRTNEGILK 
IAFRPASKTF 
TAEIKFVKNM 
AVCSSSPSW 
VYHISLVLTD 
IGLIiLLGLIiL 
ITNICVPPVT 
TVSGAASGFG 
AEEDD GQEAN 
LGVDGEGKEV 
SIPDPLQHGN 
QLRGSHTMLC 



I 

GQYDEEEMTM 
GIDQPPFGIF 
NDNPPVFSQQ 
LSRNTGEVRT 
QYSARIEENI 
WKALDYEQL 
TVQKGISSKK 
NRDSTF I VNK 
VSARTLNNRY 
SQNNRCEMPR 
LLLAPLLLLT 
ANGADFMESS 
AATGVGICSS 
DCLLIYDNEG 
QPPSKDSGYG 
YLVTETYSAS 
TEDPCSRLI 



QQAKRRQKRE 
WDKNTGDIN 
I FMGE I EENS 
LTNSLDREQA 
LSSEUjRFQV 
QSVKLSIAVK 
LVDYILGTYQ 
TITAEVLAID 
TGPYTFALED 
SLTLEVCQCD 
CDCGAGSTGG 
EVCTNTYARG 
GQSGTMRTRH 
ADATGSPVGS 
IESCGHPIEV 
GSLVQPSTAG 



WVKFAKPCRE 
ITAIVDREET 
ASNSLVMILN 
SSYRLWSGA 
TDLDEEYTDN 
NKAEFHQSVI 
AIDEDTNKAA 
EYTGKTSTGT 
QPVKLPAVWS 
NRGICGTSYP 
VTGGFIPVPD 
TAVEGTSGME 
STGGTNKDYA 
VGCCSFIADD 
QQTGFVKCQT 
FDPLLTQNVI 



Seq ID NO: 25 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 56-1642 



AGTATCCCAG 
GCAAGGGATC 
CATGTTTGAG 
CTGCTCTGTC 
GGAGAAGGTG 
GGAAGATCAG 
GGACTCTTTT 
CTTTTCCCAG 
GGAGATGGTA 
TAACTCAGGG 
GTCCCTGGCG 
GCCCTTGCTC 
GAAGAAGCTG 
GAGGAGTGTC 
TGGGCTCTCT 
ATGGGCACAG 
GATCT CATTC 
ACAGCGCAAG 
AGATCTCAAC 
TCGTAAGAAC 
CATCTTCTCA 
CGAGCTGTCA 
ACGGTTGAAG 
TGCCCTTCGT 
CAAGTTGACT 
CAATGTGAAT 
CATTGCTAGC 
ACTCGTTCAT 
CAGACACAGG 
AGGAGCTCCT 
AGCTACAGCT 
AACAGCGGGA 
TGTATGAAGA 
TTCAGGAGCG 
AGTCAGTGGC 
CAGCTTCTGC 
CAGAGCTAAA 
CCTCAGCCAA 
TAAGGCTGTT 
CTTGTTGCCA 
TCTTAATCAA 
TGGACCTTCG 
AAGGCCAGGT 
AACAACCACC 
AAAGCTCAAC 
TCAAATCTGG 
GGCCCTGAGG 
TAT CAGGAAT 
TATAACCACC 
GCACACAAAA 
GTAGCAAAAT 



11 
I 

GAGGAGCAAG 
CTTTCTCCGC 
TCCACAGCTG 
GTGTCTACCT 
AAAGTATACT 
GGTTGTGTCC 
GCCCTGAAGA 
ATCTTTGGGC 
AAGGATGTAC 
AAAACCCACA 
CTGATCTTCA 
TCCAATGAGG 
TCCCTGCTAA 
TACAT CGAAA 
TCTATCAGTC 
CCAGACACTG 
TTTGAGATCT 
AGGCAGACTT 
TGGATT CATG 
CAGAGCTTTG 
AT CAGGATCC 
CTCTGTGATC 
GAAGCAGGAA 
CAAAACCAGC 
CGAGTGTTCC 
CCCTGTGCAT 
CAGGTGACTT 
CAAGGAACAT 
CCTTGATGAT 
ACAAGTTGTG 
GGAGATGCAT 
ACAGTGGTGC 
AAAACTAAAT 
GGATGAAAAG 
CCATCAGCAA 
CTCCACCCAG 
CTCTACCACT 
GCCCTTCACC 
GCGGACAGAG 
CAGCACTGGG 
ACAGGACCAG 
GAAGAAGGCA 
TTCTGCCAAA 
AGGGAAGAAA 
AGACTGCAGC 
GCCTTTTGGC 
TGGGTCAGCT 
TATATCCAGG 
TATGTAATCT 
ACAGTTATAT 
CAT T AAAACA 



21 

I 

TGGCACGTCT 
CAGCGGGCTT 
CAGATTTGGG 
CCCTAGAGGA 
TGAGGGTTAG 
GTATTGAGAA 
GCAATGAACG 
CAGAAGTGGG 
TCAAAGGGCA 
CGATTCAAGG 
ATAGCCTCCA 
TAATCTGGCT 
ATGGAGGCCT 
GTCGGATAGG 
AGTGTACCAG 
CCCCACTACC 
ACAACGAACT 
TGCGGCTATG 
TGCAAGATGC 
CCAGCACCCA 
TACACCTTCA 
TGGCTGGCTC 
ACATTAACAC 
AGAACCGGTC 
AAGGTTTCTT 
CTACCTATGA 
GTGCATGCCC 
AGTCTTCAGG 
GATATTGAAA 
GAAGCCATGA 
CTCCGAGATG 
AGTGAACATT 
ATCCTCAAGG 
ATTGAAGAGC 
TCAGGGTCTG 
CAGCTTCAGG 
GAAGAGTTGC 
ATTGATGTGG 
CTTCAGAAAC 
GCAGGAAAAC 
ACTCTGGCTG 
GCATGTATTG 
AAGCGCCTTG 
CCATTCCTTC 
CCTTATGCCC 
AAAAAGTACT 
ACTCTCCTGA 
ATGCAATACT 
CATGTTGTTG 
TAAAGATATT 
AATTATAAAA 



31 
I 

TCGGACCTAG 
GCTGTCCGAT 
GTCTGTGGTA 
CAAG CAGCAG 
GCCCTTGTTA 
TGTGGAGACC 
GGGAATTGGC 
ACAGGCATCC 
GAACTGGCTC 
TACCATCAAG 
AGGCCAACTT 
AGACAGCAAG 
CCAAGAGGAG 
TACCAGCACC 
CAGTAGCCAG 
TGTCCCGGCA 
GCTTTATGAC 
CGAGGATCAA 
TGAGGAGGCC 
CCTCAACCAG 
GGGGGAAGGA 
AGAGCGCTGC 
CTCTCTACAC 
AAAGCAGAAC 
CACAGGCCGA 
TGAAACTCTT 
CACCTATGCA 
TATCCCCCAG 
ATGAAGCTGA 
AGACACTGCT 
AAATTTGCAA 
TGGACACCCA 
AGTCACTGAC 
TAGAAGCTCT 
AATTGGCCCT 
AGGTTAAAGC 
ATAAGTATCA 
ACAAGAAGTT 
TTGGTGAGTC 
TTCGTCAAGC 
AACTGCAGAA 
CTGAGCAGTA 
GTACCAACCA 
GAAATTTACT 
GG AT CCTACG 
AAGGCTGTGG 
AGAAATAGGT 
CAGACACTAG 
TTTTTTTTTA 
ATTGTTCACA 
GGGACAGAAA 



41 
I 

GCTGCCCCTG 
GACGATGTCG 
CGCAAGAACC 
GTTCCATCTG 
CCTTCAGAGT 
CTTGTTCTAC 
CAAGCCACAC 
TTCTTCAACC 
ATCTATACAT 
GATGGAGGGA 
CATCCAACAC 
CAGAT CCGAC 
GAGCTGTCCA 
AGCTTCGACA 
CTGGATGAAA 
AACATTCGCT 
CTATTAGAAC 
AATGGCAATC 
TGGAAGCTCC 
AACTCCAGCC 
GATATAGTCC 
AAAGATCAGA 
ACCCTGGGCC 
CTGGTTCCCT 
GGCCGTTCCT 
CATGTGGCCA 
ACTGGGATTC 
CTTAGAGAAA 
CATCTCCATG 
TTTGAAGGAA 
TGAGATGGTA 
AAAGGAACTA 
AAGTTTTTAC 
CTTGCAGGAA 
ACGGCGGTCA 
TAAATTACAG 
GAAAATGTTA 
AGAAGAGGGC 
TCTCCAATCA 
CTTGACCACT 
CAACATGGTG 
TCATACTGTG 
GGAAAATCAG 
TCCCCGAACA 
CTCACGGCGT 
GGAAAGAGAA 
CTCTTTTATG 
CTTTTTTCTC 
TTTACTTATA 
TTTTTTATTG 
AA 



51 
1 

CCGTCATGTC 
TAGTTTCTCC 
TGCTATCAGA 
AGGACAGTAT 
TGGAACGACA 
AAGCACCCAA 
ACAGGTTCAC 
TAACTGTGAA 
ATGGAGT CAC 
TTCTCCCCCG 
CTGAT CTGAA 
AGGAGGAAAT 
CTTCCTTGAA 
GTGGCATTGC 
CAAGT CAT CG 
TCTCCATCTG 
CGCCTAGCCA 
CCTATGTGAA 
TAAAAGTGGG 
GCAGTCACAG 
CCAAGATCAG 
AGAGTGGTGA 
GCTGTATTGC 
TCCGTGACAG 
GCATGATTGT 
AGTTCTCAGC 
CCATCCCTGC 
GGGGCTAAGG 
TATGG CAAAG 
CGACAGGAAA 
GAACAGATGC 
TTGGAGGAAA 
CAAGAAGAGA 
GCCAGACAAC 
CAAAGGTTGG 
CAGTGCAAAG 
GAACCACCAC 
CAGAAGAATA 
G C AG AG AG AG 
TGTGATGACA 
CTAGTGAAAC 
TTGAAACTCC 
CAACCAAACC 
CCAACCTGCC 
TCCCCTTTAC 
GAGCAGTCAT 
CTTTACCATA 
ACTTTTGTAT 
TGATTTCTAT 
AATTCCAAAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 



80 



85 



Seq ID NO: 2 6 Protein sequence: 
Protein Accession #: Eos sequence 



MSQGILSPPA 
SMEKVKVYLR 
FTFSQIFGPE 
PRSIALIFNS 
LKRSVYIESR 



11 
I 

GLLSDDDWV 
VRPLLPSELE 
VGQASFFNLT 
LQGQLHPTPD 
IGTSTSFDSG 



21 
I 

SPMFESTAAD 
RQEDQGCVRI 
VKEMVKDVLK 
LKPIjIiSNEVI 
IAGLSSISQC 



31 
I 

LGSWRKNLL 
ENVETLVLQA 
GQNWLIYTYG 
WLDSKQIRQE 
TSSSQLDETS 



41 
I 

SDCSWSTSL 
PKDSFALKSN 
VTNSGKTHT I 
EMKKLSLLNG 
HRWAQPDTAP 



51 
I 

EDKQQVPSED 
ERGIGQATHR 
QGTIKDGGIL 
GLQEEELSTS 
LPVPANIRFS 



60 
120 
180 
240 
300 



197 



5 
10 
15 

20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

IWISFFEIYN ELLYDLLEPP SQQRKRQTLR LCEDQNGNPY VKDLNWIHVQ DAEEAWKLIiK 
VGRKNQSFAS THLNQNSSRS HSIFSIRILH LQGEGDIVPK I SELSLCDLA GSERCKDQKS 
GERLKEAGNI NTSLHTLGRC IAALRQNQQN RSKQNLVPFR DSKLTRVFQG FFTGRGRSCM 
IVNVNPCAST YDETLHVAKF SAIASQVTCA CPTYATGIPI PALVHQGT 

Seq ID NO: 27 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 13-1424 



PCT/US02/12476 



360 
420 
480 



TAGAAGTTTA 
CTTCCCCTGA 
TTAGAAAAAT 
GGAAACTTAA 
GGGCAACTGG 
GTCCATCATT 
AGAATCAATA 
GCTTTCCAAG 
GCTGACATTT 
AAAGGTGGAA 
TTCGATGAGG 
GTTCACGAGA 
TTCCCCACCT 
GGCATTCAGT 
TCAGAACCAG 
AAGATCTTTT 
AGTGTTAATT 
GAAATTGAAG 
AATTTAAGAC 
GTGAAAAAAA 
GATAACCAGT 
CTGATTACCA 
AACAAATACT 
CGTATCACCA 
TGGTTTTTGT 
GTGTACCACT 
TTATATAAAA 
CTCTACTATT 
CTCTGTAAGT 
TAAAATTAAG 



11 
I 

CAATGAAGTT 
ACAGCTCTAC 
TTTATGGCCT 
TGAAGGAAAA 
ACACATCTAC 
TCAGGGAAAT 
ATTACACACC 
TATGGAGTAA 
TGGTGGTTTT 
TCCTAGCCCA 
ACGAATTCTG 
TTGGCCATTC 
ACAAATATGT 
CCCTGTATGG 
CTCTCTGTGA 
TCTTCAAAGA 
TAATTTCTTC 
CCAGAAATCA 
CAGAGCCAAA 
TTGATGCAGC 
ATTGGAGGTA 
AGAACTTCCA 
ACTATTTCTT 
AAACACTGAA 
TAGTTCACTT 
ACTTAGAGAT 
TACATAATAT 
AAGTTTGAAA 
TGCTTCCTAA 
TATATATATT 



21 

! 

TCTTCTAATA 
AAGCCTGGAA 
TGAGATAAAC 
AAT CCAAGAA 
CCTGGAGATG 
GCCAGGGGGG 
TGACATGAAC 
TGTTACCCCC 
TGCCCGTGGA 
TGCTTTTGGA 
GACTACACAT 
CTTAGGTCTT 
TGACATCAAC 
AGACCCAAAA 
CCCCAATTTG 
CAGGTTCTTC 
CTTATGGCCA 
AGTTTTTCTT 
TT AT CCCAAG 
TGTTTTTAAC 
TGATGAAAGG 
AGGAATCGGG 
CCAAGGATCT 
AAGCAATAGC 
CAGCTTAATA 
ATGTATCATA 
TTTTCAATTT 
ATAGTTACCT 
CATCCTTGGA 
TTGGCTCAAA 



31 

1 

CTGCTCCTGC 
AAAAATAATG 
AAACTTCCAG 
ATGCAGCACT 
ATGCACGCAC 
CCCGTATGGA 
CGTGAGGATG 
TTGAAATTCA 
GOT CATGGAG 
CCTGGATCTG 
TCAGGAGGCA 
GGCCATTCTA 
ACATTTCGCC 
GAGAACCAAC 
AGTTTTGATG 
TGGCTGAAGG 
ACCTTGCCAT 
TTTAAAGATG 
AGCATACATT 
CCACGTTTTT 
AGACAGATGA 
CCTAAAATTG 
AACCAATTTG 
TGGTTTGGTT 
AGTATTTATT 
AAAATAAAAT 
TGAAAACTCT 
TCAAAGCAAG 
CTGAGAAATT 
TAAAATTG 



41 

1 

AGGCCACTGC 
TGCTATTTGG 
TGACAAAAAT 
TCTTGGGTCT 
CTCGATGTGG 
GGAAACATTA 
TTGACTACGC 
GCAAGATTAA 
ACTTCCATGC 
GCATTGGAGG 
CAAACTTGTT 
GTGATCCAAA 
TCTCTGCTGA 
GCTTGCCAAA 
CTGTCACTAC 
TTTCTGAGAG 
CTGGCATTGA 
ACAAATACTG 
CTTTTGGTTT 
ATAGGACCTA 
TGGACCCTGG 
ATGCAGTCTT 
AATATGACTT 
GTTGAAAATG 
GCATATTTGC 
CTGTAAACCA 
AATTGTCCAT 
ATAATTCTAT 
ATACTTACTT 



51 
I 

TTCTGGAGCT 
TGAAAGATAC 
GAAATATAGT 
GAAAGTGACC 
AGT CCCCGAT 
TATCACCTAC 
AATCCGGAAA 
CACAGGCATG 
TTTTGATGGC 
GGATGCACAT 
CCTCACTGCT 
GGCCGTAATG 
TGACATACGT 
TCCTGACAAT 
CGTGGGAAAT 
ACCAAAGACC 
AGCTGCTTAT 
GTTAATTAGC 
TCCTAACTTT 
CTTCTTTGTA 
TTATCCCAAA 
CTACTCTAAA 
CCTACTCCAA 
GTGTAATTAA 
TATGTCCTCA 
TAGGTAATGA 
TCTTGCTTGA 
TTGAAGCATG 
CTGGCATAAC 



Seq ID NO: 28 Protein sequence: 
Protein Accession #: Eos sequence 



1 
I 

MKFLLILLLQ 
KEKIQEMQHF 
YTPDMNREDV 
LAHAFGPGSG 
KYVDINTFRL 
FKDRFFWIiKV 
EPNYPKSIHS 
NFQGIGPKID 



11 
I 

ATASGALPLN 
LGLKVTGQLD 
DYAIRKAFQV 
IGGDAHFDED 
SADDIRGIQS 
SERPKTSVNL 
FGFPNFVKKI 
AVFYSKNKYY 



21 

1 

SSTSLEKNNV 
TSTLEMMHAP 
WSNVTPLKFS 
EFWTTHSGGT 
LYGDPKENQR 
ISSLWPTLPS 
DAAVFNPRFY 
YFFQGSNQFE 



31 

I 

LFGERYLEKF 
RCGVPDVHHF 
KINTGMAD I L 
NLFLTAVHEI 
LPNPDNSEPA 
GIEAAYEIEA 
RTYFFVDNQY 
YDFLLQRITK 



41 
I 

YGLEINKLPV 
REMPGGPVWR 
WFARGAHGD 
GHSLGLGHSS 
LCDPNLSFDA 
RNQVFLFKDD 
WRYDERRQMM 
TLKSNSWFGC 



51 
i 

TKMKYSGNLM 
KHYITYRINN 
FHAFDGKGGI 
DPKAVMFPTY 
VTTVGNKIFF 
KYWLISNLRP 
DPGYPKLITK 



Seq ID NO: 29 DNA sequence 

Nucleic Acid Accession #: NM_0 0 6115.1 

Coding sequence: 236.. 1765 



GCTTCAGGGT 
CGGGACACCC 
ACTCTCTGAG 
GAGACCTAGA 
ACGAAGGCGT 
CCCACGGAGA 
TGCCGCCCTG 
CGGGAGACAC 
TCTGGGAGTG 
TGGACTTGAT 
GGATT TACGG 
TCTGTACTCA 
TGGTTTGAGC 
CCTCAAGGAA 
GAAAAATGTA 
TAT C AAGATG 
TACCTGGAAG 
GCGTAGACTC 
GCAGTATATC 
TGTGGACTCT 
CCCCTTGGAA 
GTCCCAGAGT 
CGATGTAAGT 
CCTGGTCTTT 
GAGCCACTGC 



11 

I 

ACAGCTCCCC 
CACCCGCTTC 
GAAAAACCAT 
AATCCAAGCG 
TTGTGGGGTT 
CTTGTGGAGC 
GAGTTGCTGC 
AGCCAGACCC 
CTGATGAAGG 
GTGCTCCTTG 
AAGAACTCTC 
TTTCCAGAGC 
ACAGAGGCAG 
GGTGCCTGTG 
CTACGCCTGT 
ATCCTGAAAA 
CTACCCACCT 
CTCCTCTCCC 
GCCCAGTTCA 
TTATTTTTCC 
ACCCTCTCAA 
CCCAGCGTCA 
CCCGAGCCCC 
GATGAGTGTG 
TCCCAGCTTA 



21 
I 

CGCAGCCAGA 
CCAGGCGTGA 
TTTGATTATT 
TTGGAGGTCC 
CC ATT CAG AG 
TGGCAGGGCA 
CCAGGGAGCT 
TGAAGGCAAT 
GACAACATCT 
CGCAGGAGGT 
AT CAGGACTT 
CAGAAGCAGC 
AGCAGCCCTT 
ATGAATTGTT 
GCTGTAAGAA 
TGGTGCAGCT 
TGGCGAAATT 
ACATCCATGC 
CCTCTCAGTT 
TTAGAGGCCG 
TAACTAACTG 
GTCAGCTAAG 
TCCAAGCTCT 
GGAT CACGGA 
CAACCTTAAG 



31 
I 

AGCCGGGCCT 
CCTGTCAACA 
ACTCTCAGAC 
TGAGGCCAGC 
CCGATACATC 
GAGCCTGCTG 
CTTCCCGCCA 
GGTGCAGGCC 
TCACCTGGAG 
TCGCCCCAGG 
CTGGACTGTA 
TCAGCCCATG 
CATTCCAGTA 
CTCCTACCTC 
GCTGAAGATT 
GGACTCTATT 
TTCTCCTTAC 
ATCTTCCTAC 
CCTCAGTCTG 
CCTGGATCAG 
CCGGCTTTCG 
TGTCCTGAGT 
GCTGGAGAGA 
TGAT CAGCTC 
CTTCTACGGG 



41 
I 

GCAGCCCCTC 
GCAACTTCGC 
GTGCGTGGCA 
CTAAGTCGCT 
AGCATGAGTG 
AAGGATGAGG 
CTCTTCATGG 
TGGCCCTTCA 
ACGTTCAAAG 
AGGTGGAAAC 
TGGTCTGGAA 
ACAAAGAAGC 
GAGGTGCTCG 
AT TGAGAAAG 
TTTGCAATGC 
GAAGATTTGG 
CTGGGCCAGA 
ATTTCCCCGG 
CAGTGCCTGC 
TTGCTCAGGC 
GAAGGGGATG 
CTAAGTGGGG 
GCCTCTGCCA 
CTTGCCCTCC 
AATTCCATCT 



51 

I 

AGCACCGCTC 
GGTGTGGTGA 
ACAAGTGACT 
TCAAAATGGA 
TGTGGACAAG 
CCCTGGCCAT 
CAGCCTTTGA 
CCTGCCTCCC 
CTGTGCTTGA 
TTCAAGTGCT 
ACAGGGCCAG 
GAAAAGTAGA 
TAGACCTGTT 
TGAAGCGAAA 
CCATGCAGGA 
AAGTGACTTG 
TGATTAATCT 
AGAAGGAAGA 
AGGCTCTCTA 
ACGTGATGAA 
TGATGCATCT 
TCATGCTGAC 
CCCTCCAGGA 
TGCCTTCCCT 
CCATATCTGC 



60 
12 0 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



198 



10 



WO 02/086443 

CTTGCAGAGT CTCCTGCAGC ACCTCATCGG GCTGAGCAAT CTGACCCACG TGCTGTATCC 1560 

TGTCCCCCTG GAGAGTTATG AGGACATCCA TGGTACCCTC CACCTGGAGA GGCTTGCCTA 1620 

TCTGCATGCC AGGCTCAGGG AGTTGCTGTG TGAGTTGGGG CGGCCCAGCA TGGTCTGGCT 1680 

TAGTGCCAAC CCCTGTCCTC ACTGTGGGGA CAGAACCTTC TATGACCCGG AGCCCATCCT 1740 

GTGCCCCTGT TTCATGCCTA ACTAGCTGGG TGCACATATC AAATGCTTCA TTCTGCATAC 1800 

TTGGACACTA AAGCCAGGAT GTGCATGCAT CTTGAAGCAA CAAAGCAGCC ACAGTTTCAG 1860 

ACAAATGTTC AGTGTGAGTG AGGAAAACAT GTT CAGTGAG GAAAAAACAT TCAGACAAAT 1920 

GTTCAGTGAG GAAAAAAAGG GGAAGTTGGG GATAGGCAGA TGTTGACTTG AGGAGTTAAT 1980 

GTGATCTTTG GGGAGATACA TCTTATAGAG TTAGAAATAG AATCTGAATT TCTAAAGGGA 2040 

GATTCTGGCT TGGGAAGTAC ATGTAGGAGT TAATCCCTGT GTAGACTGTT GTAAAGAAAC 2100 
TGTTGAAAAT AAAGAGAAGC AATGTGAAGC AAAAAAAAAA AAAAAAAA 
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15 
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30 



35 
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50 



55 



60 



65 



70 



75 



80 



85 



Seq ID NO: 3 0 Protein sequence: 
Protein Accession #: NP 006106.1 



GCTTCAGGGT 
CGGGACACCC 
ACTCTCTGAG 
GAGACCTAGA 
ACGAAGGCGT 
CCCACGGAGA 
TGCCGCCCTG 
CGGGAGACAC 
TCTGGGAGTG 
TGGACTTGAT 
GGATTTACGG 
TCTGTACTCA 
TGGTTTGAGC 
CCT CAAGGAA 
GAAAAATGTA 
TAT CAAGATG 
TACCTGGAAG 
GCGTAGACTC 
GCAGTATATC 
TGTGGACTCT 
CCCCTTGGAA 
GTCCCAGAGT 
CGATGTAAGT 
CCTGGTCTTT 
GAGCCACTGC 
CTTGCAGAGT 
TGTCCCCCTG 
TCTGCATGCC 
TAGTGCCAAC 
GTGCCCCTGT 
TTGGACACTA 
ACAAATGTTC 
GTTCAGTGAG 
GTGATCTTTG 
GATTCTGGCT 
TGTTGAAAAT 



11 
I 

ACAGCTCCCC 
CACCCGCTTC 
GAAAAACCAT 
AATCCAAGCG 
TTGTGGGGTT 
CTTGTGGAGC 
GAGTTGCTGC 
AGCCAGACCC 
CTGATGAAGG 
GTGCTCCTTG 
AAGAACTCTC 
TTTCCAGAGC 
ACAGAGGCAG 
GGTGCCTGTG 
CTACGCCTGT 
ATCCTGAAAA 
CTACCCACCT 
CTCCTCTCCC 
GCCCAGTTCA 
TTATTTTTCC 
ACCCTCTCAA 
CCCAGCGTCA 
CCCGAGCCCC 
GATGAGTGTG 
TCCCAGCTTA 
CTCCTGCAGC 
GAGAGTTATG 
AGGCTCAGGG 
CCCTGTCCTC 
TTCATGCCTA 
AAGCCAGGAT 
AGTGTGAGTG 
GAAAAAAAGG 
GGGAGATACA 
TGGGAAGTAC 
AAAGAGAAGC 



21 
I 

CGCAGCCAGA 
CCAGGCGTGA 
TTTGATTATT 
TTGGAGGTCC 
CCATT GAG AG 
TGGCAGGGCA 
CCAGGGAGCT 
TGAAGGCAAT 
GACAACATCT 
CCCAGGAGGT 
AT CAGGACTT 
CAGAAGCAGC 
AGCAGCCCTT 
ATGAATTGTT 
GCTGTAAGAA 
TGGTGCAGCT 
TGGCGAAATT 
ACATCCATGC 
CCTCTCAGTT 
TTAGAGGCCG 
TAACTAACTG 
GTCAGCTAAG 
TCCAAGCTCT 
GGAT CACGGA 
CAACCTTAAG 
ACCTCATCGG 
AGGACATCCA 
AGTTGCTGTG 
ACTGTGGGGA 
ACTAGCTGGG 
GTGCATGCAT 
AGGAAAACAT 
GGAAGTTGGG 
TCTTATAGAG 
ATGTAGGAGT 
AATGTGAAGC 



31 
I 

AGCCGGGCCT 
CCTGTCAACA 
ACTCTCAGAC 
TGAGGCCAGC 
CCGATACATC 
GAGCCTGCTG 
CTTCCCGCCA 
GGTGCAGGCC 
TCACCTGGAG 
TCGCCCCAGG 
CTGGACTGTA 
TCAGCCCATG 
CATTCCAGTA 
CTCCTACCTC 
GCTGAAGATT 
GGACT CTATT 
TTCTCCTTAC 
ATCTTCCTAC 
CCTCAGTCTG 
CCTGGATCAG 
CCGGCTTTCG 
TGTCCTGAGT 
G CTGG AG AG A 
TGATCAGCTC 
CTTCTACGGG 
GCTGAGCAAT 
TGGTACCCTC 
TGAGTTGGGG 
CAGAACCTTC 
TGCACATATC 
CTTGAAGCAA 
GTTCAGTGAG 
GATAGGCAGA 
TTAGAAATAG 
TAATCCCTGT 
AAAAAAAAAA 



41 
I 

GCAGCGCCTC 
GCAACTTCGC 
GTGCGTGGCA 
CTAAGTCGCT 
AGCATGAGTG 
AAGGATGAGG 
CTCTTCATGG 
TGGCCCTTCA 
ACCTT CAAAG 
AGGTGGAAAC 
TGGTCTGGAA 
ACAAAGAAGC 
GAGGTGCTCG 
ATTGAGAAAG 
TTTGCAATGC 
GAAGATTTGG 
CTGGGCCAGA 
ATTTCCCCGG 
CAGTGCCTGC 
TTGCTCAGGC 
GAAGGGGATG 
CTAAGTGGGG 
GCCTCTGCCA 
CTTGCCCTCC 
AATTCCATCT 
CTGACCCACG 
CACCTGGAGA 
CGGCCCAGCA 
TATGACCCGG 
AAATGCTTCA 
CAAAGCAGCC 
GAAAAAACAT 
TGTTGACTTG 
AATCTGAATT 
GTAGACTGTT 
AAAAAAAA 



51 
I 

AGCACCGCTC 
GGTGTGGTGA 
ACAAGTGACT 
TCAAAATGGA 
TGTGGACAAG 
CCCTGGCCAT 
CAGCCTTTGA 
CCTGCCTCCC 
CTGTGCTTGA 
TTCAAGTGCT 
ACAGGGCCAG 
GAAAAGTAGA 
TAGACCTGTT 
TGAAGCGAAA 
CCATGCAGGA 
AAGTGACTTG 
TGATTAATCT 
AGAAGGAAGA 
AGGCTCTCTA 
ACGTGATGAA 
TGATGCATCT 
TCATGCTGAC 
CCCTCCAGGA 
TGCCTTCCCT 
CCATATCTGC 
TGCTGTATCC 
GGCTTGCCTA 
TGGTCTGGCT 
AGCCCATCCT 
TTCTGCATAC 
ACAGTTTCAG 
TCAGACAAAT 
AGGAGTTAAT 
TCTAAAGGGA 
GTAAAGAAAC 



Seq ID NO: 31 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 64-2754 



1 
I 

GGCAGGTCTC 
CCGATGGCCG 
CTGACCCTCG 
CCTTCTAAAC 
TCTGCAGACC 
TACACAGCCA 
GACAAAAGGA 
TCGAAGACAA 
ATTCCTTGCT 
GAATCTGATG 
AAAGAACCTT 
CCTGTGGATC 
GGATATTCAG 
CACCCTGTTT 
ACTACAGTGG 
CTGAAATACA 
AGCACAGGCG 
TCATTGATAA 
ACTTGT AT CA 
TATGAAGCAT 
GATAAGGATT 
GAAAATGGAC 
GTAAAGCCAC 
GAAGCGCCAT 
GTTCATGTGA 
ATTAAAGAAA 



11 
I 

GCTCTCGGCA 
CCGCTGGGCC 
TGAT CTTCAG 
TAGAGGCAGA 
TCATCCGGTC 
GGGCTGTTGC 
AACAGACACA 
GACACACTAG 
CTATGCAAGA 
CAGCACAGAA 
TAAATTTGTT 
GTGAAGAATA 
CAGATCTGCC 
TCACAGAAGC 
GGGTGGTTTG 
GCATTTTGCA 
TAAT CACCAC 
TGAAAGTACA 
TAACAGTAAC 
TTGTAGAGGA 
TAATTAACAC 
ATTTCAAAAT 
TGAATTATGA 
TTGCTAGAGA 
GGGATCTGGA 
ACTTAGCAGT 



21 
I 

CCCTCCCGGC 
CCGGCGCTCC 
TCGTGATGGT 
CAAAATAATT 
AAGTGATCCT 
GCTGTCTGAT 
GAAAGAGGTT 
AGAAACTGTT 
GAATTCCTTG 
CTATACTGTC 
TTATATAGAA 
TGATGTTTTT 
CCTCCCACTA 
AATTTATAAT 
TGCCACAGAC 
GCAGACACCA 
AGTCTCTCAT 
AGACATGGAT 
AGATTCAAAT 
AAATGCATTC 
TGCCAATTGG 
CAGCACAGAC 
AGAAAACCGT 
TATTCCCAGA 
TGAGGGGCCT 
GGGGTCAAAG 



31 

I 

GCCCGCGTTC 
GTGCGCGGAG 
GAAGCCTGCA 
GGCAGAGTTA 
GATTTCAGAG 
AAGAAAAGAT 
ACTGTGCTGC 
CTCAGGCGTG 
GGCCCTTTCC 
TTCTACTCAA 
AGAGACACTG 
GATTTGATTG 
CCCAT CAGGG 
TTTGAAGTTT 
AGAGATGAAC 
AGGTCACCTG 
TATTTGGACA 
GGCCAGTTTT 
GATAATGCAC 
AATGTGGAAA 
AGAGTCAATT 
AAAGAAACTA 
CAAGTGAACC 
GTGACAGCCT 
GAATGCACTC 
AT CAACGGCT 



41 

] 

TCCTGGCCCT 
CCGTCTGCCT 
AAAAGGTGAT 
ATTTGGAAGA 
TTCTAAATGA 
CATTTACCAT 
TAGAACATCA 
CCAAGAGGAG 
CATTGTTTCT 
TAAGTGGACG 
GAAAT CTATT 
CTTATGCGTC 
TAGAGGATGA 
TGGAAAGTAG 
CGGACACAAT 
GGCTCTTTTC 
GAGAGGTTGT 
TTGGATTGAT 
CCACTTTCAG 
TCTTACGAAT 
TTACCATTTT 
ATGAAGGTGT 
TGGAAATTGG 
TGAACAGAGC 
CTGCAGCCCA 
ATAAGGCATA 



51 
I 

GCCCGGCATC 
GCATCTGCTG 
ACTTAATGTA 
GTGCTTCAGG 
TGGGTCAGTG 
ATGGCTTTCT 
GAAGAAGGTA 
ATGGGCACCT 
TCAACAAGTT 
TGGAGTTGAT 
TTGCACTCGG 
AACTGCAGAT 
AAATGACAAC 
TAGACCTGGT 
GCATACGCGC 
TGTGCATCCC 
AGACAAGTAC 
AGGCACAT CA 
ACAAAATGCT 
ACCTATAGAA 
AAAGGGAAAT 
TCTTTCTGTT 
AGTAAACAAT 
CTTGGTTACA 
ATATGTG CGG 
TGACCCCGAA 



60 
120 
180 
240 
300 
360 
420' 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 



60 
12 0 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



199 



WO 02/086443 

AATAGAAATG GCAATGGTTT AAGGTACAAA 
ATTGATGAAA TTTCAGGGTC AATCATAACT 
CCCAAAAATG AGTTGTATAA TATTACAGTC 
ACTGGAACAC TTGCTGTGAA CATTGAAGAT 
5 GAATATGTAG TCATTTGCAA ACCAAAAATG 

GATGAACCTG TCCATGGAGC TCCATTTTAT 
AGTAGACTGT GGAGCCTCAC CAAAGTTAAT 
AATGCTGGAT TTCAAGAATA TACCATTCCT 
GCAACAAAAT TATTGAGAGT TAATCTGTGT 

10 ACTT CAAGGA GTACAGGAGT AATACTTGGA 

ATAGCACTGC TCTTTTCTGT ATTGCTAACT 
GGGAAACGTT TTCCTGAAGA TTTAGCACAG 
CCTGGAGACG ATAGAGTGTG CTCTGCCAAT 
AGCCAAGGTT TTTGTGGTAC TATGGGATCA 

15 GAAATGATGA AAGGAGGAAA CCAGACCTTG 

ACCCTGGACT CCTGCAGGGG AGGACACACG 
GAGTGGCACA GTTTTACTCA ACCCCGTCTC 
GAAGACCGCA TGCCATCCCA AGATTATGTC 
CCAGCTGGTT CTGTGGGCTG CTGCAGTGAA 

20 AATAATTTGG AACCCAAATT TATTACATTA 

AGTGCTACAA TTAGGTCTTT GTCAGACATT 
TTCAATTTCA ACATGTATGT ATATGATGAT 
CCAATTTATA TTTTTAAAGC CAGTTGTTGC 
AACAGACAAC TGGTAAATCT CAAACTCCAG 

25 TCTTTTTTTT TTTTACGGAT ATTTTAGTAA 

ATAGGTAAGT TATGCTAATA TCACATTATT 
ATAAACAAGA AATATTGAGT AT CACTATGT 
ACTGAATTAA ATTAAAAATG TTGCAGCTCA 
ACCAAATTCA TTTGACTTTG GAGGCAAAAT 

30 TCTATAGGAA TATAGTTGGA AATAAATGTG 

ATTTAAAATG AAATGAGAAC AAAGAGGAAA 
TAGTTTGTCC TACAATAGAA AAAAGAGAGA 
ATTATAACTG AGTCTATGAG GAAATAGTTC 
GTAAATAAAT TAAACTTTTC TGGTTTCTGT 

35 TAGCTTTGCT TTGCAGTCTG T TT CAAGATT 

GAAT ACT CGC TGCAGCTGGG GTTCCCTGCT 
TTTTTTTCGG GGAGCTAATA ACAAAAACAT 
CTCTATTGCT GTTTCTATTC TCT CTTATAG 
TAACCATGTC CTCCTAGAGT TTAGAGGCTA 

40 GCACCCTGGG GAGATTGATT GTCCTTAAAC 

GTCTGGGAGC TACAAAATTT CATTTTTCTC 
CTGAATCAAG GAAAGCCAGG CCTTGTGGGC 
ACCTCCAGCA GAGATTCCCT TAAGTGACTC 
AATTTTTAAT CAGTTTGCTT TCTCCAGAGA 

45 TTGAATGTAT AAAAGAAAAA GATCAAGTTG 

AAGCAGCCCA AGTAGGTTAT TTGTACAGTC 
GGGCAAGGAG AGGCCACAAG GAATATGGGT 
CTTTTTCCTA GGCTTGGCAC TGCCTTTTCC 
GTCCGGTGAG GGAT CAGCC A ACCTCTTCTC 

50 AAGGAGACAG AGCTGACTGC ATGATGAGTC 

GTTGTGCAGA ACAAACAAGG CATT CATGGG 
CTGGGCACTA AGAAGGTCTA TGAATTAAAT 
TTTTCTGTTT TCTAATTTGA CCCTAAAATC 
CCCCCCCCCC TTTTTTTTTG AGACGGAGTC 

55 GCTCCGATCT CTGCTCACTG AAAGCTCCGC 

GCCTCCTGAG TAGCTGGGAC TACAGGCGCC 
TTTAATAGAG ACGGGGTTTC ACTGTGTTAG 
ATCCGCCTGC CTCGGCCTCC CAAAGTGCTG 
CTTGTTTTCC GTTTAAAGTC GTCTTCTTTT 

60 TGAT CATACG AATTGG AT CA AT CTTGAAAT 

GGGAGAAAGA ACTCAGGGCA CAAAATATTG 
TTGCTGAAAT TTCCTGCTGT AACCAGAAGC 
ACTGTGTTTT GCTCACTCCC TCACTCACCG 
CTAGTGCCGA TAAACTTTCT CAAAGAGCAA 

65 TAACCATCTC TTTGTTCTTT GAACATGCTG 

TTTGTAATTC TTTTCTCTCA AATGAAAATT 
CATATGTAGT ATTATTATTT CCTTATATGT 
CAAGAAAATA TATTTTTAAA GCTTTCATTT 
TGTAAATATA CAGAATGTTT TTTCTTACTT 

70 GGGGTTTGTT TTGCAATGTT TTAAACAGAG 

TGCTTTTAAA GAAACTTGGC TGCTTAAAAT 
TACAGATGTG GGGAGATGTA ATAAAACAAT 
TAGAGATTAA ATAATTCTAA GATGATCACT 
AATAGAAATA CTCAATTATG TCTTTGTTGT 

75 ATTAT CAAAT TGTCGACATC ATTAATATAT 

GAAGCACAGC TTTACAGATG AGTATCTATG 
ATTAAAAGTA TTAGAAGGTG GTTATAATTG 
AGGGGTTTTA CTTTGAGGAC CAGTGTAGTC 
GGCAATATTG CAGTCTTGAT TCTGCCACTT 

80 CAAGATGATC CAACCATAAA GGTGCTCTGT 

AGTGTGCTCC CCTACAAACG TTAAGACTGA 
AGCCTTACAT TTTAATATAG GTTGAACCAA 
CATTATTTTT GTGTATGTCT TCAAGAATGT 
ACCGGATACA TTTCACGTGT CCTTCAGTAT 

85 TGAGAAGCAT GGACACTAGA GCCAGAATGC 

TTCTGTGTGA CCTTTGAAAG GCTACTTATT 
GAACAATGCC AGCCTCATGG GGTTGTTGAA 



AAATTGCATG ATCCTAAAGG TTGGATCACC 1620 

TCCAAAATCC TGGATAGGGA GGTTGAAACT 1680 

CTGGCAATAG ACAAAGATGA TAGATCATGT 1740 

GTAAATGATA ATCCACCAGA AATACTTCAA 1800 

GGGTATACCG ACATTTTAGC TGTTGATCCT 1860 

TTCAGTTTGC CCAATACTTC TCCAGAAATC 192 0 

GATACAGCTG CCCGTCTTTC ATATCAGAAA 1980 

ATTACTGTAA AAGACAGGGC CGGCCAAGCT 2 040 

GAATGTACTC ATCCAACTCA GTGTCGTGCG 2100 

AAATGGGCAA TCCTTGCAAT ATTACTGGGT 2160 

TTAGTATGTG GAGTTTTTGG TGCAACTAAA 2220 

CAAAACTTAA TTAT AT C AAA CACAGAAGCA 2280 

GGATTTATGA CCCAAACTAC CAACAACTCT 2340 

GGAATGAAAA ATGGAGGGCA GGAAACCATT 2400 

GAATCCTGCC GGGGGGCTGG GCATCAT CAT 2460 

GAGGTGGACA ACTGCAGATA CACTTACTCG 2520 

GGTGAAAAAT TGCATCGATG TAATCAGAAT 2580 

CTCACTTATA ACTATGAGGG AAGAGGATCT 2640 

AAGCAGGAAG AAGATGGCCT TGACTTTTTA 2700 

GCAGAAGCAT GCACAAAGAG ATAATGTCAC 2760 

CTGGAGGTTT CCAAAAATAA TATTGTAAAG 282 0 

TTTTTTCTCA ATTTTGAATT ATGCTACTCA 2880 

TTATCTTTTC CAAAAAGTGA AAAATGTTAA 2940 

CACTGGAATT AAGGTCTCTA AAGC AT CTGC 3 000 

TAAATATGCT GGATAAATAT TAGTCCAACA 3 060 

ATGTATTCAC TTTAAGTGAT AGTTTAAAAA 3120 

GAAGAAAGTT TTGGAAAAGA AACAATGAAG 3180 

TAAAGAATTG GGACTCACCC CTACTGCACT 3240 

GTGTTGAAGT GCCCTATGAA GTAGCAATTT 3300 

TGTGTGTATA TTATTATTAA TCAATGCAAT 3 360 

ATGGTAAAAA CTTGAAATGA GGCTGGGGTA 3420 

GCTTCCTAGG CCTGGGCTCT TAAATGCTGC 3480 

CTGTCCAATT TGTGTAATTT GTTTAAAATT 3540 

GGGAAGGAAA TAGGGAATCC AATGGAACAG 3600 

TCTGCATCCA CAAGTTAGTA GCAAACTGGG 3 660 

TTTTGGTAGC AAGGGT CCAG AGATGAGGTG 3720 

TTTAAAACTT ACCTTTACTG AAGTTAAATC 3780 

TGACCAACAT CTTTTTAATT TAGATCCAAA 3 840 

GAGGGAGCTG AGGGGAGGAT CTTACTGAAA 3900 

CTAAGCCCCA CAAACTTGAC ACCTGATCAG 3 960 

CTCACTGCCC TTCTTCTGAG TGGCATTGGC 4020 

CCCCTTCTTT CGGCTTTCTG CTAAAGCAAC 4080 

CAGGTTTTCC ACCATCCTTC AGCGTGAATT 4140 

AATTTTAAAA TAATAGAAGA AATAGAAATT 42 00 

TCATTTTAGA ACAGAGGGAA CTTTGGGAGA 42 60 

AGAGGGCAAC AGGAAGATGC AGGCCTTCAA 432 0 

GGGAGTAAAA GCAACATCGT CTGCTTCATA 43 80 

TTTCTCAGGC CAATGGCAAC TGCCATTTGA 4440 

TATGGCTCAC CTTATTTGGA GTGAGAAATC 4500 

TGAAGGCATT TGCAGGATGA GCCTGAACTG 4560 

AATTGTTGTA TTCCTTCTGC AGCCCTCCTT 4620 

GCCTATCTAA AATTCTGATT TATTCCTACA 4680 

TATGTGTTTT AGACTTAGAC TTTTTATTGC 4740 

TCGCTCTGAC GCACAGGCTG GAGTGCAGTG 4800 

CTCCCGGGTT CATGCCATTC TCCTGCCTCA 4860 

CACCACCACG CCCGGCTAAT TTTTTGTATT 4920 

CCAGGATGGT CTCGATCTCC TGACCTCGTG 4980 

GGATTACAGG CATGACCCAC CGCTCCCGGC 5040 

AATGTAATCA TTTTGAACAT GTGTGAAAGT 5100 

ACT C AACCAA AAGACAGTCG AGAAGCCAGG 5160 

GTCTGAGAAT GGAATTCTCT GTAAGCCTAG 5220 

CAGTTTTATC TAACGGCTAC TGAAACACCC 52 80 

ATCAAAACCT GCTACCTCCC CAAGACTTTA 5340 

CCAGTATCAC TTCCCTGTTT ATAAAACCTC 5400 

AAAACCACCT GGTCTGCATG TATGCCCGAA 5460 

TAATTTTAGG GATTCATTTC TATATTTTCA 5520 

GTAAGGTGAA ATTTATGGTA TTTGAGTGTG 5580 

TTCCCCCAGT GAATGATTTA GAATTTTTTA 5640 

TTATAAGGAA GCAGCTGTCT AAAATGCAGT 5700 

TTTTAGTATT GCTATTAAAA GAAGTTACTT 5760 

AAGCAAAAAT TGGATGCATA AAGTAATATT 582 0 

ATTAACTTGG TTTCTTGTTT TTGCTGTATT 5880 

TTGCAAAATT ATGCTTATGG CTGGCATGGA 5940 

ATTAATGGGG AATATTTTGG ACAATGTTTC 6000 

ATTGTAATGT TGGGAAGAGA TCACTATTTT 6060 

ATACATATGT ATAATAAATT TTGAT CGGGT 6120 

CAGAGTATTC CATGAATAGT ACACTGACAC 6180 

AAGGGAAAAC ATGAGTTAAA AAGAAAAGCA 6240 

ACAGGATAGA TAATGCCTGA ACTTTAATGA 6300 

GCTTCACAGT GAATCTTTTC CCCATGCAGG 6360 

TCATTTCAAA AATCTATTAG CTATAT CAAA 6420 

AATTTCAATT CCAGTAACTT CTATTGTAAC 6480 

TCATTGGATT TTTGTTTGTA ATAGTAAAAT 6540 

TGATTTGGTT GAATATTGGG TCATAATGGT 6600 

TTGGATATGA ATCCTGGATC TGTCACTTAC 6660 

TCCTCTCTTA GCTTTCTCAT TAAAATCAAT 6720 

TGATTAAATT AGTTAATATA CCTAAAGTAC 6780 



PCT/US02/12476 



200 



WO 02/086443 

ATAGAACACT GCCTGCACAT AGTAAAAGAA T T AT AAGTGT GAGGTAGTTG GTAAAATTAT 
GTAGTTGGAT ATACTACCGA ACAATATCTA ATCTCTTTTT AGGGAAATAA AGTTTGTGCA 
TATATATAAT CCCGAAACAT G 



6840 
6900 
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Seq ID NO: 32 Protein sequence: 
Protein Accession #: NP_001932.1 



10 



15 



20 



25 



MAAAGPRRSV 
ADLIRSSDPD 
KTRHTRETVL 
EPLNLFYIER 
PVFTEAIYNF 
TGVITTVSHY 
EAFVEENAFN 
KPLNYEENRQ 
KENLAVGSKI 
KNELYNITVL 
EPVHGAPFYF 
TKLLRVNLCE 
KRFPEDLAQQ 
MMKGGNQTLE 
DRMPSQDYVL 



11 
I 

RGAVCLHLLL 
FRVLNDGSVY 
RRAKRRWAPI 
DTGNLFCTRP 
EVLESSRPGT 
LDREWDKYS 
VEILRIPIED 
VNLEIGVNNE 
NGYKAYDPEN 
AIDKDDRSCT 
SLPNTSPEIS 
CTHPTQCRAT 
NLIISNTEAP 
SCRGAGHHHT 
TYNYEGRGSP 



21 
I 

TLVIFSRDGE 
T ARAVAL SDK 
PCSMQENSLG 
VDREEYDVFD 
TVGWCATDR 
LIMKVQDMDG 
KDLINTANWR 
APFARDIPRV 
RNGNGLRYKK 
GTLAVNIEDV 
RLWSLTKVND 
SRSTGVILGK 
GDDRVCSANG 
LDSCRGGHTE 
AGSVGCCSEK 



31 

I 

ACKKVILNVP 
KRSFTIWLSD 
PFPLFLQQVE 
L I AYASTADG 
DEPDTMHTRL 
QFFGLIGTST 
VNFTILKGNE 
TALNRALVTV 
LHDPKGWITI 
NDNPPEILQE 
TAARLSYQKN 
WAILAILLGI 
FMTQTTNNSS 
VDNCRYTYSE 
QEEDGLDFLN 



41 

I 

SKLEADKIIG 
KRKQTQKEVT 
SDAAQNYTVF 
YSADLPLPLP 
KYSILQQTPR 
CIITVTDSND 
NGHFKI STDK 
HVRDLDEGPE 
DEISGSIITS 
YWICKPKMG 
AGFQEYTIPI 
ALLFSVLLTL 
QGFCGTMGSG 
WHSFTQPRLG 
NLEPKFITLA 



51 

1 

RVNLEECFRS 
VLLiEHQKKVS 
YS I SGRGVDK 
IRVEDENDNH 
SPGLFSVHPS 
NAPT FRQNAY 
ETNEGVLSW 
CTPAAQYVRI 
KILDREVETP 
YTDILAVDPD 
TVKDRAGQAA 
VCGVFGATKG 
MKNGGQETIE 
EKLHRCNONE 
EACTKR 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



80 



85 



Seq ID NO: 33 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 64-2583 



GGCAGGT CTC 
CCGATGGCCG 
CTGACCCTCG 
CCTTCTAAAC 
TCTGCAGACC 
TACACAG CCA 
G ACAAAAG G A 
TCGAAGACAA 
ATTCCTTGCT 
GAAT CTGATG 
AAAGAACCTT 
CCTGTGGATC 
GGATATTCAG 
CACCCTGTTT 
ACT AC AGT GG 
CTGAAATACA 
AGCACAGGCG 
TCATTGATAA 
ACTTGTATCA 
TATGAAGCAT 
GATAAGGATT 
G AAAAT GG AC 
GTAAAGCCAC 
GAAGCGCCAT 
GTTCATGTGA 
ATTAAAGAAA 
AATAGAAATG 
ATTGATGAAA 
CCCAAAAATG 
ACTGGAACAC 
GAATATGTAG 
GATGAACCTG 
AGTAGACTGT 
AATGCTGGAT 
GCAACAAAAT 
ACTTCAAGGA 
ATAGCACTGC 
GGGAAACGTT 
CCTGGAGACG 
AGCCAAGGTT 
GAAATGATGA 
ACCCTGGACT 
GAGTGGCACA 
TAAAAATTAA 
CCAAGATTAT 
CTGCTGCAGT 
ATTTATTACA 
TTTGTCAGAC 
TGTATATGAT 
AGCCAGTTGT 
TCTCAAACTC 
GATATTTTAG 
ATATCACATT 
AGTATCACTA 
ATGTTGCAGC 



11 
I 

GCTCTCGGCA 
CCGCTGGGCC 
TGATCTTCAG 
TAGAGGCAGA 
TCATCCGGTC 
GGGCTGTTGC 
AACAGACACA 
GACACACTAG 
CTATG CAAGA 
CAGCACAGAA 
TAAATTTGTT 
GTGAAGAATA 
CAGATCTGCC 
TCACAGAAGC 
GGGTGGTTTG 
GCATTTTGCA 
TAAT CACCAC 
TGAAAGTACA 
TAACAGTAAC 
TTGTAGAGGA 
TAATTAACAC 
ATTTCAAAAT 
TGAATTATGA 
TTGCTAGAGA 
GGGATCTGGA 
ACTTAGCAGT 
GCAATGGTTT 
TTTCAGGGTC 
AGTTGTATAA 
TTGCTGTGAA 
TCATTTGCAA 
TCCATGGAGC 
GGAGCCTCAC 
TTCAAGAATA 
TATTGAGAGT 
GTACAGGAGT 
TCTTTTCTGT 
TTCCTGAAGA 
ATAGAGTGTG 
TTTGTGGTAC 
AAGGAGGAAA 
CCTGCAGGGG 
GTTTTACTCA 
ACATAAAAGA 
GTCCTCACTT 
GAAAAGCAGG 
TTAGCAGAAG 
ATTCTGGAGG 
GATTTTTTTC 
TGCTTATCTT 
CAGCACTGGA 
TAATAAATAT 
ATTATGTATT 
TGTGAAGAAA 
TCATAAAGAA 



21 
I 

CCCTCCCGGC 
CCGGCGCTCC 
TCGTGATGGT 
CAAAATAATT 
AAGTGAT CCT 
GCTGTCTGAT 
GAAAGAGGTT 
AGAAACTGTT 
GAATTCCTTG 
CTATACTGTC 
TTATATAGAA 
TGATGTTTTT 
CCTCCCACTA 
AATTTATAAT 
TGCCACAGAC 
GCAGACACCA 
AGTCTCT CAT 
AGACATGGAT 
AGATTCAAAT 
AAATGCATTC 
TGCCAATTGG 
CAGCACAGAC 
AGAAAACCGT 
TATTCCCAGA 
TGAGGGGCCT 
GGGGTCAAAG 
AAGGTACAAA 
AATCATAACT 
TATTACAGTC 
CATTGAAGAT 
ACCAAAAATG 
TCCATTTTAT 
CAAAGT TAAT 
TACCATTCCT 
TAATCTGTGT 
AATACTTGGA 
ATTGCTAACT 
TTTAGCACAG 
CTCTGCCAAT 
TATGGGATCA 
CCAGACCTTG 
AGGACACACG 
ACCCCGTCTC 
AATTGCATCG 
ATAACTATGA 
AAGAAGATGG 
CATGCACAAA 
TTTCCAAAAA 
TCAATTTTGA 
TTCCAAAAAG 
ATTAAGGTCT 
GCTGGATAAA 
CACTT TAAGT 
GTTTTGGAAA 
TTGGGACTCA 



31 

I 

GCCCGCGTTC 
GTGCGCGGAG 
GAAGCCTGCA 
GGCAGAGTTA 
GATTTCAGAG 
AAGAAAAGAT 
ACTGTGCTGC 
CTCAGGCGTG 
GGCCCTTTCC 
TTCTACTCAA 
AGAGACACTG 
GATTTGATTG 
CCCATCAGGG 
TTTGAAGTTT 
AGAGATGAAC 
AGGTCACCTG 
TATTTGGACA 
GGCCAGTTTT 
GATAATGCAC 
AATGTGGAAA 
AGAGTCAATT 
AAAGAAACTA 
CAAGTGAACC 
GTGACAGCCT 
GAATGCACTC 
AT CAACGGCT 
AAATTGCATG 
TCCAAAATCC 
CTGGCAATAG 
GTAAATGATA 
GGGTATACCG 
TTCAGTTTGC 
GATACAGCTG 
ATTACTGTAA 
GAATGTACTC 
AAATGGGCAA 
TTAGTATGTG 
CAAAACTTAA 
GGATTTATGA 
GGAATGAAAA 
GAATCCTGCC 
GAGGTGGACA 
GGTGAAGAAT 
ATGTAATCAG 
GGGAAGAGGA 
CCTTGACTTT 
GAGATAATGT 
TAATATTGTA 
ATTATGCTAC 
TGAAAAATGT 
CTAAAGCATC 
TATTAGTCCA 
GATAGTTTAA 
AGAAACAATG 
CCCCTACTGC 



41 

I 

TCCTGGCCCT 
CCGTCTGCCT 
AAAAGGTGAT 
ATTTGGAAGA 
TTCTAAATGA 
CATTTACCAT 
TAGAACATCA 
CCAAGAGGAG 
CATTGTTTCT 
TAAGTGGACG 
GAAATCTATT 
CTTATGCGTC 
TAGAGGATGA 
TGGAAAGTAG 
CGGACACAAT 
GGCTCTTTTC 
GAGAGGTTGT 
TTGGATTGAT 
CCACTTTCAG 
TCTTACGAAT 
TTACCATTTT 
ATGAAGGTGT 
TGGAAATTGG 
TGAACAGAGC 
CTGCAGCCCA 
ATAAGGCATA 
ATCCTAAAGG 
TGGATAGGGA 
ACAAAGATGA 
ATCCACCAGA 
ACATTTTAGC 
CCAATACTTC 
CCCGTCTTTC 
AAGACAGGGC 
AT CCAACTCA 
TCCTTGCAAT 
GAGTTTTTGG 
TTATATCAAA 
CCCAAACTAC 
ATGGAGGGCA 
GGGGGGCTGG 
ACTGCAGATA 
CCATTAGAGG 
AATGAAGACC 
TCTCCAGCTG 
TTAAATAATT 
CACAGTGCTA 
AAGTTCAATT 
TCACCAATTT 
TAAAACAGAC 
TGCTCTTTTT 
ACAATAGCTA 
AAAATAAACA 
AAGACTGAAT 
ACTACCAAAT 



51 

! 

GCCCGGCATC 
GCATCTGCTG 
ACTTAATGTA 
GTGCTTCAGG 
TGGGT CAGTG 
ATGGCTTTCT 
GAAGAAGGTA 
ATGGGCACCT 
TCAACAAGTT 
TGGAGTTGAT 
TTGCACTCGG 
AACTG CAGAT 
AAATGACAAC 
TAGACCTGGT 
GCATACGCGC 
TGTGCATCCC 
AGACAAGTAC 
AGGCACATCA 
ACAAAATGCT 
ACCTATAGAA 
AAAGGGAAAT 
TCTTTCTGTT 
AGTAAACAAT 
CTTGGTTACA 
ATATGTGCGG 
TGACCCCGAA 
TTGG AT CACC 
GGT TGAAACT 
TAGATCATGT 
AATACTTCAA 
TGTTGATCCT 
TCCAGAAATC 
ATATCAGAAA 
CGGCCAAGCT 
GTGTCGTGCG 
ATTACTGGGT 
TGCAACTAAA 
CACAGAAGCA 
CAACAACTCT 
GGAAACCATT 
GCATCAT CAT 
CACTT ACT CG 
ACACACTGGT 
GCATGCCATC 
GTTCTGTGGG 
TGGAACCCAA 
CAATTAGGTC 
TCAACATGTA 
ATATTTTTAA 
AACTGGTAAA 
TTTTTTTACG 
AGTTATGCTA 
AGAAATATTG 
TAAATTAAAA 
TCATTTGACT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 



201 
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TTGGAGGCAA AATGTGTTGA AGTGCCCTAT GAAGTAGCAA TTTTCTATAG GAATATAGTT 3360 

GGAAATAAAT GTGTGTGTGT ATATTATTAT TAATCAATGC AATATTTAAA ATGAAATGAG 3420 

AACAAAGAGG AAAATGGTAA AAACTTGAAA TGAGGCTGGG GTATAGTTTG TCCTACAATA 3480 

GAAAAAAGAG AGAGCTTCCT AGGCCTGGGC TCTTAAATGC TGCATTATAA CTGAGTCTAT 3540 

5 GAGGAAATAG TTCCTGTCCA ATTTGTGTAA TTTGTTTAAA ATTGTAAATA AATTAAACTT 3600 

TTCTGGTTTC TGTGGGAAGG AAATAGGGAA TCCAATGGAA CAGTAGCTTT GCTTTGCAGT 3660 

CTGTTTCAAG ATTTCTGCAT CCACAAGTTA GTAGCAAACT GGGGAATACT CGCTGCAGCT 372 0 

GGGGTTCCCT GCTTTTTGGT AGCAAGGGTC CAGAGATGAG GTGTTTTTTT CGGGGAGCTA 3780 

ATAACAAAAA CATTTTAAAA CTTACCTTTA CTGAAGTTAA ATCCTCTATT GCTGTTTCTA 3840 

10 TTCTCTCTTA TAGTGACCAA CATCTTTTTA AT TTAGATCC AAATAACCAT GTCCTCCTAG 3900 

AGTTTAGAGG CTAGAGGGAG CTGAGGGGAG GATCTTACTG AAAGCACCCT GGGGAGATTG 3960 

ATTGTCCTTA AACCTAAGCC CCACAAACTT GACACCTGAT CAGGTCTGGG AGCTACAAAA 4020 

TTTCATTTTT CTCCTCACTG CCCTTCTTCT GAGTGGCATT GGCCTGAATC AAGGAAAGCC 4080 

AGGCCTTGTG GGCCCCCTTC TTTCGGCTTT CTGCTAAAGC AACACCTCCA GCAGAGATTC 4140 

15 CCTTAAGTGA CTCCAGGTTT TCCACCATCC TTCAGCGTGA ATTAATTTTT AATCAGTTTG 4200 

CTTTCTCCAG AGAAATTTTA AAATAATAGA AGAAATAGAA ATTTTGAATG TATAAAAGAA 4260 

AAAGATCAAG TTGTCATTTT AGAACAGAGG GAACTTTGGG AGAAAGCAGC CCAAGTAGGT 4320 

TATTTGTACA GTCAGAGGGC AACAGGAAGA TGCAGGCCTT CAAGGGCAAG GAGAGGCCAC 4380 

AAGGAATATG GGTGGGAGTA AAAGCAACAT CGTCTGCTTC ATACTTTTTC CTAGGCTTGG 4440 

20 CACTGCCTTT TCCTTTCTCA GGCCAATGGC AACTGCCATT TGAGTCCGGT GAGGGAT CAG 4500 

CCAACCTCTT CTCTATGGCT CACCTTATTT GGAGTGAGAA ATCAAGGAGA CAGAGCTGAC 4560 

TGCATGATGA GTCTGAAGGC ATTTGCAGGA TGAGCCTGAA CTGGTTGTGC AGAACAAACA 4620 

AGGCATTCAT GGGAATTGTT GTATTCCTTC TGCAGCCCTC CTTCTGGGCA CTAAGAAGGT 4680 

CTATGAATTA AATGCCTATC TAAAATTCTG ATTTATTCCT ACATTTTCTG TTTTCTAATT 4740 

25 TGACCCTAAA ATCTATGTGT TTTAGACTTA GACTTTTTAT TGCCCCCCCC CCCTTTTTTT 4800 

TTGAGACGGA GTCTCGCTCT GACGCACAGG CTGGAGTGCA GTGGCTCCGA TCTCTGCTCA 4860 

CTGAAAGCTC CGCCTCCCGG GTTCATGCCA TTCTCCTGCC TCAGCCTCCT GAGTAGCTGG 4920 

GACTACAGGC GCCCACCACC ACGCCCGGCT AATTTTTTGT ATTTTTAATA GAGACGGGGT 4980 

TTCACTGTGT TAG CCAGGAT GGTCTCGATC TCCTGACCTC GTGATCCGCC TGCCTCGGCC 5040 

30 TCCCAAAGTG CTGGGATTAC AGGCATGACC CACCGCTCCC GGCCTTGTTT TCCGTTTAAA 5100 

GTCGTCTTCT TTTAATGTAA TCATTTTGAA CATGTGTGAA AGTTGATCAT ACGAATTGGA 5160 

TCAATCTTGA AATACTCAAC CAAAAGACAG TCGAGAAGCC AGGGGGAGAA AGAACTCAGG 5220 

GCACAAAATA TTGGT CTGAG AATGGAATTC TCTGTAAGCC TAGTTGCTGA AATTTCCTGC 52 80 

TGTAACCAGA AGCCAGTTTT ATCTAACGGC TACTGAAACA CCCACTGTGT TTTGCTCACT 5340 

35 CCCACTCACC GATCAAAACC TGCTACCTCC CCAAGACTTT ACTAGTGCCG ATAAACTTTC 5400 

TCAAAGAGCA ACCAGT AT CA CTTCCCTGTT TATAAAACCT CTAACCATCT CTTTGTTCTT 5460 

TGAACATGCT GAAAACCACC TGGTCTGCAT GTATGCCCGA ATTTGTAATT CTTTTCTCTC 5520 

AAATGAAAAT TTAATTTTAG GGATTCATTT CTATATTTTC ACATATGTAG TATTATTATT 5580 

TCCTTATATG TGTAAGGTGA AATTTATGGT ATTTGAGTGT GCAAGAAAAT ATATTTTTAA 5640 

40 AGCTTTCATT TTTCCCCCAG TGAATGATTT AGAATTTTTT ATGTAAATAT ACAGAATGTT 5700 

TTTTCTTACT TTTATAAGGA AGCAGCTGTC T AAAATG CAG TGGGGTTTGT TTTGCAATGT 5760 

TTTAAACAGA GTTTTAGTAT TGCTATTAAA AGAAGTTACT TTGCTTTTAA AGAAACTTGG 5820 

CTGCTTAAAA TAAGCAAAAA TTGGATGCAT AAAGTAATAT TTACAGATGT GGGGAGATGT 5880 

AATAAAACAA TATTAACTTG GCTGCTTAAA ATAAGCAAAA ATTGGATGCA TAAAGTAATA 5940 

45 TTTACAGATG TGGGGAGATG TAATAAAACA ATATTAACTT GGTTTCTTGT TTTTGCTGTA 6000 

TTTAGAGATT AAATAATTCT AAGATGATCA CTTTGCAAAA TTATGCTTAT GGCTGGCATG 6060 

GAAATAGAAA TACTCAATTA TGTCTTTGTT GTATTAATGG GGAATATTTT GGACAATGTT 612 0 

TCATTATCAA ATTGTCGACA TCATTAATAT ATATTGTAAT GTTGGGAAGA GATCACTATT 6180 

TTGAAGCACA GCTTTACAGA TGAGTATCTA TGATACATAT GTATAATAAA TTTTGATCGG 6240 

50 GTATTAAAAG TATTAGAAGG TGGTTATAAT TGCAGAGTAT TCCATGAATA GTACACTGAC 6300 

ACAGGGGTTT TACTTTGAGG ACCAGTGTAG T CAAGGG AAA ACATGAGTTA AAAAGAAAAG 63 60 

CAGGCAATAT TGCAGTCTTG ATTGTGCCAC TTACAGGATA GATAATGCCT GAACTTTAAT 6420 

GACAAGATGA TCCAACCATA AAGGTGCTCT GTGCTTCACA GTGAATCTTT TCCCCATGCA 6480 

GGAGTGTGCT CCCCTACAAA CGTTAAGACT GATCATTTCA AAAAT CTATT AGCTATATCA 6540 

55 AAAGCCTTAC ATTTTAATAT AGGTTGAACC AAAATTTCAA TT CCAGTAAC TTCTATTGTA 6600 

ACCATTATTT TTGTGTATGT CTTCAAGAAT GTT CATTGGA TTTTTGTTTG TAATAGTAAA 6660 

ATACCGGATA CATTTCACGT GTCCTTCAGT ATTGATTTGG TTGAATATTG GGTCATAATG 6720 

GTTGAGAAGC ATGGACACTA GAGCCAGAAT GCTTGGATAT GAATCCTGGA TCTGTCACTT 6780 

ACTTCTGTGT GACCTTTGAA AGGCTACTTA TTTCCTCTCT TAGCTTTCTC AT T AAAAT CA 6840 

60 ATGAACAATG CCAGCCTCAT GGGGT TGTTG AATGATTAAA TTAGTTAATA TACCTAAAGT 6900 

ACATAGAACA CTGCCTGCAC ATAGTAAAAG AATTATAAGT GTGAGGTAGT TGGTAAAATT 6960 

ATGTAGTTGG ATATACTACC GAACAATATC TAATCTCTTT TTAGGGAAAT AAAGTTTGTG 7020 

CATATATATA AT CCCGAAAC ATG 

65 Seq ID NO: 3 4 Protein sequence: 
Protein Accession #: NP_077741.1 

1 11 21 31 41 51 

™ 1 1 1 1 1 ' 

/U MAAAGPRRSV RGAVCLHLLL TLVIFSRDGE ACKKVILNVP SKLEADKIIG RVNLEECFRS 60 

ADLIRSSDPD FRVLNDGSVY TARAVAL SDK KRSFTIWLSD KRKQTQKEVT VLLEHQKKVS 120 

KTRHTRETVL RRAKRRW AP I PCSMQENSLG PFPLFLQQVE SDAAQNYTVF YSISGRGVDK 180 

EPLNLFYIER DTGNLFCTRP VDREEYDVFD h I AYASTADG YSADLPLPLP IRVEDENDNH 240 

PVFTEAIYNF EVLESSRPGT TVGWCATDR DEPDTMHTRL KYSILQQTPR SPGLFSVHPS 300 

75 TGVITTVSHY LDREWDKYS LIMKVQDMDG QFFGLIGTST CIITVTDSND NAPTFRQNAY 360 

EAFVEENAFN VEILRIPIED KDLINTANWR VNFT I LKGNE NGHFKISTDK ETNEGVLSW 420 

KPLNYEENRQ VNLEIGVNNE APFARD I PRV TALNRALVTV HVRDLDEGPE CTPAAQYVRI 480 

KENLAVGSKI NGYKAYDPEN RNGNGLRYKK LHDPKGWITI DEISGSIITS KILDREVETP 540 

KNELYNITVL A I DKDDRS CT GTLAVNIEDV NDNPPEILQE YWICKPKMG YTDILAVDPD 600 

80 EPVHGAPFYF SLPNTSPEIS RLWSLTKVND TAARLSYQKN AGFQEYTIPI TVKDRAGQAA 660 

TKLLRVNLCE CTHPTQCRAT SRSTGVILGK WAILAILLGI ALLFSVLLTL VCGVFGATKG 720 

KRFPEDLAQQ NLIISNTEAP GDDRVCSANG FMTQTTNNSS QGFCGTMGSG MKNGGQETIE 780 

MMKGGNQTLE SCRGAGHHHT LDSCRGGHTE VDNCRYTYSE WHSFTQPRLG EESIRGHTG 

85 Seq ID NO: 35 DNA sequence 

Nucleic Acid Accession #: Eos sequence 
Coding sequence: 146-1273- 



202 



WO 02/086443 



PCT/US02/12476 



1 11 21 31 41 51 

I I I 1 I I 

GGGAGTGGGC GTGGCGGTGC TGCCCAGGTG AGCCACCGCT GCTTCTGCCC AGACACGGTC 60 

5 GCCTCCACAT CCAGGTCTTT GTGCTCCTCG CTTGCCTGTT CCTTTTCCAC GCATTTTCCA 120 

GGATAACTGT GACT CCAGGC CCGCAATGGA TGCCCTGCAA CTAGCAAATT CGGCTTTTGC 180 

CGTTGATCTG TTCAAACAAC TATGTGAAAA GGAGCCACTG GGCAATGTCC TCTTCTCTCC 240 

AATCTGTCTC TCCACCTCTC TGTCACTTGC TCAAGTGGGT GCTAAAGGTG ACACTGCAAA 3 00 

TGAAAT TGGA CAGGTTCTTC ATTTTGAAAA TGT CAAAGAT ATACCCTTTG GATTTCAAAC 360 

10 AGTAACATCG GATGTAAACA AACTTAGTTC CTTTTACTCA CTGAAACTAA TCAAGCGGCT 420 

CTACGTAGAC AAATCT CTG A ATCTTTCTAC AGAGTT CATC AGCTCTACGA AGAGACCCTA 480 

TGCAAAGGAA TTGGAAACTG TTGACTTCAA AGATAAATTG GAAGAAACGA AAGGTCAGAT 540 

CAACAACTCA ATTAAGGATC TCACAGATGG CCACTTTGAG AACATTTTAG CTGACAACAG 600 

TGTGAACGAC CAGACCAAAA TCCTTGTGGT TAATGCTGCC TACTTTGTTG GCAAGTGGAT 660 

15 GAAGAAATTT CCTGAAT CAG AAACAAAAGA ATGTCCTTTC AGACTCAACA AGACAGACAC 720 

CAAACCAGTG CAGATGATGA ACATGGAGGC CACGTTCTGT ATGGGAAACA TTGACAGTAT 780 

CAATTGTAAG AT CAT AGAGC TTCCTTTTCA AAATAAGCAT CTCAGCATGT TCATCCTACT 840 

ACCCAAGGAT GTGGAGGATG AGTCCACAGG CTTGGAGAAG ATTGAAAAAC AACTCAACTC 900 

AG AGT CACTG TCACAGTGGA CTAATCCCAG CACCATGGCC AATGCCAAGG TCAAACTCTC 960 

20 CATTCCAAAA TTTAAGGTGG AAAAGATGAT TGATCCCAAG GCTTGTCTGG AAAATCTAGG 102 0 

GCTGAAACAT AT CTT CAGTG AAGACACATC TGATTTCTCT GGAATGTCAG AGACCAAGGG 1080 

AGTGGCCCTA TCAAATGTTA TCCACAAAGT GTGCTTAGAA ATAACTGAAG ATGGTGGGGA 1140 

TTCCATAGAG GTGCCAGGAG CACGGATCCT GCAGCACAAG GATGAATTGA ATGCTGACCA 1200 

TCCCTTTATT TACATCAT CA GGCACAACAA AACTCGAAAC ATCATTTTCT TTGGCAAATT 12 60 

25 CTGTTCTCCT TAAGTGGCAT AGCCCATGTT AAGTCCTCCC TGACTTTTCT GTGGATGCCG 1320 

ATTTCTGTAA ACTCTG CATC CAGAGATTCA TTTTCTAGAT ACAATAAATT GCTAATGTTG 1380 

CTGGAT CAGG AAGCCGCCAG TACTTGTCAT ATGTAGCCTT CACACAGATA GACCTTTTTT 1440 

TTTT TCCAAT TCTATCTTTT GTTTCCTTTT TTCCCATAAG ACAATGACAT ACGCTTTTAA 1500 

TGAAAAGGAA TCACGTTAGA GGAAAAATAT TTATT CATTA TTTGTCAAAT TGTCCGGGGT 1560 

30 AGT TGG CAG A AATACAGTCT TCCACAAAGA AAATTCCTAT AAGGAAGATT TGGAAGCTCT 1620 

TCTTCCCAGC ACTATGCTTT CCTTCTTTGG GATAGAGAAT GTTCCAGACA TTCTCGCTTC 1680 

CCTGAAAGAC TGAAGAAAGT GTAGTGCATG GGACCCACGA AACTGCCCTG GCTCCAGTGA 1740 

AACTTGGGCA CATGCT CAGG CTACTATAGG TCCAGAAGTC CTTATGTTAA GCCCTGGCAG 1800 

GCAGGTGTTT ATT AAAATT C TGAATTTTGG GGATTTTCAA AAGATAATAT TTTACATACA 1860 

35 CTGTATGTTA TAGAACT T CA TGGATCAGAT CTGGGGCAGC AACCTATAAA TCAACACCTT 1920 

AATATGCTGC AACAAAATGT AGAATATTCA GACAAAA TGG ATACATAAAG ACTAAGTAGC 1980 

CCATAAGGGG TCAAAATTTG CTGCCAAATG CGTATGCCAC CAACTTACAA AAACACTTCG 2040 

TTCGCAGAGC TTTTCAGATT GTGGAATGTT GGATAAGGAA TTATAGACCT CTAGTAGCTG 2100 

AAATG CAAGA CCCCAAGAGG AAGTTCAGAT CTTAATATAA ATTCACTTTC ATTTTTGATA 2160 

40 GCTGTCCCAT CTGGT CATGT GGTTGGCACT AGACTGGTGG CAGGGGCTTC TAGCTGACTC 2220 

GCACAGGGAT TCTCACAATA GCCGATATCA GAATTTGTGT TGAAGGAACT TGTCTCTTCA 2280 

TCTAATATGA TAGCGGGAAA AGGAGAGGAA ACTACTGCCT TTAGAAAATA TAAGTAAAGT 2340 

GATTAAAGTG CTCACGTTAC CTTGACACAT AGTTTTTCAG TCTATGGGTT TAGTTACTTT 2400 

AGATGGCAAG CATGT AACTT ATATTAATAG TAATTTGTAA AGTTGGGTGG ATAAGCTATC 2460 

45 CCTGTTGCCG GTTCATGGAT TACTTCTCTA TAAAAAATAT ATATTTACCA AAAAATTTTG 2 520 

TGACATTCCT TCTCCCATCT CTTCCTTGAC ATGCATTGTA AATAGGTTCT TCTTGTTCTG 2580 
AGATTCAATA TTGAATTTCT CCTATGCTAT TGACAATAAA ATATTATTGA ACTACC 

Seq ID NO: 36 Protein sequence: 
50 Protein Accession #: NP_002630.1 

1 11 21 31 41 51 

I I I I I I 

MDALQLANSA FAVDLFKQLC EKEPLGNVLF SPICLSTSLS LAQVGAKGDT ANEIGQVLHF 60 

55 ENVKD I PFGF QTVTSDVNKL SSFYSLKIiIK RLYVDKSLNL STEFISSTKR PYAKELETVD 120 

FKDKLEETKG QINNSIKDLT DGHFENILAD NSVNDQTKIL WNAAYFVGK WMKKFPESET 180 

KECPFRLNKT DTKPVQMMNM EATFCMGNID SINCKIIELP FQNKHLSMFI LLPKDVEDES 240 

TGLEKIEKQL NSESLSQWTN PSTMANAKVK LSIPKFKVEK M I DPKACLEN LGLKHI FSED 300 

TSDFSGMSET KGVALSNVIH KVCLEITEDG GDSIEVPGAR ILQHKDELNA DHPFIYIIRH 360 
60 NKTRNIIFFG KFCSP 



Seq ID NO: 3 7 DNA sequence 
Nucleic Acid Accession #: NM_0168583 
65 Coding sequence: 72-842 

1 11 21 31 41 51 

I I 1 I I I 

GGAGTGGGGG AGAGAGAGGA GACCAGGACA GCTGCTGAGA CCTCTAAGAA GTCCAGATAC 60 

70 TAAGAGCAAA GATGTTTCAA ACTGGGGGCC TCATTGTCTT CTACGGGCTG TTAGCCCAGA 12 0 

CCATGGCCCA GTTTGGAGGC CTGCCCGTGC CCCTGGACCA GACCCTGCCC TTGAATGTGA 180 

ATCCAGCCCT GCCCTTGAGT CCCACAGGTC TTGCAGGAAG CTTGACAAAT GCCCTCAGCA 240 

ATGGCCTGCT GTCTGGGGGC CTGTTGGGCA TTCTGGAAAA CCTTCCGCTC CTGGACATCC 300 

TGAAGCCTGG AGGAGGTACT TCTGGTGGCC TCCTTGGGGG ACTGCTTGGA AAAGTGACGT 360 

75 CAGTGATTCC TGGCCTGAAC AACATCATTG ACATAAAGGT CACTGACCCC CAGCTGCTGG 420 

AACTTGGCCT TGTGCAGAGC CCTGATGGCC ACCGTCTCTA TGTCACCATC CCTCTCGGCA 480 

TAAAGCTCCA AGTGAATACG CCCCTGGTCG GTGCAAGTCT GTTGAGGCTG GCTGTGAAGC 540 

TGGACATCAC TGCAGAAATC TTAGCTGTGA GAGATAAGCA GGAGAGGATC CACCTGGTCC 600 

TTGGTGACTG CACCCATTCC CCTGGAAGCC TGCAAATTTC T CTG CTTGAT GGACTTGGCC 660 

80 CCCTCCCCAT TCAAGGTCTT CTGGACAGCC TCACAGGGAT CTTGAATAAA GTCCTGCCTG 720 

AGT TGGTTCJ\ GGGCAACGTG TGCCCTCTGG TCAATGAGGT TCTCAGAGGC TTGGACATCA 780 

CCCTGGTGCA TGACATTGTT AACATGCTGA TCCACGGACT ACAGTTTGTC ATCAAGGTCT 840 

AAGCCTTCCA GGAAGGGGCT GGCCTCTGCT GAGCTGCTTC CCAGTGCTCA CAGATGGCTG 900 

GCCCATGTGC TGGAAGATGA CACAGTTGCC TTCTCTCCGA GGAACCTGCC CCCTCTCCTT 960 

85 TCCCACCAGG CGTGTGTAAC ATCCCATGTG CCTCACCTAA TAAAATGGCT CTTCTTCTGC 1020 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 
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Seq ID NO : 3 8 Protein sequence: 
Protein Accession #: NP_057667 

1 11 21 31 41 51 

I I I I I I 

MFQTGGLIVF YGLLAQTMAQ FGGLPVPLDQ TLPIiNVNPAL PL S PTGLAGS LTNALSNGLL 60 

SGGLLGILEN LPLLDILKPG GGTSGGLLGG LLGKVTSVIP GLNNI IDIKV TDPQLLELGIi 120 

VQSPDGHRLY VTIPLGIKLQ VNTPLVGASL LRLAVKLDIT AE I LAVRDKQ ERIHLVLGDC 180 

THSPGSLQIS LLDGLGPLPI QGLLDSLTGI LNKVLPELVQ GNVCPLVNEV LRGLDITLVH 240 

DIVNMLIHGL QFVIKV 



Seq ID NO: 3 9 DNA sequence 

Nucleic Acid Accession #: NM_0043 63.1 

Coding sequence: 115-2223 

1 11 21 31 41 51 

1 I I I I I 

CTCAGGGCAG AGGGAGGAAG GACAGCAGAC CAGACAGT C A CAGCAGCCTT GACAAAACGT 60 

TCCTGGAACT CAAGCTCTTC TCCACAGAGG AGGACAGAGC AGACAGCAGA GACCATGGAG 120 

TCTCCCTCGG CCCCTCCCCA CAGATGGTGC ATCCCCTGGC AGAGGCTCCT GCTCACAGCC 180 

TCACTTCTAA CCTTCTGGAA CCCGCCCACG ACTGCCAAGC TCACTATTGA ATCCACGCCG 240 

TTCAATGTCG CAGAGGGGAA GGAGGTGCTT CTACTTGTCC ACAATCTGCC CCAGCATCTT 300 

TTTGGCTACA GCTGGTACAA AGGTGAAAGA GTGGATGGCA ACCGTCAAAT TATAGGATAT 3 60 

GTAATAGGAA CTCAACAAGC TACCCCAGGG CCCGCATACA GTGGTCGAGA GATAATATAC 420 

CCCAATGCAT CCCTGCTGAT CCAGAACATC AT CCAGAATG ACACAGGATT CTACACCCTA 480 

CACGTCATAA AGTCAGATCT TGTGAATGAA GAAGCAACTG GCCAGTTCCG GGTATACCCG 540 

GAGCTGCCCA AGCCCTCCAT CTCCAGCAAC AACTCCAAAC CCGTGGAGGA CAAGGATGCT 600 

GTGGCCTTCA CCTGTGAACC TGAGACTCAG GACGCAACCT ACCTGTGGTG GGTAAACAAT 660 

CAGAGCCTCC CGGTCAGTCC CAGGCTGCAG CTGTCCAATG GCAACAGGAC CCTCACTCTA 72 0 

TTCAATGTCA CAAGAAATGA CACAGCAAGC TACAAATGTG AAACCCAGAA CCCAGTGAGT 780 

GCCAGGCGCA GTGATTCAGT CATCCTGAAT GTCCTCTATG GCCCGGATGC CCCCACCATT 840 

TCCCCTCTAA ACACATCTTA CAGAT CAGGG GAAAATCTGA ACCTCTCCTG CCACGCAGCC 900 

TCTAACCCAC CTGCACAGTA CTCTTGGTTT GTCAATGGGA CTTTCCAGCA ATCCACCCAA 960 

GAGCTCTTTA TCCCCAACAT CACTGTGAAT AATAGTGGAT CCTATACGTG CCAAGCCCAT 102 0 

AACTCAGACA CTGGCCTCAA TAGGACCACA GT CACGACG A TCACAGTCTA TGCAGAGCCA 1080 

CCCAAACCCT TCATCACCAG CAACAACTCC AACCCCGTGG AGGATGAGGA TGCTGTAGCC 1140 

TTAACCTGTG AACCTGAGAT TCAGAACACA ACCTACCTGT GGTGGGTAAA TAATCAGAGC 1200 

CTCCCGGTCA GTCCCAGGCT GCAGCTGTCC AATGACAACA GGACCCTCAC TCTACTCAGT 1260 

GTCACAAGGA ATGATGTAGG ACCCTATGAG TGTGGAATCC AGAACGAATT AAGTGTTGAC 1320 

CACAGCGACC CAGTCATCCT GAATGTCCTC TATGGCCCAG ACGACCCCAC CATTTGCCCC 1380 

TCATACACCT ATTACCGTCC AGGGGTGAAC CTCAGCCTCT CCTGCCATGC AGCCTCTAAC 1440 

CCACCTGCAC AGTATTCTTG GCTGATTGAT GGGAACATCC AGCAACACAC ACAAGAGCTC 1500 

TTTATCTCCA ACATCACTGA GAAGAACAGC GGACTCTATA CCTGCCAGGC CAATAACT CA 1560 

GCCAGTGGCC ACAGCAGGAC TACAGTCAAG ACAATCACAG TCTCTGCGGA GCTGCCCAAG 162 0 

CCCTCCATCT CCAGCAACAA CTCCAAACCC GTGGAGGACA AGGATGCTGT GGCCTTCACC 1680 

TGTGAACCTG AGGCTCAGAA CACAACCTAC CTGTGGTGGG TAAATGGTCA GAGCCTCCCA 1740 

GT CAGTCCCA GGCTGCAGCT GTCCAATGGC AACAGGACCC T CACTCT ATT CAATGTCACA 1800 

AGAAATGACG CAAGAGCCTA TGTATGTGGA AT CCAGAACT CAGTGAGTGC AAACCGCAGT 1860 

GACCCAGTCA CCCTGGATGT CCTCTATGGG CCGGACACCC CCATCATTTC CCCCCCAGAC 192 0 

TCGTCTTACC TTTCGGGAGC GAACCTCAAC CTCTCCTGCC ACTCGGCCTC TAACCCATCC 1980 

CCGCAGTATT CTTGGCGTAT CAATGGGATA CCGCAGCAAC ACACACAAGT TCTCTTTATC 2 040 

GCCAAAATCA CGCCAAATAA TAACGGGACC TATGCCTGTT TTGTCTCTAA CTTGGCTACT 210 0 

GGCCGCAATA AT TCCATAGT CAAGAGCATC ACAGTCTCTG CATCTGGAAC TTCTCCTGGT 2160 

CTCTCAGCTG GGGCCACTGT CGGCATCATG ATTGGAGTGC TGGTTGGGGT TGCTCTGATA 222 0 

TAGCAGCCCT GGTGTAGTTT CTTCATTTCA GGAAGACTGA CAGTTGTTTT GCTTCTTCCT 22 80 

TAAAGCATTT GCAACAGCTA CAGTCTAAAA TTGCTTCTTT ACCAAGGATA TTTACAGAAA 2340 

AGACTCTGAC C AG AG AT CG A GACCATCCTA GCCAACATCG TGAAACCCCA TCTCTACTAA 2400 

AAATACAAAA ATGAGCTGGG CTTGGTGGCG CGCACCTGTA GTCCCAGTTA CTCGGGAGGC 2460 

TGAGGCAGGA GAATCGCTTG AACCCGGGAG GTGGAGATTG CAGTGAGCCC AGATCGCACC 2520 

ACTGCACTCC AGTCTGGCAA CAGAGCAAGA CTCCATCTCA AAAAGAAAAG AAAAGAAGAC 2580 

TCTGACCTGT ACTCTTGAAT ACAAGTTTCT GATACCACTG CACTGTCTGA GAATTTCCAA 2 640 

AACTTTAATG AACTAACTGA CAGCTTCATG AAACTGTCCA CCAAGATCAA GCAGAGAAAA 2 700 

TAATTAATTT CATGGGACTA AATGAACTAA TGAGGATTGC TGATTCTTTA AATGT CTTGT 2 760 

TTCCCAGATT TCAGGAAACT TTTTTTCTTT TAAGCTATCC ACTCTTACAG CAATTTGATA 282 0 

AAATATACTT TTGTGAACAA AAATTGAGAC ATTTACATTT TCTCCCTATG TGGTCGCTCC 2880 

AGACTTGGGA AACTATTCAT GAATATTTAT ATTGTATGGT AATATAGTTA TTGCACAAGT 2940 
TCAATAAAAA TCTGCTCTTT GTATAACAGA AAAA 

Seq ID NO: 40 Protein sequence: 
Protein Accession #: NP_004354.1 

1 11 21 31 41 51 

I I I I I I 

MESPSAPPHR WCIPWQRLLL TASLLTFWNP PTTAKLTIES TPFNVAEGKE VLLLVHNLPQ 60 

HLFGYSWYKG ERVDGNRQII GYVIGTQQAT PGPAYSGRE I IYPNASLLIQ NIIQNDTGFY 12 0 

TLHVIKSDLV NEEATGQFRV YPELPKPSIS SNNSKPVEDK DAVAFTCEPE TQDATYLWWV 180 

NNQSLPVSPR LQLSNGNRTL TLFNVTRNDT ASYKCETQNP VSARRSDSVI LNVLYGPDAP 240 

TISPLNTSYR SGENLNLSCH AASNPPAQYS WFVNGTFQQS TQELFIPNIT VNNSGSYTCQ 300 

AHNSDTGLNR TTVTTITVYA EPPKPFITSN NSNPVEDEDA VALTCEPEIQ NTTYLWWVNN 360 

QSLPVSPRLQ LSNDNRTDTL LSVTRNDVGP YECGIQNELS VDHSDPVILN VLYGPDDPTI 42 0 

SPSYTYYRPG VNI1SL1SCHAA SNPPAQYSWL IDGNIQQHTQ EL»FI SNITEK NSGLYTCQAN 480 

NSASGHSRTT VKTITVSAEL PKPSISSNNS KPVEDKDAVA FTCEPEAQNT TYLWWVNGQS 540 

LPVSPRLQLS NGNRTLTLFN VTRNDARAYV CGIQNSVSAN RSDPVTLDVL YGPDTPIISP 60 0 

PDSSYLSGAN LNLSCHSASN PSPQYSWRIN GIPQQHTQVL FIAKITPNNN GTYACFVSNL 660 
ATGRNNS I VK SITVSASGTS PGLSAGATVG IMIGVLVGVA III 



204 



WO 02/086443 
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Seq ID NO: 41 DNA sequence 
Nucleic Acid Accession #: NM_006952.1 
^ Coding sequence: 11-793 

1 11 21 31 . 41 51 

I ' I I I I 1 

AATCCCGACA ATGGCGAAAG ACAACTCAAC TGTTCGTTGC TTCCAGGGCC TGCTGATTTT 60 

TGGAAATGTG ATTATTGGTT GTTGCGGCAT TGCCCTGACT GCGGAGTGCA TCTTCTTTGT 120 

10 ATCTGACCAA CACAGCCTCT ACCCACTGCT TGAAGCCACC GACAACGATG ACATCTATGG ISO 

GGCTGCCTGG ATCGGCATAT TTGTGGGCAT CTGCCTCTTC TGCCTGTCTG TTCTAGGCAT 240 

TGTAGGCATC ATGAAGTCCA GCAGGAAAAT TCTTCTGGCG TATTTCATTC TGATGTTTAT 300 

AGTATATGCC TTTGAAGTGG CATCTTGTAT CACAGCAGCA ACACAACGAG ACTTTTTCAC 360 

ACCCAACCTC TTCCTGAAGC AGATGCTAGA GAGGTACCAA AACAACAGCC CTCCAAACAA 42 0 

15 TGATGACCAG TGGAAAAACA ATGGAGTCAC CAAAACCTGG GACAGGCTCA TGCTCCAGGA 480 

CAATTGCTGT GGCGTAAATG GTCCAT CAGA CTGGCAAAAA TACACATCTG CCTTCCGGAC 540 

TGAGAATAAT GATGCTGACT ATCCCTGGCC TCGTCAATGC TGTGTTATGA ACAATCTTAA 600 

AGAACCTCTC AACCTGGAGG CTTGTAAACT AGGCGTGCGT GGTTTTTATC ACAAT CAGGG 660 

CTGCTATGAA CTGATCTCTG GTCCAATGAA CCGACACGCC TGGGGGGTTG CCTGGTTTGG 720 

20 AT TTGCCATT CTCTGCTGGA CTTTTTGGGT TCTCCTGGGT ACCATGTTCT ACTGGAGCAG 7 80 
AATTGAATAT TAAGAA 



25 



Seq ID NO: 42 Protein sequence: 
Protein Accession #: NP 008883.1 



1 11 21 31 41 51 

I ] I 1 1 I 

MAKDNSTVRC FQGLLIFGNV IIGCCGIALT AECIFFVSDQ HSLYPLLEAT DNDDIYGAAW 60 

IGIFVGICLF CLSVLGIVGI MKSSRKILLA YFILMFIVYA FEVASCITAA TQRDFFTPNL 12 0 

30 FLKQMLERYQ NNSPPNNDDQ WKNNGVTKTW DRLMLQDNCC GVNGPSDWQK YTSAFRTENN 180 

DADYPWPRQC CVMNNLKEPL NLEACKLGVP GFYHNQGCYE LISGPMNRHA WGVAWFGFAI 240 
LCWTFWVLLG TMFYWSRIEY 

35 Seq ID NO: 43 DNA sequence 

Nucleic Acid Accession #: Eos sequence 
Coding sequence: 83-2605 

1 11 21 31 41 51 

40 i | | I I I 

GCCGGACAGA TCTGCGCGTA TCCTGGAGCC GGCCCAGTTG TGAACTAGGA GAGCTTTGGG 60 

ACCTCTGTCC CAAGCAAGAG AGATGAATGG AGAGTATAGA GGCAGAGGAT TTGGACGAGG 120 

AAGATTTCAA AGCTGGAAAA GGGGAAGAGG TGGTGGGAAC TTCT CAGGAA AATGGAGAGA 180 

AAGAGAACAC AGACCTGATC TGAGTAAAAC CACAGGAAAA CGTACTTCTG AACAAACCCC 240 

45 ACAGTTTTTG CTTTCAACAA AGACCCCACA GTCAATGCAG TCAACATTGG AT CGATTCAT 300 

AC CAT AT AAA GGCTGGAAGC TTTATTTCTC TGAAGTTTAC AGCGATAGCT CTCCTTTGAT 360 

TGAGAAGATT CAAGCATTTG AAAAATTTTT CACAAGGCAT ATTGATTTGT ATGACAAGGA 420 

TGAAATAGAA AGAAAGGGAA GTATTTTGGT AGATTTTAAA GAACTGACAG AAGGTGGTGA 480 

AGTAACTAAC TTGATACCAG ATATAGCAAC TGAACTAAGA GATGCACCTG AGAAAACCTT 540 

50 GGCTTGCATG GGTTTGGCAA TACATCAGGT GTTAACTAAG GACCTTGAAA GGCATGCAGC 600 

TGAGTTACAA GCCCAGGAAG GATTGTCTAA TGATGGAGAA ACAATGGTAA ATGTGCCACA 660 

TATTCATGCA AGGGTGTACA ACTATGAGCC TTTGACACAG CTCAAGAATG TCAGAGCAAA 720 

TTACTATGGA AAATACATTG CTCTAAGAGG GACAGTGGTT CGTGT CAGTA ATATAAAGCC 780 

TCTTTGCACC AAGATGGCTT TTCTTTGTGC TGCATGTGGA GAAATTCAGA GCTTTCCTCT 840 

55 TCCAGATGGA AAATACAGTC TTCCCACAAA GTGTCCTGTG CCTGTGTGTC GAGGCAGGTC 900 

ATTTACTGCT CTCCGCAGCT CTCCTCTCAC AGTTACGATG GACTGGCAGT CAATCAAAAT 960 

CCAGGAATTG ATGTCTGATG ATCAGAGAGA AGCAGGTCGG ATTCCACGAA CAATAGAATG 102 0 

TGAGCTTGTT CATGATCTTG TGGATAGCTG TGTCCCGGGA GACACAGTGA CTATTACTGG 1080 

AATTGT CAAA GTCTCAAATG CGGAAGAAGG TTCTCGAAAT AAGAATGACA AGTGTATGTT 1140 

60 CCTTTTGTAT ATTGAAGCAA ATTCTATTAG TAATAGCAAA GGACAGAAAA CAAAGAGTT C 12 0 0 

TGAGGATGGG TGTAAGCATG GAATGTTGAT GGAGTTCTCA CTTAAAGACC TTTATGCCAT 1260 

CCAAGAGATT CAAGCTGAAG AAAACCTGTT TAAACTCATT GTCAACTCGC TTTGCCCTGT 132 0 

CATTTTTGGT CATGAACTTG TTAAAGCAGG TTTGGCATTA GCACTCTTTG GAGGAAGCCA 1380 

GAAATACGCA GATGACAAAA ACAGAATTCC AATTCGGGGA GACCCCCACA TCCTTGTTGT 1440 

65 TGGAGATCCA GGCCTAGGAA AAAGTCAAAT GCTACAGGCA GCGTGCAATG TTGCCCCACG 1500 

TGGCGTGTAT GTTTGTGGTA ACACCACGAC CACCTCTGGT CTGACGGTAA CTCTTTCAAA 1560 

AGATAGTTCC TCTGGAGATT TTGCTTTGGA AGCTGGTGCC CTGGTACTTG GTGATCAAGG 1620 

TATTTGTGGA ATCGATGAAT TTGATAAGAT GGGGAATCAA CATCAAGCCT TGTTGGAAGC 1680 

CATGGAGCAG CAAAGTATTA GTCTTGCTAA GGCTGGTGTG GTTTGTAGCC TTCCTGCAAG 1740 

70 AACTTCCATT ATTGCTGCTG CAAATCCAGT TGGAGGACAT TACAATAAAG CCAAAACAGT 1800 

TTCTGAGAAT TTAAAAATGG GGAGTGCACT ACTATCCAGA TTTGATTTGG TCTTTATCCT 1860 

GTTAGATACT CCAAATGAGC ATCATGATCA CTTACTCTCT GAACATGTGA TTGCAATAAG 1920 

AG CTGG AAAG CAGAGAACCA TTAGCAGTGC CACAGTAGCT CGTATGAATA GTCAAGATTC 1980 

AAATACTT CC GTACTTGAAG TAGTTTCTGA GAAGCCATTA TCAGAAAGAC TAAAGGTGGT 2 040 

75 TCCTGGAGAA AC AATAGAT C CCATTCCCCA CCAGCTATTG AGAAAGTACA TTGGCTATGC 2100 

TCGGCAGTAT GTGTACCCAA GGCTATCCAC AGAAGCTGCT CGAGTTCTTC AAGATTTTTA 2160 

CCTTGAGCTC CGGAAACAGA GCCAGAGGTT AAATAGCTCA CCAATCACTA CCAGGCAG CT 2220 

GGAATCTTTG ATTCGTCTGA CAGAGGCACG AGCAAGGTTG GAATTGAGAG AGGAAGCAAC 2280 

CAAAGAAGAC GCTGAGGATA TAGTGGAAAT TATGAAATAT AGCATGCTAG GAACTTACTC 2340 

80 TGATGAATTT GGGAACCTAG ATT TTGAGCG AT CCCAGCAT GGTTCTGGAA TGAGCAACAG 2400 

GTCAACAGCG AAAAGATTTA TTTCTGCTCT CAACAACGTT GCTGAAAGAA CTTATAATAA 2460 

TATATTTCAA TTTCATCAAC TTCGGCAGAT TGCCAAAGAA CTAAACATTC AGGTTGCTGA 2520 

TTTTGAAAAT TTTATTGGAT CACTAAATGA CCAGGGTTAC CTCTTGAAAA AAGGCCCAAA 2580 

AGTTTACCAG CTTCAAACTA TGTAAAAGGA CTTCACCAAG TTAGGGCCTC CTGGGTTTAT 2 640 

85 TGCAGATTAA AGCCATCTCA GTGAAGATAT GCGTGCACGC ACAGACAGAC AGACACACAC 2700 

ACACACACAC AC AC AC AC AC AGACACACAC AC ACACAGT C AAATACTGTT CTCTGAAAAA 2760 

TGATGTCCCA AAAGTATTAT AATAGGAAAA AAG CAT T AAA TATAATAAAC TAATTTAAGA 2 820 
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AGTGATAAAG TCTCCAGATG CAGTAGCTCA CACTGTAATC ACAGTGACTC 
GGTGAGAGGA TTCCTTGAGG CCAGGGTTCG AGACCAACCT TGGGCAACAT 
CATTTCTTAA AAAAAAAAAA AAAAAATTTA AACTTAGCTG GGTATGGTGG 
TAGTCTCAGC TACTTGTGAG GCTGAGGCAG GAGGATT CTT TGAGCCCAGG 
TACAGTGAGC CACAATCACA CCAAT CACTG CACTCCAGCC TGGGCAATAA 
GACTCAAAAA AATAAAAAAA ATTGTAGTGG TAGCCATGTG TTAATTGTTA 
CCAAAGGGCT AAAAGTAAAT TACTTATAAA TTTTTTATAG TTGTATTTTT 
TTATATGTAT GAATATTTCA TAGTTTTGCA TATCAGATGT AGGCATACAG 
AAAC CAATG A ATATATTACA TATTCTGTGT TCCAATAAAA CTTTATTTAT 
ATTTGAATTT CATAAAATTT TCCCATGTCA AGAATACAAA ATACTTGAGT 
GCTATTTAAT AATAGGTCTC ATTTATTCCA CAGGCTGTAG TTTGTAGTCT 
AATAGAAACA GACTGATTAA GCAGGAGAAG TTTTTTGAAA GAATTTTGTT 
AATTATTAGA AGGCAGGTGA ACCAGGAGGG TAAGCTTCCA GCAGCAATTT 
GCCTTAGAAT TGGACTAAGG AAGAAGCTGC TGACACTCCA CTGCCACACA 
AGAAAGTGCT GCTGCCTCCC TGCCCCACCT TTGCCACTTC TGCAGCAGGA 
GAATGCCCCC ACCCGCACCG GAACAGCAAC AAAAGGATTC TGCATGAGAT 
ATTGCTGAAT TCAAAAAAGA AGTTGCATAC AAAGACATCT GATTGAAAAA 
TATGCCCCTT TCATAGGCTG CTAGGGAGTT TTCCTGGTTC TACTTTCAGG 
AATAAGACCA GAATTTCTCA TATGTTGTGA GAGGAT TCAA ATGTTACAGG 
AAACTAT CAA TCATGTATAA ATCCAACAAA CACTTTGTAA CATACAAGAA 
GTGAACCATT GTTGGAGAAT CTACTAAAAT ACGGCTTCCC GCAAACGAAG 
AATGTAAATA AAAAGAACTG GCAGTGTATA TCAGATGTTT AACTATAGGA 
GATGTGGAGA CTATTGCCAT AGACCACAAT GTAAATTTTT AAGTGAGGAA 
GGAAT CAAAA GGGGCCAGGT GCAGTGGCTC ACATCTATAA TCCCAGAGCT 
GAGGCAGGAG GATCACTTGA AGCCAGTTTT GAGACCAGCC TATGCAACAC 
TATCTCTACA AAAAATAGAT TAGCTGGGCA CGGTGGTGCA TGCCTATTGT 
GTGGAGGCTG AAGTAGGAAA TCACTTGAGC CCGAGAGTTT GAGGTTACAG 
TTATACCACT GCACTCCAGC CTGGGCAAGA GAGCAAGACC TTGTCTCTT 

Seq ID NO: 44 Protein sequence: 
Protein Accession CAB55276.2 



PCT/US02/12476 



MNGEYRGRGF 
TPQSMQSTLD 
ILVDFKELTE 
LSNDGETMVN 
LCAACGEIQS 
QREAGRIPRT 
SISNSKGQKT 
KAGLALALFG 
TTTTSGLTVT 
LAKAGWCSL 
HDHLLSEHVI 
IPHQLIjRKYI 
EARARLELRE 
SALNNVAERT 



11 

i 

GRGRFQSWKR 
RF I PYKGWKL 
GGEVTNLIPD 
VPHIHARVYN 
FPLPDGKYSL 
IECELVHDLV 
KSSEDGCKHG 
GSQKYADDKN 
LSKDSSSGDF 
PARTS I I AAA 
AIRAGKQRTI 
GYARQYVYPR 
EATKEDAED I 
YNNIFQFHQL 



21 
i 

GRGGGNFSGK 
YFSEVYSDSS 
IATELRDAPE 
YEPLTQLKNV 
PTKCPVPVCR 
DSCVPGDTVT 
MLMEFSLKDL 
RIPIRGDPHI 
ALEAGALVLG 
NPVGGHYNKA 
SSATVARMNS 
LSTEAARVLQ 
VEIMKYSMLG 
RQIAKELNIQ 



31 
I 

WREREHRPDL 
PLIEKIQAFE 
KTLACMGLAI 
RANYYGKYIA 
GRS FT ALRS S 
ITGIVKVSNA 
YAIQEIQAEE 
LWGDPGLGK 
DQGICGIDEF 
KTVSENLKMG 
QDSNTSVLEV 
DFYIjELRKQS 
TYSDEFGNLD 
VADFENFIGS 



41 
I 

SKTTGKRTSE 
KFFTRHIDLY 
HQVLTKDLER 
LRGTWRVSN 
PLTVTMDWQS 
EEGSRNKNDK 
NLFKLIVNSL 
SQMLQAACNV 
DKMGNQHQAL 
SALLSRFDLV 
VSEKPLSERL 
QRLNSSPITT 
FERSQHGSGM 
LNDQGYLLKK 



Seq ID NO: 45 DNA sequence 

Nucleic Acid Accession #: NM_005416.1 

Coding sequence: 149.. 658 



ACCAGATCCC 
CTGAAGACCA 
AAAGAGTGTG 
CCCACCACCT 
AATATTTGTT 
AAAGATTCCA 
GCCAGGCTGT 
CAAGGTCCCT 
ACCAGGCAGC 
CAAAGTTCCT 
GCCATGTCCT 
TGGTGCACAG 
TGTTTCTGTG 
AGTCTCTCTC 
CTGAAGAATC 
GGCTGCTCAG 
CTCATTAAAT 



11 

I 

AGAGGCTGAA 
GAAAAGCCAC 
TCCACGATCC 
CAGCTTCAAC 
CCCACAACCA 
GAGCCAGGCT 
ACCAAGGTCC 
GAGCCAGGTT 
ATCAAGGTCC 
GAGCAAGGAT 
TCAACGGTCA 
ACAAGCCCTT 
TCTTAATTGT 
TTATTTGTAT 
CTGTAAGCCC 
GGTTCATCTG 
TGCTTTTAAT 



21 
I 

CACCTCGACC 
TAAGACTTTC 
TTTGAAGCAT 
AGCAGCAGGT 
AGGAGCCATG 
GTACCAAGGT 
CTGAGCCAGG 
GTACCAAGGT 
CTGACCAAGG 
ACACCAAAGT 
CTCCAGGCCC 
GAGAAGCCAA 
CTGTAGACCT 
CCTAAAAATA 
CTGAATTAAG 
AAGATTCGAA 
TCCA 



31 
I 

TTCTCTGCAC 
TGCTTAATTC 
GAGTT CTT AC 
GAAACAACCC 
CCACTCAAAG 
CCCTGAGCCA 
TTGTACCAAG 
CCCTGAGCCA 
CTTCATCAAG 
TCCTGTGCCA 
AGCTCAGCAG 
CCACCAGATG 
TGTAATCAGC 
CGTACTATAA 
CAGAAAGTCT 
TGAAAAGAAA 



41 
I 

AGCAGATGAT 
AGGAGCTTAG 
CAGCAGAAGC 
AGCCAGCCTC 
GTTCCACAAC 
GGCTGTACCA 
GTCCCTGAGC 
GGCTACACCA 
TTTCCTGAGC 
GGCTACACAA 
AAGACCAAGC 
CTGGACACCC 
ACATTGTCAC 
AGCTTTTGTT 
TCATGGCTTT 
TGCATGTTTC 



AGGAGGCTGA 
AGCAAGACCC 
CACATGCCTA 
AGTTTGAGGT 
AGTAACTCTT 
AATAAATTCT 
GACCTGCCTT 
ACAAATACAT 
GGACACTAAA 
TTTGTTTTTA 
TGCTTGAAAC 
TGGCTCACGG 
GTAAAACCAT 
GGGCACTGGA 
ATAGGTAGAA 
GCCTCCCTAA 
GGGTATGTTA 
TGGTGGGATC 
GTTGCCAGCC 
CTCAGGAAAT 
ATGAATGGAA 
CCAGAACTAA 
GGAAAAATCA 
TTGGGAGTTC 
ATTGAGACCC 
CCTACCTACT 
TGAGCTATGA 



51 
I 

QTPQFLLSTK 
DKDEIERKGS 
HAAELQAQEG 
IKPLCTKMAF 
IKIQELMSDD 
CMFLLYIEAN 
CPVIFGHELV 
APRGVYVCGN 
LEAMEQQSIS 
FILLDTPNEH 
KWPGETIDP 
RQLESLIRLT 
SNRSTAKRFI 
GPKVYQLQTM 



51 
I 

CCCTGAGCAG 
AGGATTCTTC 
AGACCTTTAC 
CACCTCAGGA 
CTGGAAACAC 
AGGTCCCTGA 
CAGGCTGTAC 
AGGTCCCTGA 
CAGGTGCCAT 
AGCTACCAGA 
AGAAGTAATT 
TCTTCCCATC 
CCCAAGCCAT 
CACACACACT 
TCTGGTCTTC 
CTGCTCTTCC 



Seq ID NO: 46 Protein sequence: 
Protein Accession #: NP 005407.1 



11 



21 



31 



41 



51 



MSSYQQKQTF TPPPQLQQQQ VKQPSQPPPQ EIFVPTTKEP CHSKVPQPGN TKIPEPGCTK 
VPEPGCTKVP EPGCTKVPEP GCTKVPEPGC TKVPEPGCTK VPEPGYTKVP EPGSIKVPDQ 
GFIKFPEPGA IKVPEQGYTK VPVPGYTKLP EPCPSTVTPG PAQQKTKQK 

Seq ID NO: 47 DNA sequence 

Nucleic Acid Accession #: Eos sequence 



2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



60 
120 
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10 
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11 



PCT/US02/12476 



GCGTCGTGTG 
AAGGCTCGTT 
TTGCAATTAA 
GAAAATGGAT 
TTTGGGGAAA 
AGT CGGCGTT 
TAAGGATAAC 
TTGGAGCTGC 
CTAAAAACTT 



CAGGCGTCCC 
AGAATTCGCC 
GCTTAGGGAA 
TAGAGAAACT 
GTGCCCCGAC 
GGCGGCAGCG 
AT C C TGGAAA 
CCTGTGGAGT 
TGTGAGAATT 



21 
I 

CGGGCTGTGG 
CTAGAGCTGT 
CCAGCAACAA 
TCTTCCCCGA 
CGCAGAGGCG 
GTGGCCTTCC 
TGACTTCTGT 
TACAGTTTAC 
TTCTTTTACT 



31 
I 

ATAATTAGAC 
ATCATGTATT 
AAGCAAACTT 
TTTAAGGGGA 
ACGACAGGGG 
TCATCTGGGC 
ACGGTTTGAG 
CAAACACATT 
AAAATTTTTT 



41 

1 


51 

» 




1 

ACGTTCTTCC 


1 

CTCATTGCCC 


60 


TTCTTTCAAA 


TTAACTTTGC 


12 0 


GGCCCGAGGT 




180 


AAGATTCCTG 


CGGCCAGCGC 


240 


AGCAGGAAGC 


TGCTCACGGT 


300 


GATGTGGGCT 


CCTAGAAGAG 


360 


CCCAACTGCA 


CACTCATGAC 


420 


CATGAACATA 


ATCTCATTTA 


480 


CTTATTACAA 


A 





15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



Seq ID NO: 4 8 DNA sequence: 

Nucleic Acid Accession #: CAT cluster 



TTCCAAATTT 
TTTTAGTAAA 
CTCCAAGTCA 
TCCTTACTCT 
CCGACTACCG 
CCCAAAGCGC 
ATTTTCGCGG 
TTGCAAGCAA 
AGCCTTGGGC 
CGACGCT 



11 
! 

TTTTTTTTGT 
TGAGATTATG 
TGAGTGTGCA 
TCTCGGAGCC 
TGAGCAGCTT 
TGGCCGCAGG 
TGAACGACCT 
AGTTAATTTG 
AATGAGGGAA 



21 
I 

AATAAGAAAA 
TTCATGAATG 
GTTGGGCTCA 
CACATCGCCC 
CCTGCTCCCC 
AATCTTTCCC 
CGGGCCAAGT 
AAAGAAAATA 
GAACGTGTCT 



31 
I 

AATTTTAGTA 
TGTTTGGTAA 
AACCGTACAG 
AGATGAGGAA 
TGTCGTCGCC 
CTTAAATCGG 
TTGCTTTTGT 
CATGATACAG 
AGTTATCCAC 



41 
I 

AAAGAAAATT 
ACTGTAACTC 
AAGTCATTTC 
GGCCAGCGCT 
TCTGCGGTCG 
GGAAGAAGTT 
TGCTGGTTCC 
CTCTAGGGCG 
AGCCCGGGGA 



51 
I 

CTCACAAAGT 
CACAGGGCAG 
CAGGATGTTA 
GCCGCCAACG 
GGGCACTTTC 
TCTCTAATCC 
CTAAGCTTAA 
AATTCTAACG 
CGCCTGCACA 



Seq ID NO: 4 9 DNA sequence 

Nucleic Acid Accession #: CAT cluster 



TCTTTCTTCT 
CCTGCCGACC 
ACCGTAGACC 
GACACATGCA 
CCCAACCAAA 
TTCATTTAAA 
CTTTCTCTGA 
TCGCTGTAGC 
TAAAGAAACT 
GATTGAACCA 
TGCTGGTATC 
GACGTGGGGA 
CCGGGTCTCT 
GAGACAGCAT 
ACTG 



11 
I 

GCTGCTCGTT 
TCTGTTGTCT 
CGAAACCATT 
GACCAGTTTT 
GTGTTTAAAA 
AAACTCTAAT 
TCTGTGTCTT 
CATGGGAATC 
GACACAGGAG 
GTGCACTCCA 
GTCCTGCAGC 
GAGCTGGTCT 
CCTGGCCCCG 
CATTTATGAG 



21 

1 

TGTCTCTCCT 
CTTCTCTGAT 
GGGTGT CACA 
CCTGGAACNG 
CTTTTTAGGG 
ATTTATATTA 
TTTTCTTTGA 
CGTTTCATTA 
AATCACTTGA 
GCCTTGGCAG 
CCCATCCTCG 
ATATATCCGG 
GGGACCTAGT 
CCTGCAGCAT 



31 
I 

GTGCTCTTCT 
GGCGGGGGGC 
AGCCGGTCGC 
CATGACCATG 
CACCCCCAAA 
AATACAAAGA 
CAGCATCTCC 
TTATGGTAGC 
ACTTGGGAGG 
CGG AG CAAG A 
GTTCCATTGC 
GTGAAGCTCA 
ATTTTTGCCA 
CCACCCTACT 



41 

1 

TCTTTCTTTC 
GGGAGAAGCT 
CGGCTTTTTT 
TTATTACTAT 
ATTTTTTTTT 
TACCCAAACC 
ATTTTTTTTC 
AATATGGAGT 
CAGAGTTTGC 
TTCTGTCACA 
GCTGCCAGGC 
GCTGTGGCAC 
CGAGTGTACA 
GCTGTATCCA 



51 

I 

CCTCGCCGCT 
GACCGGTGAG 
GGGAGAACCC 
GGGCCGCCTC 

CTTTATGCTT 
TGCTGCTTCA 
GCTGTATTCC 
AGTGAGCCGA 
GTTCCTGAAG 
AGGGTGCTGG 
ACCTTGGATG 
CCAAACAAAG 
GTTTCCATTG 



Seq ID NO: 50 DNA sequence 
Nucleic Acid Accession #: L05187 
Coding sequence: 1991.. 2260 



CTGCAGGGAG 
TCAGAAAGGA 
CAGAAGAAGG 
TGAAGGAAAG 
AGAGTCATAA 
GGAAATGGAT 
ATTTCTAGCT 
CCCCTCCCTT 
ACAACCATCT 
CCAGGGTTAA 
CCCTGCACCT 
CAGCCAGCTA 
GATGAGGATG 
AGCTTCTATT 
TCACACCAAA 
ATTGCAACAA 
ATATGTGTAA 
TATTTTAAGT 
CCTCAGTAGA 
AGTTCATAGC 
TGACAAGATA 
ATTTAAGGCA 
AACATAAAAC 
AGTAATTGGC 
AGGAGACCTC 
AGATGGGAAG 
GAGGCTTAGA 
GAGGAAAGTG 
GAAGCCAGCT 
GAGCCAAGAA 
TTCAAAGGGC 



11 
1 

GCAGGTAGAA 
GGAAAAGGCC 
ATTAGCCCCT 
CAGGTTTTCC 
GTAAATTATT 
GGAAGGTCTT 
TCCACCTTCA 
TCCCACCTAT 
CAATGACAAG 
CTCATGAAAC 
GGGT CTGAGG 
GTGCCAAAAA 
GTAGTGTGAG 
TCCTTGAGGC 
CCCAAGGGAC 
ACTGGCAATT 
GCAGGTTAAT 
TAAATTACAG 
TAGTCATTGA 
AGAACTAGAA 
TTTATAGAAA 
GTATGCTAGG 
CTAGCAGGAA 
ATGACGGAGA 
TAGGGTGTCA 
AAAAGCATTT 
TGAATATAAA 
GTCTGATGCC 
TTAGTAGGGC 
GAGAACTCCA 
CTGAAAATTA 



21 

I 

AAGGCTTTTG 
AGGGCAGATG 
GAAAGTCCCT 
CAGATTAGCA 
CTGAATGTGT 
GGACTCTGAG 
CCAAGGCAGA 
TCATGTGTGC 
GACAGCAGGT 
CCTCCATGAA 
ATGAGGGTGG 
ATATCAGGTG 
TCATGTGTGA 
AGGGCTCATT 
CACACAGCCC 
CTAGTGTACT 
CCAGGGTTTC 
TCTGGATTTG 
ACTGGGAGTC 
CTCAGGCCAG 
TTTTAATTTA 
CACTTTGGAC 
GGTAATACAT 
TGGG CAG AG A 
AGTGATGTGA 
GGAAGGGACT 
GCCATCCTAT 
ATTTTCCAAA 
ATTTTTCCAG 
ATAAAATGGA 
TCCAAGCTTA 



31 

I 

GGTTTTCAGG 
TCTGGGTGGA 
GAAGTAGGAG 
ACCAGTCAGG 
GTAGTTTAAT 
ACAAGGGGTC 
CAAGGAGGGC 
AAGAGTGCCC 
GGCAAGGCTC 
GCCTGCTGCT 
CAGTGAAAAT 
GTGTTCATCA 
CAGGTGAGGA 
CATCTTATAA 
ATTCTGCTCC 
TTTTCATTAT 
AATGGGAGAT 
AAAGGACCTT 
CTGGAGAAGA 
AGCACTCTCA 
TTAGATGGAT 
AAATCAATGC 
ATATATAAAT 
AGGGCTGTGC 
GCTATGATGG 
GTGTAAGCAC 
AAGTCACAGG 
AGACCTAATA 
AACAGATATA 
GCAGAAGAAA 
TTTCATTTTT 



41 
I 

TGGGGGGCAG 
GTGAAGGGAA 
AAGGGTAAAG 
GGGAGGAAGG 
GGAATTGGGA 
TATAATCAGT 
CCACCTCAGC 
TGTCCCACAG 
AACAGGACTC 
CACCCCTCCC 
TAGGCCAGTG 
AATAAGCCGA 
ATGAAAACAG 
AAGCCAGCTG 
GTATACCAGG 
TAGAAATTAG 
AGAGAATAGT 
AGAGATGGTT 
TTGTTCAAAT 
GTAACACTGC 
CTCTACTGAG 
CCTAACGTAC 
AAATGAAATG 
ACTTTTGGGA 
AGGGGTATTT 
AGACCAGAAG 
CTTTCTACAT 
TGCGGACCTC 
AGGTGCCTTG 
TTGCCTTTTA 
AAATGTAATG 



51 
I 

TCTAGCCTGA 
AAAGTG AT CC 
GTGTGGTTGG 
TGAGAGTGGG 
AAAAGATGGG 
CCATTTCATT 
TCCTCTGCTC 
AACACGGGGA 
AGATGTCCCC 
TCAAGGCAAG 
ACATCATTTT 
GCCAACCGGT 
AGTGCCCGAG 
GCCATTGCCT 
TAAGTCTCTG 
CTAAAGGCAA 
GGAATATCTT 
AGGGCTCCCA 
G C CCATGGG A 
AATTTCCCCC 
CATTTATTCC 
TTACTTAACA 
CAAAGTAGAT 
GACTTGCTCA 
GGACAAGCAG 
CAAAACCATA 
GGTACTAGGA 
ATGTCCCTCA 
GGTAGGAAGG 
GCTCCTCCTC 
GGGGAGCTAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



207 



WO 02/086443 

GGGAGATGAA AGGCTTTCTC TTCTAAAGGG TCCTGAAATA AAATCTGTTT GGCATTGAAT 1920 

TTGTATCCAT CTTTCTTTAA TTGAATCACT GTGTCAGCTT TCTGTCTCTA GAAAAAAACA 1980 

CATTTGAAGC ATGAATTCTC AGCAGCAGAA GCAGCCTTGC ACCCCACCCC CTCAGCCTCA 2040 

GCAGCAGCAG GTGAAACAAC CTTGCCAGCC TCCACCCCAG GAACCATGCA TCCCCAAAAC 2100 

5 CAAGGAGCCC TGCCAACCCA AGGTGCCTGA GCCCTGCCAC CCCAAAGTGC CTGAGCCCTG 2160 

CCAGCCCAAG ATTCCAGAGC CCTGCCAGCC CAAGGTGCCT GAGCCCTGCC CTTCAACGGT 2220 

CACTCCAGCA CCAGCCCAGC AGAAGACCAA GCAGAAGTAA TGTGGTCCAC AGCCATGCCC 2280 

TTGAGGAGCT GGCCACTGGA TACTGAACAC CCTACTCCAT TCTGCTTATG AATCCCATTT 2340 

GCCTATTGAC CCTGCAGTTA GCATGCTGTC ACCCTGAATC ATAATCGCTC CTTTGCACCT 2400 

10 CTAAAAAGAT GTCCCTTACC CTCATTCTGG AGGCTCCTGA GCCTCTGCGT AAGGCTGAAC 24S0 

GTCTCACTGA CTGAGCTAGT CTTCTTGTTG CTCGGGTGCA TTTGAGGATG GATTTGGGGA 2520 
AGGTCAAGTG ACCATCCCTA G 



PCT/US02/12476 
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Seq ID NO: 51 Protein sequence; 
Protein Accession #:AAC26838 

1 11 21 31 41 51 

I I I I I I 

MNSQQQKQPC TPPPQPQQQQ VKQPCQPPPQ EPCIPKTKEP CQPKVPEPCH PKVPEPCQPK 
IPEPCQPKVP EPCPSTVTPA PAQQKTKQK 



Seq ID NO: 52 DNA sequence 

Nucleic Acid Accession #: NM_002638.1 

Coding sequence: 120-473 



CAATACAGCT 
GCTGGACTGC 
TGAGGGCCAG 
AGGCAGCTGT 
TCAATGGACA 
CGCAAGAGCC 
TCCGGTGCGC 
TCAAGAAGTG 
CGGTCCTTGC 
TGCTGCCCTT 
GAGCTGCCTC 



11 
I 

AAGGAATTAT 
ATAAAGATTG 
CAGCTTCTTG 
CACGGGAGTT 
AGATCCCGTT 
AGTCAAAGGT 
CATGT TGAAT 
CTGTGAAGGC 
TGCACCTGTG 
CCCCTTCCCA 
TCTCATCCAC 



21 

I 

CCCTTGTAAA 
GTATGGCCTT 
ATCGTGGTGG 
CCTGTTAAAG 
AAAGGACAAG 
CCAGTCTCCA 
CCCCCTAACC 
TCTTGCGGGA 
CCGTCCCCAG 
CACTGTCCAT 
TTT CCAATAA 



31 
I 

TACCACAGAC 
AGCTCTTAGC 
TGTTCCTCAT 
GTCAAGACAC 
TTTCAGTTAA 
CTAAGCCTGG 
GCTGCTTGAA 
TGGCCTGTTT 
AGCTACAGGC 
TCTTCCTCCC 
A 



41 
I 

CCGCCCTGGA 
CAAACACCTT 
CGCTGGGACG 
TGTCAAAGGC 
AGGTCAAGAT 
CTCCTGCCCC 
AGATACTGAC 
CGTTCCCCAG 
CCCATCTGGT 
ATTCAGGATG 



51 
1 

GCCAGGCCAA 
CCTGACACCA 
CTGGTTCTAG 
CGTGTTCCAT 
AAAGTCAAAG 
ATTATCTTGA 
TGCCCAGGAA 
TGAAGGGAGC 
CCTAAGTCCC 
CCCACGGCTG 



Seq ID NO: 53 Protein sequence: 
Protein Accession #: NP_002 62 9.1 

1 11 21 31 41 51 

I I I I I I 

MRASSFLIW VFLIAGTLVL EAAVTGVPVK GQDTVKGRVP FNGQDPVKGQ VSVKGQDKVK 
AQEPVKGPVS TKPGSCPIIL IRCAMLNPPN RCLKDTDCPG IKKCCEGSCG MACFVPQ 

Seq ID NO: 54 DNA sequence 

Nucleic Acid Accession #: NM_019618 

Coding sequence: 75-584 



GGCACGAGCC 
GAGACAACCA 
ATCAAT CAAT 
CCCTTCAGGG 
TTGCTGTTAT 
ATTTGGGAAT 
CATTGCAGCT 
CCTTCCTTTT 
CGGACTGGTT 
GGAAGTCATA 
GCAGCTTGGT 
AGTGTCATTT 
TAATGAAGAA 
GGAGAGCTGG 
CTGCATGAGT 
TGAAGATGCT 
CTCTGTTTCT 
CCAATATACC 
TAATTCTTGT 
AATAAACTTT 



11 

i 

ACGATTCAGT 
CACTATGAGA 
GTGTAAACCT 
TCAGAACCTT 
CACATGCAAG 
CCAGAATCCA 
AAAAG AG C AG 
CTACCGTGCC 
CATTGCCTCC 
CAACACTGCC 
CTTTGTCTTA 
TCACGCTGGT 
GAAGCAATTA 
GTGGTATAAG 
GACTTTAAGA 
TCAGAGCTCA 
GTTTTG CTTT 
TCATTGTGTG 
GTTAAGTTAA 
GTGTATTTAT 



21 
I 

CCCCTGGACT 
GGCACTCCAG 
ATTACTGGGA 
GTGGCAGTTC 
TATCCAGAGG 
GAAATGTGTT 
AAGATCATGG 
AAGACTGGTA 
TCCAAGAGAG 
TTTGAATTAA 
AAGTTTCTGG 
GCTGAGACAG 
CTTCATAGCA 
GCTGTCCTCT 
CTCAAAGACC 
TGCGCGTTAC 
ATTCCCTCTT 
TAATAGAACC 
ATCATTTTTG 
ATAATAAAAA 



31 
I 

GTAGATAAAG 
GAGACGCTGA 
CTATTAATGA 
CACGAAGTGA 
CTCTTGAGCA 
TGTATTGTGA 
ATCTGTATGG 
GGACCTCCAC 
ACCAGCCCAT 
ATATAAATGA 
TTCCCAATGT 
GGGCAAGGCT 
ACTGAAGAAC 
CAAGCTGGTG 
AAACACTGAG 
CCACGATGGC 
GGGATGATAT 
TTCTTAGCAT 
TCCTAATTGT 
AAAAAAAAAA 



41 

I 

ACCCTTTCTT 
TGGTGGAGGA 
TTTGAATCAG 
CAGTGTGACC 
AGGCAGAGGG 
GAAGGTTGGA 
CCAACCCGAG 
CCTTGAGTCT 
CATTCTGACT 
CTGAACT CAG 
GTTTTCGTCT 
GCTGTTATCA 
AGGATGTGGC 
CTGTGTAGGC 
CTTTCTTCTA 
ATGACTAGCA 
CATCCAGTCT 
TAAGACCTTG 
AATGTGTAAT 
AAA 



51 

] 

GCCAGGTGCT 
AGGGCCGTCT 
CAAGTGTGGA 
CCAGTCACTG 
GATCCCATTT 
GAACAGCCCA 
CCCGTGAAAC 
GTGGCCTTCC 
TCAGAACTTG 
CCTAGAGGTG 
ACATTTTCTT 
TCTCATTTTA 
CTCAGAAGCA 
CACAAGGCAT 
GGGGTGGGTA 
CAGAGCTGAT 
TTATATGTTG 
TAAACAAAAA 
CTTAAAGTTA 



Seq ID NO: 55 Protein sequence: 
Protein Accession #: NP_062564 



11 



21 



31 



41 



51 

1 



MRGTPGDADG GGRAVYQ SMC KPITGTINDL NQQVWTLQGQ NLVAVPRSDS VTPVTVAVIT 
CKYPEALEQG RGDPIYLGIQ NPEMCLYCEK VGEQPTLQLK EQKIMDLYGQ PEPVKPFLFY 
RAKTGRTSTL ESVAFPDWFI ASSKRDQPII LTSELGKSYN TAFELNIND 

Seq ID NO: 56 DNA sequence 

Nucleic Acid Accession #: NM_003125 

Coding sequence: 65-334 



60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



60 

120 



208 



WO 02/086443 



1 11 21 31 41 51 

I I i i I I 

AGCAGTTCTA AGGGACCATA CAGAGTATTC CTCTCTTCAC ACCAGGACCA GCCACTGTTG 60 

CAGCATGAGT TCCCAGCAGC AGAAGCAGCC CTGCATCCCA CCCCCTCAGC TTCAGCAGCA 120 

GCAGGTGAAA CAGCCTTGCC AGCCTCCACC TCAGGAACCA TGCATCCCCA AAACCAAGGA 180 
GCCCTGCCAC CCCAAGGTGC CTGAGCCCTG CCACCCCAAA GTGCCTGAGC CCTGCCAGCC * 240 

CAAGCTTCCA GAGCCATGCC ACCCCAAGGT GCCTGAGCCC TGCCCTTCAA TAGTCACTCC 3 00 

AGCACCAGCC CAGCAGAAGA CCAAGCAGAA GTAATGTGGT CCACAGCCAT GCCCTTGAGG 3 60 

AGCCGGCCAC CAGATGCTGA ATCCCCTATC CCATTCTGTG TATGAGTCCC ATTTGCCTTG 420 

CAATTAGCAT TCTGTCTCCC CCAAAAAAGA ATGTGCTATG AAGCTTTCTT TCCTACACAC 480 

TCTGAGTCTC TGAATGAAGC TGAAGGTCTT AGTACCAGAG CTAGTTTTCA GCTGCTCAGA 540 

ATTCATCTGA AGAGAGACTT AAGATGAAAG CAAATGATTC AGCTCCCTTA TACCCCCATT 600 
AAATTCACTT TCAATTCCA 



Seq ID NO: 57 Protein sequence: 
Protein Accession #: NP_003116 

1 11 21 31 41 51 

1 I I 1 I I 

MSSQQQKQPC IPPPQLQQQQ VKQPCQPPPQ EPCIPKTKEP CHPKVPEPCH PKVPEPCQPK 60 
LPEPCHPKVP EPCPSIVTPA PAQQKTKQK 

Seq ID NO: 58 DNA sequence 

Nucleic Acid Accession #: NM_001793.2 

Coding sequence: 71-2 560 

1 11 21 31 41 51 

I I I I I I 

AAAGGGG C AA GAGCTGAGCG GAACACCGGC CCGCCGTCGC GGCAGCTGCT TCACCCCTCT 60 

CTCTGCAGCC ATGGGGCTCC CTCGTGGACC TCTCGCGTCT CTCCTCCTTC TCCAGGTTTG 120 

CTGGCTGCAG TGCGCGGCCT CCGAGCCGTG CCGGGCGGTC TT CAGGGAGG CTGAAGTGAC 180 

CTTGGAGGCG GGAGGCGCGG AGCAGGAGCC CGGCCAGGCG CTGGGGAAAG TATT CATGGG 240 

CTGCCCTGGG CAAGAGCCAG GTCTGTTTAG CACTGATAAT GATGACTTCA CTGTGCGGAA 300 

TGGCGAGACA GTCCAGGAAA GAAGGTCACT GAAGGAAAGG AATCCATTGA AGATCTTCCC 360 

ATCCAAACGT ATCTTACGAA GACACAAGAG AGATTGGGTG GTTGCTCCAA TATCTGTCCC 420 

TGAAAATGGC AAGGGTCCCT TCCCCCAGAG ACTGAAT CAG CTCAAGTCTA ATAAAGATAG 480 

AGACACCAAG ATTTTCTACA GCAT C ACGGG GCCGGGGGCA GACAGCCCCC CTGAGGGTGT 540 

CTTCGCTGTA GAGAAGGAGA CAGGGTGGTT GTTGTTGAAT AAGCCACTGG ACCGGGAGGA 600 

GATTGCCAAG TATGAGCTCT TTGGCCACGC TGTGTCAGAG AATGGTGCCT CAGTGGAGGA 660 

CCCCATGAAC AT CTCCAT CA TCGTGACCGA CCAGAATGAC CACAAGCCCA AGTTTACCCA 720 

GGACACCTTC CGAGGGAGTG TCTTAGAGGG AGTCCTACCA GGTACTTCTG TGATGCAGGT 780 

GACAGCCACG GATGAGGATG ATG CCATCTA CACCTACAAT GGGGTGGTTG CTTACTCCAT 840 

CCATAGCCAA GAACCAAAGG ACCCACACGA CCTCATGTTC ACCATTCACC GGAGCACAGG 900 

CAGCAT CAGC GTCATCTCCA GTGGCCTGGA CCGGGAAAAA GTCCCTGAGT ACACACTGAC 960 

CATCCAGGCC ACAGACATGG ATGGGGACGG CTCCACCACC ACGGCAGTGG CAGTAGTGGA 1020 

GATCCTTGAT GCCAATGACA ATGCTCCCAT GTTTGACCCC CAGAAGTACG AGGCCCATGT 10 80 

GCCTGAGAAT GCAGTGGGCC ATGAGGTGCA GAGGCTGACG GTCACTGATC TGGACGCCCC 1140 

CAACTCACCA GCGTGGCGTG CCACCTACCT TAT C ATGGGC GGTGACGACG GGGACCATTT 12 00 

TACCATCACC ACCCACCCTG AGAGCAACCA GGGCATCCTG ACAACCAGGA AGGGTTTGGA 1260 

TTTTGAGGCC AAAAACC AG C ACACCCTGTA CGTTGAAGTG ACCAACGAGG CCCCTTTTGT 1320 

GCTGAAGCTC CCAACCTCCA CAGCCACCAT AGTGGT CCAC GTGGAGGATG TGAATGAGGC 1380 

ACCTGTGTTT GTCCCACCCT CCAAAGTCGT TGAGGTCCAG GAGGGCATCC CCACTGGGGA 1440 

GCCTGTGTGT GTCTACACTG CAGAAGACCC TGACAAGGAG AATCAAAAGA TCAGCTACCG 1500 

CATCCTGAGA GACCCAGCAG GGTGGCTAGC CATGGACCCA GACAGTGGGC AGGTCACAGC 1560 

TGTGGGCACC CTCGACCGTG AGGATGAGCA GTTTGTGAGG AACAACATCT ATGAAGTCAT 1620 

GGTCTTGGCC ATGGACAATG GAAGCCCTCC CACCACTGGC ACGGGAACCC TTCTGCTAAC 1680 

ACTGATTGAT GTCAATGACC ATGGCCCAGT CCCTGAGCCC CGTCAGATCA CCATCTGCAA 1740 

CCAAAGCCCT GTGCGCGAGG TGCTGAACAT CACGGACAAG GACCTGTCTC CCCACACCTC 1800 

CCCTTTCCAG GCCCAGCTCA CAGATGACTC AGACAT CT AC TGGACGGCAG AGGTCAACGA 1860 

GGAAGGTGAC ACAGTGGT CT TGTCCCTGAA GAAGTTCCTG AAGCAGGATA CATATGACGT 1920 

GCACCTTTCT CTGTCTGACC ATGGCAACAA AGAGCAGCTG ACGGTGATCA GGGCCACTGT 1980 

GTGCGACTGC CATGGCCATG TCGAAACCTG CCCTGGACCC TGGAAGGGAG GTTTCATCCT 2040 

CCCTGTGCTG GGGGCTGTCC TGGCTCTGCT GTTCCTCCTG CTGGTGCTGC TTTTGTTGGT 2100 

GAGAAAGAAG CGGAAGATCA AGGAGCCCCT CCTACTCCCA GAAGATGACA CCCGTGACAA 2160 

CGTCTTCTAC TATGGCGAAG AGGGGGGTGG CGAAGAGGAC CAGGACTATG ACATCACCCA 222 0 

GCTCCACCGA GGTCTGGAGG CCAGGCCGGA GGTGGTTCTC CGCAATGACG TGGCACCAAC 22 80 

CATCATCCCG ACACCCATGT ACCGTCCTCG GCCAGCCAAC CCAGATGAAA TCGGCAACTT 2340 

TATAATTGAG AACCTGAAGG CGGCTAACAC AGACCCCACA GCCCCGCCCT ACGACACCCT 2400 

CTTGGTGTTC GACTATGAGG GCAGCGGCTC CGACGCCGCG TCCCTGAGCT CCCTCACCTC 2460 

CTCCGCCTCC GACCAAGACC AAGATTACGA TTATCTGAAC GAGTGGGGCA GCCGCTTCAA 2520 

GAAGCTGGCA GACATGTACG GTGGCGGGGA GGACGACTAG GCGGCCTGCC TGCAGGGCTG 2 580 

GGGACCAAAC GTCAGGCCAC AG AGCATCT C CAAGGGGTCT CAGTTCCCCC TTCAGCTGAG 2 640 

GACTTCGGAG CTTGTCAGGA AGTGGCCGTA GCAACTTGGC GGAGACAGGC TATGAGTCTG 2700 

ACGTTAGAGT GGTTGCTTCC TTAGCCTTTC AGGATGGAGG AATGTGGGCA GTTTGACTTC 2760 

AGCACTGAAA ACCTCTCCAC CTGGGCCAGG GTTGCCTCAG AGGCCAAGTT TCCAGAAGCC 2 82 0 

TCTTACCTGC CGTAAAATGC TCAACCCTGT GTCCTGGGCC TGGGCCTGCT GTG ACT G ACC 2880 

TACAGTGGAC TTTCTCTCTG GAATGGAACC TTCTTAGGCC TCCTGGTGCA ACTTAATTTT 2 940 

TTTTTTTAAT GCTATCTTCA AAACGTTAGA GAAAGTTCTT CAAAAGTGCA GCCCAGAGCT 3000 

GCTGGGCCCA CTGGCCGTCC TGCATTTCTG GTTTCCAGAC CCCAATGCCT CCCATTCGGA 3060 

TGGATCTCTG CGTTTTTATA CTGAGTGTGC CTAGGTTGCC CCTTATTTTT TATTTTCCCT 3120 

GTTGCGTTGC TATAGATGAA GGGTGAGGAC AATCGTGTAT ATGTACTAGA ACTTTTTTAT 3180 
TAAAGAAACT TTTCCCAGAA AAAAA 

Seq ID NO: 59 Protein sequence: 



209 



WO 02/086443 

Protein Accession #: NP_001784.2 



PCT/US02/12476 



1 11 21 31 41 51 

<- I I 1 I I I 

0 MGLPRGPLAS LLLLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEPGQA LGKVFMGCPG 60 

QEPALFSTDN DDFTVRNGET VQERRSLKER NPLKIFPSKR ILRRHKRDWV VAPISVPENG 120 

KGPFPQRLNQ LKSNKDRDTK IFYSITGPGA DSPPEGVFAV EKETGWLLLN KPIiDREE I AK 180 

YELFGHAVSE NGASVEDPMN ISIIVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 240 

DEDDAIYTYN GWAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA 300 

10 TDMDGDGSTT TAVAWEILD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAPNSP 360 

AWRATYLIMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVLKL 420 

PTSTATIWH VEDVNEAPVF VPPSKWEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRILR 480 

DPAGWLAMDP DSGQVTAVGT LDREDEQ FVR NNIYEVMVLA MDNGSPPTTG TGTLLLTLID 540 

VNDHGPVPEP RQITICNQSP VRQVLNITDK DkSPHTSPFQ AQLTDDSDIY WTAEVNEEGD 60 0 

15 TWLSLKKFIi KQDTYDVHLS LSDHGNKEQL TV I RAT V CDC HGHVETCPGP WKGGFILPVL 660 

GAVLALLFDL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLHR 72 0 

GLEARPEWL RNDVAPTIIP TPMYRPRPAN PDEIGNFIIE NLKAANTDPT APPYDTDDVF 780 
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKIiA DMYGGGEDD 

20 Seq ID NO: 60 DNA sequence 

Nucleic Acid Accession #: Eos sequence 
Coding sequence: 162-428 

_ 1 11 21 31 41 51 

25 | i i I I I 

GCGTTCCGTT GGCGGCGGAT TCGAACGTTC GGACTGAGGT TTTTCTGCCT GAAGAAGCGT 60 

CATACGGACC GGATTGTTTT CGCTGGCCCA GTGTCCCCGG AGCTTGTGTG CGATACAGAG 12 0 

AGCACCT CGG AAGCTGAGGC AGCTGGTACT TGACAGAGAG GATGGCGCTG T CGACCAT AG 180 

TCTCCCAGAG GAAGCAGATA AAGCGGAAGG CTCCCCGTGG CTTTCTAAAG CGAGTCTTCA 240 

30 AGCGAAAGAA GCCTCAACTT CGTCTGGAGA AAAGTGGTGA CTTATTGGTC CATCTGAACT 30 0 

GTTTACTGTT TGTTCATCGA TTAGCAGAAG AGTCCAGGAC AAACGCTTGT GCGAGTAAAT 360 

GTAGAGTCAT TAACAAGGAG CATGTACTGG CCGCAGCAAA GG TAATTCTA AAGAAGAGCA 420 

GAGGTTAGAA GTCAAAGAAC ATATTCTTGA AAGTTATGAT GCATTCTTTT GGGTGGTAAC 480 
AG AT CAT AAA GACATTTTTT ACACATCAGT TAATATGGGA TTATTAAATA TTGG 
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Seq ID NO: 61 Protein sequence: 
Protein Accession #: Eos sequence 



1 11 21 31 41 51 
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MALSTIVSGR KQIKRKAPRG FLKRVFKRKK PQLRLEKSGD LLVHLNCLLF VHRLAEESRT 60 
NACASKCRVI NKEHVLAAAK VILKKSRG 

Seq ID NO: 62 DNA sequence 
45 Nucleic Acid Accession #: NM_000094.2 
Coding sequence: 99-8933 

1 11 21 31 41 51 

<n I 1 1 1 1 1 

DU GGGCTGGAGG GGCGCTGGGC TCGGACCTGC CAAGGCCACC GCAGGGGGGA GCAAGGGACA 60 

GAGGCGGGGG TCCTAGCTGA CGGCTTTTAC TGCCTAGGAT GACGCTGCGG CTTCTGGTGG 120 

CCGCGCTCTG CGCCGGGATC CTGGCAGAGG CGCCCCGAGT GCGAGCCCAG CACAGGGAGA 180 

GAGTGACCTG CACGCGCCTT TACGCCGCTG ACATTGTGTT CTTACTGGAT GGCTCCTCAT 240 

CCATTGGCCG CAGCAATTTC CGCGAGGTCC GCAGCTTTCT CGAAGGGCTG GTGCTGCCTT 300 

55 TCTCTGGAGC AGCCAGTGCA CAGGGTGTGC GCTTTGCCAC AGTGCAGTAC AGCGATGACC 360 

CACGGACAGA GTTCGGCCTG GATGCACTTG GCTCTGGGGG TGATGTGATC CGCGCCATCC 420 

GTGAGCTTAG CTACAAGGGG GGCAACACTC GCACAGGGGC TGCAATTCTC CATGTGGCTG 480 

ACCATGTCTT CCTGCCCCAG CTGGCCCGAC CTGGTGTCCC CAAGGTCTGC ATCCTGATCA 540 

CAGACGGGAA GTCCCAGGAC CTGGTGGACA CAGCTGCCCA AAGGCTGAAG GGGCAGGGGG 600 

60 TCAAGCTATT TGCTGTGGGG AT CAAGAATG CTGACCCTGA GGAGCTGAAG CGAGTTGCCT 660 

CACAGCCAAC CTCCGACTTC TTCTTCTTCG TCAATGACTT CAGCATCTTG AGGACACTAC 720 

TGCCCCTCGT TTCCCGGAGA GTGTGCACGA CTGCTGGTGG CGTGCCTGTG ACCCGACCTC 780 

CGGATGACTC GACCTCTGCT CCACGAGACC TGGTGCTGTC TGAGCCAAGC AGCCAATCCT 840 

TGAGAGTACA GTGGACAGCG GCCAGTGGCC CTGTGACTGG CTACAAGGTC CAGTACACTC 900 

65 CTCTGACGGG GCTGGGACAG CCACTGCCGA GTGAGCGGCA GGAGGTGAAC GTCCCAGCTG 960 

GTGAGACCAG TGTGCGGCTG CGGGGTCTCC GGCCACTGAC CGAGTACCAA GTGACTGTGA 102 0 

TTGCCCTCTA CGCCAACAGC AT CGGGGAGG CTGTGAGCGG GACAGCTCGG ACCACTGCCC 1080 

TAGAAGGGCC GGAACTGACC ATCCAGAATA CCACAGCCCA CAGCCTCCTG GTGGCCTGGC 1140 

GGAGTGTGCC AGGTGCCACT GGCTACCGTG TGACATGGCG GGTCCTCAGT GGTGGGCCCA 12 00 

70 CACAGCAGCA GGAGCTGGGC CCTGGGCAGG GTTCAGTGTT GCTGCGTGAC TTGGAGCCTG 12 60 

GCACGGACTA TGAGGTGACC GTGAGCACCC TATTTGGCCG CAGTGTGGGG CCCGCCACTT 132 0 

CCCTGATGGC TCGCACTGAC GCTTCTGTTG AGCAGACCCT GCGCCCGGTC ATCCTGGGCC 13 80 

CCACATCCAT CCTCCTTTCC TGGAACTTGG TGCCTGAGGC CCGTGGCTAC CGGTTGGAAT 1440 

GGCGGCGTGA GACTGGCTTG GAGCCACCGC AGAAGGTGGT ACTGCCCTCT GATGTGACCC 1500 

75 GCTACCAGTT GGATGGGCTG CAGCCGGGCA CTGAGTACCG CCTCACACTC TACACTCTGC 1560 

TGGAGGGCCA CGAGGTGGCC ACCCCTGCAA CCGTGGTTCC CACTGGACCA GAGCTGCCTG 1620 

TGAGCCCTGT AACAGACCTG CAAGCCACCG AGCTGCCCGG GCAGCGGGTG CGAGTGTCCT 1680 

GGAGCCCAGT CCCTGGTGCC ACCCAGTACC GCAT CATTGT GCGCAGCACC CAGGGGGTTG 1740 

AGCGGACCCT GGTGCTTCCT GGGAGTCAGA CAGCATTCGA CTTGGATGAC GTTCAGGCTG 1800 

80 GGCTTAGCTA CACTGTGCGG GTGTCTGCTC GAGTGGGTCC CCGTGAGGGC AGTGCCAGTG 1860 

TCCTCACTGT CCGCCGGGAG CCGGAAACTC CACTTGCTGT TCCAGGGCTG CGGGTTGTGG 1920 

TGTCAGATGC AACGCGAGTG AGGGTGGCCT GGGGACCCGT CCCTGGAGCC AGTGGATTTC 1980 

GGATTAGCTG GAGCACAGGC AGTGGTCCGG AGTCCAGCCA GACACTGCCC CCAGACTCTA 2 040 

CTGCCACAGA CATCACAGGG CTGCAGCCTG GAACCACCTA CCAGGTGGCT GTGTCGGTAC 2100 

85 TGCGAGGCAG AGAGGAGGGC CCTGCTGCAG TCATCGTGGC TCGAACGGAC CCACTGGGCC 2160 

CAGTGAGGAC GGTCCATGTG ACTCAGGCCA GCAGCTCATC TGTCACCATT ACCTGGACCA 2220 

GGGTTCCTGG CGCCACAGGA TACAGGGTTT CCTGGCACTC AGCCCACGGC CCAGAGAAAT 2280 
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CCCAGTTGGT TTCTGGGGAG GCCACGGTGG CTGAGCTGGA TGGACTGGAG CCAGATACTG 2340 

AGTATACGGT GCATGTGAGG GCCCATGTGG CTGGCGTGGA TGGGCCCCCT GCCTCTGTGG 2400 

TTGTGAGGAC TGCCCCTGAG CCTGTGGGTC GTGTGTCGAG GCTGCAGATC CTCAATGCTT 2460 

CCAGCGACGT TCTACGGATC ACCTGGGTAG GGGTCACTGG AGCCACAGCT TACAGACTGG 2520 

5 CCTGGGGCCG GAGTGAAGGC GGCCCCATGA GGCACCAGAT ACTCCCAGGA AACACAGACT 2580 

CTGCAGAGAT CCGGGGTCTC GAAGGTGGAG TCAGCTACTC AGTGCGAGTG ACTGCACTTG 2640 

TCGGGGACCG CGAGGGCACA CCTGTCTCCA TTGTTGTCAC TACGCCGCCT GAGGCTCCGC 2700 

CAGCCCTGGG GACGCTTCAC GTGGTGCAGC GCGGGGAGCA CTCGCTGAGG CTGCGCTGGG 2760 

AGCCGGTGCC CAGAGCGCAG GGCTTCCTTC TGCACTGGCA ACCTGAGGGT GGCCAGGAAC 2820 

10 AGTCCCGGGT CCTGGGGCCG GAGCTCAGCA GCTATCACCT GGACGGGCTG GAGCCAGCGA 2880 

CACAGTACCG CGTGAGGCTG AGTGTCCTAG GGCCGGCTGG AGAAGGGCCC TCTGCAGAGG 2940 

TGACTGCGCG CACTGAGTCA CCTCGTGTTC CAAGCATTGA ACTACGTGTG GTGGACACCT 3000 

CGAT CGACTC GGTGACTTTG GCCTGGACTC CAGTGTCCAG GGCATCCAGC TACATCCTAT 3060 

CCTGGCGGCC ACTCAGAGGC CCTGGCCAGG AAGTGCCTGG GTCCCCGCAG ACACTTCCAG 3120 

15 GGATCTCAAG CTCCCAGCGG GTGACAGGGC TAGAGCCTGG CGTCTCTTAC ATCTTCTCCC 3180 

TGACGCCTGT CCTGGATGGT GTGCGGGGTC CTGAGGCATC TGT CACACAG ACGCCAGTGT 3240 

GCCCCCGTGG CCTGGCGGAT GTGGTGTTCC TACCACATGC CACTCAAGAC AATGCTCACC 3300 

GTGCGGAGGC TACGAGGAGG GTCCTGGAGC GTCTGGTGTT GGCACTTGGG CCTCTTGGGC 3360 

CACAGGCAGT TCAGGTTGGC CTGCTGTCTT ACAGTCATCG GCCCTCCCCA CTGTTCCCAC 342 0 

20 TGAATGGCTC CCATGACCTT GGCATTATCT TGCAAAGGAT CCGTGACATG CCCTACATGG 3480 

ACCCAAGTGG GAACAACCTG GGCACAGCCG TGGTCACAGC TCACAGATAC ATGTTGGCAC 3540 

CAGATGCTCC TGGGCGCCGC CAGCACGTAC CAGGGGTGAT GGTTCTGCTA GTGGATGAAC 3600 

CCTTGAGAGG TGACATATTC AGCCCCATCC GTGAGGCCCA GGCTTCTGGG CTTAATGTGG 3660 

TGATGTTGGG AATGGCTGGA GCGGACCCAG AGCAGCTGCG TCGCTTGGCG CCGGGTATGG 372 0 

25 ACTCTGTCCA GACCTTCTTC GCCGTGGATG ATGGGCCAAG CCTGGACCAG GCAGTCAGTG 3780 

GTCTGGCCAC AGCCCTGTGT CAGGCATCCT TCACTACTCA GCCCCGGCCA GAGCCCTGCC 3840 

CAGTGTATTG TCCAAAGGGC CAGAAGGGGG AACCTGGAGA GATGGGCCTG AGAGGACAAG 3900 

TTGGGCCTCC TGGCGACCCT GGCCTCCCGG GCAGGACCGG TGCTCCCGGC CCCCAGGGGC 3 960 

CCCCTGGAAG TGCCACTGCC AAGGGCGAGA GGGGCTTCCC TGGAGCAGAT GGGCGTCCAG 4020 

30 GCAGCCCTGG CCGCGCCGGG AATCCTGGGA CCCCTGGAGC CCCTGGCCTA AAGGGCTCTC 4080 

CAGGGTTGCC TGGCCCTCGT GGGGACCCGG GAGAGCGAGG ACCTCGAGGC CCAAAGGGGG 414 0 

AGCCGGGGGC TCCCGGACAA GT CAT CGG AG GTGAAGGACC TGGGCTTCCT GGGCGGAAAG 4200 

GGGACCCTGG ACCATCGGGC CCCCCTGGAC CTCGTGGACC ACTGGGGGAC CCAGGACCCC 42 60 

GTGGCCCCCC AGGGCTTCCT GGAACAGCCA TGAAGGGTGA CAAAGGCGAT CGTGGGGAGC 4320 

35 GGGGTCCCCC TGGACCAGGT GAAGGTGGCA TTGCTCCTGG GGAGCCTGGG CTGCCGGGTC 4380 

TTCCCGGAAG CCCTGGACCC CAAGGCCCCG TTGGCCCCCC TGGAAAGAAA GGAGAAAAAG 4440 

GTGACTCTGA GGATGGAGCT CCAGGCCTCC CAGGACAACC TGGGTCTCCG GGTGAGCAGG 4500 

GCCCACGGGG ACCTCCTGGA GCTATTGGCC CCAAAGGTGA CCGGGGCTTT CCAGGGCCCC 4560 

TGGGTGAGGC TGGAGAGAAG GGCGAACGTG GACCCCCAGG CCCAGCGGGA TCCCGGGGGC 462 0 

40 TGCCAGGGGT TGCTGGACGT CCTGGAGCCA AGGGTCCTGA AGGGCCACCA GGACCCACTG 4680 

GCCGCCAAGG AGAGAAGGGG GAGCCTGGTC GCCCTGGGGA CCCTGCAGTG GTGGGACCTG 4740 

CTGTTGCTGG ACCCAAAGGA GAAAAGGGAG ATGTGGGGCC CGCTGGGCCC AGAGGAGCTA 480 0 

CCGGAGTCCA AGGGGAACGG GGCCCACCCG GCTTGGTTCT TCCTGGAGAC CCTGGCCCCA 4860 

AGGGAGACCC TGGAGACCGG GGTCCCATTG GCCTTACTGG CAGAGCAGGA CCCCCAGGTG 4 920 

45 ACTCAGGGCC TCCTGGAGAG AAGGGAGACC CTGGGCGGCC TGGCCCCCCA GGACCTGTTG 4980 

GCCCCCGAGG ACGAGATGGT GAAGTTGGAG AGAAAGGTGA CGAGGGTCCT CCGGGTGACC 5040 

CGGGTTTGCC TGGAAAAGCA GGCGAGCGTG GCCTTCGGGG GGCACCTGGA GTTCGGGGGC 5100 

CTGTGGGTGA AAAGGGAGAC CAGGGAGATC CTGGAGAGGA TGGACGAAAT GGCAGCCCTG 5160 

GAT CAT CTGG ACCCAAGGGT GACCGTGGGG AGCCGGGTCC CCCAGGACCC CCGGGACGGC 5220 

50 TGGTAGACAC AGGACCTGGA GCCAGAGAGA AGGGAGAGCC TGGGGACCGC GGACAAGAGG 5280 

GT CCTCGAGG GCCCAAGGGT GATCCTGGCC TCCCTGGAGC CCCTGGGGAA AGGGGCATTG 5340 

AAGGGTTTCG GGGACCCCCA GGCCCACAGG GGGACCCAGG TGTCCGAGGC CCAGCAGGAG 5400 

AAAAGGGTGA CCGGGGTCCC CCTGGGCTGG ATGGCCGGAG CGGACTGGAT GGGAAACCAG 5460 

GAGCCGCTGG GCCCTCTGGG CCGAATGGTG CTGCAGGCAA AGCTGGGGAC CCAGGGAGAG 552 0 

55 ACGGGCTTCC AGGCCTCCGT GGAGAACAAG GCCTCCCTGG CCCCTCTGGT CCCCCTGGAT 5580 

TACCGGGAAA GCCAGGCGAG GATGGGAAAC CTGGCCTGAA TGGAAAAAAC GGAGAACCTG 5640 

GGGACCCTGG AGAAGACGGG AGGAAGGGAG AGAAAGGAGA TTCAGGCGCC TCTGGGAGAG 570 0 

AAGGTCGTGA TGGCCCCAAG GGTGAGCGTG GAGCTCCTGG TATCCTTGGA CCCCAGGGGC 5760 

CTCCAGGCCT CCCAGGGCCA GTGGGCCCTC CTGGCCAGGG TTTTCCTGGT GTCCCAGGAG 5820 

60 GCACGGGCCC CAAGGGTGAC CGTGGGGAGA CTGGAT CCAA AGGGGAGCAG GGCCTCCCTG 5880 

GAGAGCGTGG CCTGCGAGGA GAGCCTGGAA GTGTGCCGAA TGTGGATCGG TTGCTGGAAA 5940 

CTGCTGGCAT CAAGGCATCT GCCCTGCGGG AGATCGTGGA GACCTGGGAT GAGAGCTCTG 600 0 

GTAGCTTCCT GCCTGTGCCC GAACGGCGTC GAGGCCCCAA GGGGGACTCA GGCGAACAGG 6060 

GCCCCCCAGG CAAGGAGGGC CCCATCGGCT TTCCTGGAGA ACGCGGGCTG AAGGGCGACC 612 0 

65 GTGGAGACCC TGGCCCTCAG GGGCCACCTG GTCTGGCCCT TGGGGAGAGG GGCCCCCCCG 6180 

GGCCTTCCGG CCTTGCCGGG GAGCCTGGAA AGCCTGGTAT TCCCGGGCTC CCAGGCAGGG 6240 

CTGGGGGTGT GGGAGAGGCA GGAAGGCCAG GAGAGAGGGG AGAACGGGGA GAGAAAGGAG 63 0 0 

AACGTGGAGA ACAGGGCAGA GATGGCCCTC CTGGACTCCC TGGAACCCCT GGGCCCCCCG 6360 

GACCCCCTGG CCCCAAGGTG TCTGTGGATG AGCCAGGTCC TGGACTCTCT GGAGAACAGG 6420 

70 GACCCCCTGG ACTCAAGGGT GCTAAGGGGG AGCCGGGCAG CAATGGTGAC CAAGGTCCCA 648 0 

AAGGAGACAG GGGTGTGCCA GGCATCAAAG GAGACCGGGG AGAGCCTGGA CCGAGGGGTC 654 0 

AGGACGGCAA CCCGGGTCTA CCAGGAGAGC GTGGTATGGC TGGGCCTGAA GGGAAGCCGG 6600 

GTCTGCAGGG TCCAAGAGGC CCCCCTGGCC CAGTGGGTGG TCATGGAGAC CCTGGACCAC 6660 

CTGGTGCCCC GGGTCTTGCT GGCCCTGCAG GACCCCAAGG ACCTTCTGGC CTGAAGGGGG 6720 

75 AGCCTGGAGA GACAGGACCT CCAGGACGGG GCCTGACTGG ACCTACTGGA GCTGTGGGAC 6780 

TTCCTGGACC CCCCGGCCCT TCAGGCCTTG TGGGTCCACA GGGGTCTCCA GGTTTGCCTG 6840 

GACAAGTGGG GGAGACAGGG AAGCCGGGAG CCCCAGGTCG AGATGGTGCC AGTGGAAAAG 6900 

ATGGAGACAG AGGGAGCCCT GGTGTGCCAG GGTCACCAGG TCTGCCTGGC CCTGTCGGAC 6960 

C T AAAGG AG A ACCTGGCCCC ACGGGGGCCC CTGGACAGGC TGTGGTCGGG CTCCCTGGAG 702 0 

80 CAAAGGGAGA GAAGGGAGCC CCTGGAGGCC TTGCTGGAGA CCTGGTGGGT GAGCCGGGAG 7080 

CCAAAGGTGA CCGAGGACTG CCAGGGCCGC GAGGCGAGAA GGGTGAAGCT GGCCGTGCAG 7140 

GGGAGCCCGG AGACCCTGGG GAAGATGGTC AGAAAGGGGC TCCAGGACCC AAAGGTTTCA 7200 

AGGGTGACCC AGGAGT CGGG GTCCCGGGCT CCCCTGGGCC TCCTGGCCCT CCAGGTGTGA 72 60 

AGGGAGATCT GGGCCTCCCT GGCCTGCCCG GTGCTCCTGG TGTTGTTGGG TTCCCGGGTC 732 0 

85 AGACAGGCCC T CGAGGAG AG ATGGGTCAGC CAGGCCCTAG TGGAGAGCGG GGTCTGGCAG 73 80 

GCCCCCCAGG GAGAGAAGGA ATCCCAGGAC CCCTGGGGCC ACCTGGACCA CCGGGGTCAG 7440 

TGGGACCACC TGGGGCCTCT GGACTCAAAG GAGACAAGGG AGACCCTGGA GTAGGGCTGC 7500 
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CTGGGCCCCG AGGCGAGCGT GGGGAGCCAG GCATCCGGGG TGAAGATGGC CGCCCCGGCC 7560 

AGGAGGGACC CCGAGGACTC ACGGGGCCCC CTGGCAGCAG GGGAGAGCGT GGGGAGAAGG 7620 

GTGATGTTGG GAGTGCAGGA CTAAAGGGTG ACAAGGGAGA CTCAGCTGTG AT CCTGGGGC 7680 

CTCCAGGCCC ACGGGGTGCC AAGGGGGACA TGGGTGAACG AGGGCCTCGG GGCTTGGATG 7740 

GTGACAAAGG ACCTCGGGGA GACAATGGGG ACCCTGGTGA CAAGGGCAGC AAGGGAGAGC 7800 

CTGGTGACAA GGGCTCAGCC GGGTTGCCAG GACTGCGTGG ACTCCTGGGA CCCCAGGGTC 7860 

AACCTGGTGC AGCAGGGATC CCTGGTGACC CGGGATCCCC AGGAAAGGAT GGAGTGCCTG 7 920 

GTATCCGAGG AGAAAAAGGA GATGTTGGCT TCATGGGTCC CCGGGGCCTC AAGGGTGAAC 7980 

GGGGAGTGAA GGGAGCCTGT GGCCTTGATG GAGAGAAGGG AGACAAGGGA GAAGCTGGTC 8040 

CCCCAGGCCG CCCCGGGCTG GCAGGACACA AAGGAGAGAT GGGGGAGCCT GGTGTGCCGG 8100 

GCCAGTCGGG GGCCCCTGGC AAGGAGGGCC TGATCGGTCC CAAGGGTGAC CGAGGCTTTG 8160 

ACGGGCAGCC AGGCCCCAAG GGTG AC CAGG GCGAGAAAGG GGAGCGGGGA ACCCCAGGAA 8220 

TTGGGGGCTT CCCAGGCCCC AGTGGAAATG ATGGCTCTGC TGGTCCCCCA GGGCCACCTG 8280 

GCAGTGTTGG TCCCAGAGGC CCCGAAGGAC TTCAGGGCCA GAAGGGTGAG CGAGGTCCCC 8340 

CCGGAGAGAG AGTGGTGGGG GCTCCTGGGG TCCCTGGAGC TCCTGGCGAG AGAGGGGAGC 8400 

AGGGGCGGCC AGGGCCTGCC GGTCCTCGAG GCGAGAAGGG AGAAGCTGCA CTGACGGAGG 8460 

ATGACATCCG GGGCTTTGTG CGCCAAGAGA TGAGT CAGCA CTGTGCCTGG CAGGGCCAGT 8520 

TCATCGCATC TGGATCACGA CCCCTCCCTA GTTATGCTGC AGACACTGCC GGCTCCCAGC 8580 

TCCATGCTGT GCCTGTGCTC CGCGTCTCTC ATGCAGAGGA GGAAGAGCGG GTACCCCCTG 8640 

AGGATGATGA GTACTCTGAA TACTCCGAGT ATTCTGTGGA GGAGTACCAG GACCCTGAAG 8700 

CTCCTTGGGA TAGTGATGAC CCCTGTTCCC TGCCACTGGA TGAGGGCTCC TGCACTGCCT 8760 

ACACCCTGCG CTGGTACCAT CGGGCTGTGA CAGGCAGCAC AGAGGCCTGT CACCCTTTTG 8820 

TCTATGGTGG CTGTGGAGGG AATGCCAACC GTTTTGGGAC CCGTGAGGCC TGCGAGCGCC 8880 

GCTGCCCACC CCGGGTGGTC CAGAGCCAGG GGACAGGTAC TGCCCAGGAC TGAGGCCCAG 8940 

ATAATGAGCT GAGATTCAGC ATCCCCTGGA GGAGTCGGGG TCTCAGCAGA ACCCCACTGT 9000 

CCCTCCCCTT GGTG CTAG AG GCTTGTGTGC ACGTGAGCGT GCGAGTGCAC GTCCGTTATT 9060 

TCAGTGACTT GGTCCCGTGG GTCTAGCCTT CCCCCCTGTG GACAAACCCC CATTGTGGCT 9120 

CCTGCCACCC TGGCAGATGA CTCACTGTGG GGGGGTGGCT GTGGGCAGTG AGCGGATGTG 9180 

ACTGGCGTCT GACCCGCCCC TTGACCCAAG CCTGTGATGA CATGGTGCTG ATTCTGGGGG 9240 
GCATTAAAGC TGCTGTTTTA AAAGGCAAAA AA 

Seq ID NO: 63 Protein sequence: 
Protein Accession #: NP_000085.1 

1 11 21 31 41 51 

l I I I I I 

MTLRLLVAAL CAG I LAE APR VRAQHRERVT CTRLYAADIV FLLDGSSSIG RSNFREVRSF 60 

LEGLVDPFSG AASAQGVRFA TVQYSDDPRT EFGLDALGSG GDVTRAIREL SYKGGNTRTG 120 

AAILHVADHV FLPQLARPGV PKVCILITDG KSQDLVDTAA QRLKGQGVKL FAVGIKNADP 180 

EELKRVASQP TSDFFFFVND FSILRTLLPL VSRRVCTTAG GVPVTRPPDD STSAPRDIiVL 240 

SEPSSQSLRV QWTAASGPVT GYKVQYTPLT GLGQPLPSER QEVNVPAGET SVRLRGLRPL 300 

TEYQVTVIAL YANSIGEAVS GTARTTALEG PELTI QNTTA HSLLVAWRSV PGATGYRVTW 360 

RVLSGGPTQQ QELGPGQGSV LLRDLEPGTD' YEVTVSTLFG RSVGPATSLM ARTDASVEQT 420 

LRPVILGPTS ILLSWNLVPE ARGYRLEWRR ETGLEPPQKV VLPSDVTRYQ LDGLQPGTEY 480 

RLTLYTLLEG HEVATPATW PTGPELPVSP VTDLQATELP GQRVRVSWSP VPGATQYRII 54 0 

VRSTQGVERT LVLPGSQTAF DLDDVQAGLS YTVRVSARVG PREGSASVLT VRREPETPLA 600 

VPGLRVWSD ATRVRVAWGP VPGASGFRIS WSTGSGPESS QTLPPDSTAT DITGLQPGTT 660 

YQVAVSVLRG REEGPAAVIV ARTDPLGPVR TVHVTQASSS SVTITWTRVP GATGYRVSWH 720 

SAHGPEKSQb VSGEATVAELt DGLEPDTEYT VHVRAHVAGV DGPPASVWR TAPEPVGRVS 780 

RLQILNASSD VLRITWVGVT GATAYRLAWG RSEGGPMRHQ ILPGNTDSAE IRGLEGGVSY 840 

SVRVTALVGD REGTPVSIW TTPPEAPPAIi GTLHWQRGE HSLRLRWEPV PRAQGFLLHW 900 

QPEGGQEQSR VLGPELSSYH LDGLEPATQY RVRLSVLGPA GEGPSAEVTA RTESPRVPSI 960 

ELRWDTSID SVTLAWTPVS RASSYILSWR PLRGPGQEVP GSPQTLPGIS SSQRVTGLEP 1020 

GVSYIFSLTP VLDGVRGPEA SVTQTPVCPR GLADWFLPH ATQDNAHRAE ATRRVLERLV 1080 

LALGPLGPQA VQVGLLSYSH RPSPLFPLNG SHDLGIILQR IRDMPYMDPS GNNLGTAWT 1140 

AHRYMLAPDA PGRRQHVPGV MVLLVDEPLR GDIFSPIREA QASGLNWML GMAGADPEQL 1200 

RRLAPGMDSV QTFFAVDDGP SLDQAVSGLA TALCQASFTT QPRPEPCPVY CPKGQKGEPG 1260 

EMGLRGQVGP PGDPGLPGRT GAPGPQGPPG SATAKGERGF PGADGRPGSP GRAGNPGTPG 1320 

APGLKGSPGL PGPRGDPGER GPRGPKGEPG APGQVIGGEG PGLPGRKGDP GPSGPPGPRG 1380 

PLGDPGPRGP PGLPGTAMKG DKGDRGERGP PGPGEGG I AP GEPGLPGLPG SPGPQGPVGP 1440 

PGKKGEKGDS EDGAPGLPGQ PGSPGEQGPR GPPGAIGPKG DRGFPGPLGE AGEKGERGPP 1500 

GPAGSRGLPG VAGRPGAKGP EGPPGPTGRQ GEKGEPGRPG DPAWGPAVA GPKGEKGDVG 1560 

PAGPRGATGV QGERGPPGLV LPGDPGPKGD PGDRGPIGLT GRAGPPGDSG PPGEKGDPGR 1620 

PGPPGPVGPR GRDGEVGEKG DEGPPGDPGL PGKAGERGLR GAPGVRGPVG EKGDQGDPGE 1680 

DGRNGSPGSS GPKGDRGEPG PPGPPGRLVD TGPGAREKGE PGDRGQEGPR GPKGDPGLPG 1740 

APGERGIEGF RGPPGPQGDP GVRGPAGEKG DRGPPGLDGR SGLDGKPGAA GPSGPNGAAG 1800 

KAGDPGRDGL PGLRGEQGIiP GPSGPPGLPG KPGEDGKPGL NGKNGEPGDP GEDGRKGEKG 1860 

DSGASGREGR DGPKGERGAP GILGPQGPPG LPGPVGPPGQ GFPGVPGGTG PKGDRGETGS 1920 

KGEQGLPGER GIiRGEPGSVP NVDRLLETAG IKASALREIV ETWDESSGSF LPVPERRRGP 1980 

KGDSGEQGPP GKEGPIGFPG ERGLKGDRGD PGPQGPPGLA LGERGPPGPS GLAGEPGKPG 2040 

IPGLPGRAGG VGEAGRPGER GERGEKGERG EQGRDGPPGL PGTPGPPGPP GPKVSVDEPG 2100 

PGLSGEQGPP GLKGAKGEPG SNGDQGPKGD RGVPGIKGDR GEPGPRGQDG NPGLPGERGM 2160 

AGPEGKPGLQ GPRGPPGPVG GHGDPGPPGA PGLAGPAGPQ GPSGLKGEPG ETGPPGRGLT 2220 

GPTGAVGLPG PPGPSGLVGP QGSPGLPGQV GETGKPGAPG RDGASGKDGD RGSPGVPGSP 2280 

GLPGPVGPKG EPGPTGAPGQ AWGLPGAKG EKGAPGGLAG DLVGEPGAKG DRGLPGPRGE 2340 

KGEAGRAGEP GDPGEDGQKG APGPKGFKGD PGVGVPGSPG PPGPPGVKGD LGLPGLPGAP 2400 

GWGFPGQTG PRGEMGQPGP SGERGLAGPP GREGIPGPLG PPGPPGSVGP PGASGLKGDK 2460 

GDPGVGLPGP RGERGEPG I R GEDGRPGQEG PRGLTGPPGS RGERGEKGDV GSAGLKGDKG 2520 

DSAVILGPPG PRGAKGDMGE RGPRGLDGDK GPRGDNGDPG DKGSKGEPGD KGSAGLPGLR 2580 

GLLGPQGQPG AAGIPGDPGS PGKDGVPGIR GEKGDVGFMG PRGLKGERGV KGACGLDGEK 2640 

GDKGEAGPPG RPGLAGHKGE MGEPGVPGQS GAPGKEGLIG PKGDRGFDGQ PGPKGDQGEK 2700 

GERGTPGIGG FPGPSGNDGS AGPPGPPGSV GPRGPEGLQG QKGERGPPGE RWGAPGVPG 2760 

APGERGEQGR PGPAGPRGEK GEAALTEDD I RGFVRQEMSQ HCACQGQFIA SGSRPLPSYA 2820 

ADTAGSQLHA VPVLRVSHAE EEERVPPEDD EYSEYSEYSV EEYQDPEAPW DSDDPCSLPL 2880 

DEGSCTAYTD RWYHRAVTGS TEACHPFVYG GCGGNANRFG TREACERRCP PRWQSQGTG 2940 
TAQD 
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Seq ID NO: 64 DNA sequence 

Nucleic Acid Accession #: NM_006945 

Coding sequence: 1-219 

5 1 11 21 31 41 51 

I I I 1 1 I 

ATGTCTTATC AACAGCAGCA GTGCAAGCAG CCCTGCCAGC CACCTCCTGT GTGCCCCACG 
CCAAAGTGCC CAGAGCCATG TCCACCCCCG AAGTGCCCTG AGCCCTGCCC ACCACCAAAG 
TGTCCACAGC CCTGCCCACC TCAGCAGTGC CAGCAGAAAT ATCCTCCTGT GACACCTTCC 
10 CCACCCTGCC AGCCAAAGTA TCCACCGAAG AGCAAGTAA 
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so 

120 
180 



15 

20 
25 
30 
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40 
45 
50 
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60 
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Seq ID NO: 65 Protein sequence j 
Protein Accession NP_008876 

1 11 21 31 41 51 

I ] i I I I 

MSYQQQQCKQ PCQPPPVCPT PKCPEPCPPP KCPEPCPPPK CPQPCPPQQC QQKYPPVTPS 
PPCQPKYPPK SK 

Seq ID NO: 66 DNA sequence 

Nucleic Acid Accession #: NM_005629.1 

Coding sequence: 639-2546 



60 



TAGTCGGAGC 
CCGCCGCCGG 
CCGCCGCCAC 
CGGGGCGGGG 
CCGATGTCGC 
GCCGCCGCCC 
GCCGGGCCCC 
CTCACTTTCA 
CGTCCGCCCG 
CCTCGGGGCC 
GCCGCGACCC 
ACGGCATCTA 
ACGGGGCCCC 
CCGTGCCGCC 
TCGCCGTGGG 
GTGTGTTCCT 
TAGAGATCTC 
CCCTGTTCAA 
ACATCATGGT 
CCTGGGCCAC 
AAGACTGTGC 
CCCCTGTCAT 
CAGGGGCCCT 
TCTGTGTCTG 
CCTACGTGGT 
GCATCATTTA 
ATGCGGGGAC 
GCAGCTACAA 
ACAGTGGGAC 
CAGAGCAGGG 
CCTACCCGCG 
TCATGCTGTT 
GCCTCCTCGA 
CCCTCTGTTG 
ACGTCTTCCA 
TTTGGGAGTG 
CCTGTATGAT 
CGCTGGT CTG 
ACAACACCTA 
CCATGCTGTG 
CTGAGCGCTG 
CTCAGGACGC 
TCGTCGTGGT 
GCCATAGCAG 
CACTTTTGGG 
ACTTTTTCCA 
TCCCCCTCCA 
CAGTGCTGCA 
CTCTGCAGCA 
TGTGAGGAAG 
GAGGGGCAGC 
TTATAGAAGC 
CCAGTATCAA 
TATAGATCTC 
GGAGGCTTGT 
TTTCTAAAAA 
CCTACCTTAC 
GCAAAACAAA 
GTCCCCAGCC 
GCTGTCCCTT 
GGGCTGCTAA 
GACCCTGGAC 



11 

I 

GAGGTGGCGA 
GAAGGAGAGG 
CACCGCCACC 
GGCGCGGGCC 
CCGCGCCCCG 
GAGCCGCGGG 
GACGCCGCCC 
AGTGCTTCGT 
CCGCCCCGTC 
CTCCCCGGTG 
CGGCCCGGCC 
TAGCGTGTCC 
GGCCAAGGGC 
GCGCGAGACC 
CTTGGGCAAC 
TATTCCCTAC 
GCTGGGCCAG 
AGGCCTGGGC 
GCTGGCCTGG 
ATGTGGCCAC 
CAATGCCAGC 
CGAGTTCTGG 
CAACTGGGAG 
GAAGGGGGTC 
CCTGGTCGTG 
CTATCTCAAG 
CCAGATTTTC 
CCGCTTCAAC 
CAGCTT CTTT 
CGTGCACATC 
GGCTGTCACG 
GCTGCTTGGT 
CCTCCTCCCG 
TGCCCTCTGC 
GCTGTTTGAC 
CGTGGTGGTG 
CGGGTACCGA 
C ATGGG CAT C 
CGTGTACCCG 
CGTGCCGCTG 
GCAGCACCTG 
AGATGTCAGG 
GGAGAGTGTC 
CCCCTGCTTC 
GTCTGCCTGG 
TTTTTAATAA 
GCCCTAGCCG 
CTCCTCCTGC 
CACCCGTGGG 
GGAAGGAGGG 
AGAACCAAGG 
TTAGAGAGCC 
TTGTGTGAGC 
TATCTCTTAG 
ATTTTGCACA 

GAGGAAGGAG 
CCCTCTGCCC 
AGCTTCGAGC 
CCAGACTGGA 
TGCCACAAGT 
CCTGGCCTGC 
ACGGCTCCCA 



21 
I 

GTCGCTGAGC 
GCGAGGCGCG 
GGAGTCGCGG 
ACAGGCCCCT 
TTAGGATGAG 
CAGGAGCCTC 
GCGCGCCCCC 
GGACTGCTTC 
CCCCGGCCCG 
CCGCCGGTGC 
GTGCGGCCCG 
GGCGACGAGA 
GACGGCCCCG 
TGGACGCGCC 
GTGTGGCGCT 
GTCCTGATCG 
TTCATGAAGG 
TACGCCTCCA 
GGCTTCTATT 
ACCTGGAACA 
CTGGCCAACC 
GAGAACAAAG 
GTGACCCTTT 
AAAT CCACGG 
CTGCTGGTGC 
CCTGACTGGT 
TTTTCTTACG 
AACAACTGCT 
GCTGGCTTCG 
TCCAAGGTGG 
CTGATGCCAG 
CTCGACAGCC 
GCCTCCTACT 
TTTGTCATCG 
TACT ACT CGG 
GCCTGGGTGT 
CCTTGCCCCT 
TTCATCTTCA 
TGGTGGGGTG 
CACCTCCTGG 
ACCCAGCCCA 
GGCCTGACCA 
ATGTGACAAC 
AGCCCCACCG 
GGGAGGAGGG 
AACGCCAAAA 
AGCTGGTCCT 
CCCTGCCACG 
TGACCCCTCA 
AGAGACG GGA 
CAAATATTTC 
AGCCAGCAAT 
TTGGGTGCGA 
CAAAGGTGAA 
TTTTATAAAA 
CCCAAACCAT 
CTAGCCAAGG 
TGTTGCGTGT 
TTGGAAAAGT 
CTGTGGGGCA 
TCAGGCTTCC 
CGTCCAGGCT 



31 

I 

CCGCCGCGGC 
CCCGAGCCGC 
GCCAGCCGGG 
GCTCCGGCCG 
TCTCGGGTCG 
GGGAGCCGCC 
GGGCCCCCGA 
TGACTGCGCC 
GCCGCCCCCC 
CCCCCGCCTG 
CCGGGGCCAT 
AGAAGGGCCC 
TGGGCCTGGG 
AGATGGACTT 
TCCCCTACCT 
CCCTGGTTGG 
CCGGCAGCAT 
TGGTGATCGT 
ACCTGGT CAA 
CTCCCGACTG 
TCACCTGTGA 
TCTTGAGGCT 
GTCTGCTGGC 
GAAAG AT CGT 
GTGGAGTGCT 
CAAAGCTGGG 
CCATTGGCCT 
ACAAGGACGC 
TGGTCTTCTC 
CAGAGTCAGG 
TGGCCCCACT 
AGTTTGTAGG 
ACTTCCGTTT 
ATCTCTCCAT 
CCAGCGGCAC 
ACGGAGCTGA 
GGATGAAATG 
ACGTTGTGTA 
AGGCCATGGG 
GCTGCCTCCT 
TCTGGGGCCT 
CCCTGACCCC 
TCAGCTCACA 
CACCCCTCCA 
GAGAAAGCAC 
ATATCACAAC 
AGGCCCCGCC 
CCCACCCCCT 
CCCCAGAAGC 
GGGAGGAGAG 
AGCTGGGCTA 
GGAACCTTCT 
GTGCACGCGT 
TGCCAGATGT 
ACTTGAGAGA 
CCTCTCCTTA 
AGTGTGAATT 
GTGAGTCTGT 
GCATGGTGGG 
AGAGGCTGCA 
CACCCTGTGC 
TAAGGTGGAT 



41 

t 

CCCGAGAGCG 
CGCCGCCGCC 
CAGCCTCCGC 
TCGTTTGCAG 
GGCGAGGAGC 
GCCGCCGCCG 
CACACATGAG 
GCCCGCGCCC 
GGCCCCCGGC 
ACCGCCGCCC 
GGCGAAGAAG 
CCTCATCGCG 
GACACCCGGC 
CATCATGTCG 
GTGCTACAAG 
AGGAAT CCCC 
CAATGTCTGG 
CTTCTACTGC 
GTCCTTTACC 
CGTGGAG AT C 
CCAGCTTGCT 
GTCTGGGGGA 
CTGCTGGGTG 
GTACTTCACT 
GCTGCCTGGC 
GTCCCCTCAG 
GGGGGCCCTC 
CATCATCCTG 
CATCCTGGGC 
GCCGGGCCTG 
CTGGGCTGCC 
TGTGGAGGGC 
CCAAAGGGAG 
GGTGACTGAT 
CACCCTGCTC 
CCGCTTCATG 
GTGCTGGTCC 
CTACGAGCCG 
CTGGGCCTTC 
CAGGGCCAAG 
CCACCACTTG 
AGTGT CCGAG 
TCACCAGCTC 
GGGGGCCTGC 
CATGAGTGCT 
CCACCAAAAA 
TAGTGCCCCA 
GCCCACCTCT 
AGCAGTGGCA 
AGAGGAGAAG 
TACCCCTCTC 
GGTTCCTGCG 
GCGTGAGTAC 
AAATGGCGCC 
ATGAGATTTC 
CCACTCCCAT 
TATAGATCTA 
TGTGTGGATG 
GGCCTCGGGG 
ATATTCCGTC 
GGGGCACACC 
GCACTTCCCG 



51 
I 

GCTGCAGCCG 
GCCACCGCCG 
GGGCCCCGGC 
ACCGCGGGCG 
CGCCGCAGCC 
CCGCCGCCCG 
ATTCTTCAGG 
CGCACCCCGC 
CGGCCCGCGC 
CCCGTGAGGC 
AGCGCCGAGA 
CCCGGGCCCG 
GGCCGCCTGG 
TGCGTGGGCT 
AACGGCGGAG 
ATTTTCTTCT 
AACATCTGTC 
AACACCTACT 
ACCACGCTGC 
TTCCGCCATG 
GACCGCCGGT 
CTGGAGGTGC 
CTGGTCTACT 
GCTACATTCC 
GCCCTGGATG 
GTGTGGATAG 
ACAGCCCTGG 
GCTCTCATCA 
TTCATGGCTG 
GCCTTCATCG 
CTGTTCTTCT 
TTCATCACCG 
ATCTCTGTGG 
GGCGGGATGT 
TGGCAGGCCT 
GACGACATTG 
TTCTTCACCC 
CTGGTCTACA 
GCCCTGTCCT 
GGCACCATGG 
GAGTACCGAG 
AGCAGCAAGG 
ACCTCTGGTA 
CTTTCCCTGA 
CACTAAAACA 
TAGATGCCTC 
CCCCCACCCA 
CCAGGCTCTG 
GCTTGGGAAA 
GGAGGCAGGG 
CCCATCCCTG 
CCAATCGCCA 
GGAGAGTATA 
TCTGGGCAAA 
TGCTTGTATA 
CCCTGTGAGC 
ACTTTCATAG 
TGCGTGTGTG 
CTGTCCCCAC 
CTGGGTGTCT 
CCCAGGAAGG 
CACCTCCAGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
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CTTCTGTGTA GCAGCTTTAA CCCACGTTTG TCTGTCACGT CCAGTCCCGA GACGGCTGAG 3780 

TGACCCCAAG AAAGGCTTCC CCGACACCCA GACAGAGGCT GCAGGGCTGG GGCTGGGTGA 3840 

GGGTGGCGGG CCTGCGGGGA CATT CTACTG TGCTAAAAAG CCACTGCAGA CAT AG C AAT A 3900 
AAAACATGTC ATTTTCC 

Seq ID NO: 67 Protein sequence: 
Protein Accession #: NP_005620.1 

1 11 21 31 41 51 

1 I 1 1 I I 

MAKKSAENGI YSVSGDEKKG PLIAPGPDGA PAKGDGPVGL GTPGGRLAVP PRE T W TRQMD 60 

FIMSCVGFAV GLGNVWRFPY LCYKNGGGVF LIPYVLIALV GGIPIFFLEI SLGQFMKAGS 120 

INVWNICPLF KGLGYASMVI VFYCNTYYIM VLAWGFYYLV KSFTTTLrPWA TCGHTWNTPD 180 

CVEIFRHEDC ANASLANDTC DQLADRRSPV IEFWENKVLR LSGGLEVPGA LNWEVTLCLL 240 

ACWVLVYFCV WKGVKSTGKI VYFTATFPYV VLWLLVRGV LLPGALDGII YYLKPDWSKL 300 

GSPQVWIDAG TQIFFSYAIG LGALTALGSY NRFNNNCYKD AIILALINSG TSFFAGFWF 3 60 

SILGFMAAEQ GVHISKVAES GPGUVF I AYP RAVTLMPVAP LWAALFFFML LLLGLDSQFV 42 0 

GVEGFITGLL DLLPASYYFR FQREISVALC CALCFVIDLS MVTDGGMYVF QLFDYYSASG 480 

TTLIiWQAFWE CWVAWVYGA DRFMDDIACM IGYRPCPWMK WCWSFFTPLV CMGIFIFNW 540 

YYEPLVYNNT YVYPWWGEAM GWAFALSSML CVPLHLLGCL LRAKGTMAER WQHLTQPIWG 600 
LHHLEYRAQD ADVRGLTTLT PVSESSKWV VESVM 

Seq ID NO: 68 DNA sequence 

Nucleic Acid Accession #: NM_021953.1 

Coding sequence: 178-2469 

1 11 21 31 41 51 

] 1 I I 1 I 

GGCACGAGGG GGACCCGGCC GGTCCGGCGC GAGCCCCCGT CCGGGGCCCT GGCTCGGCCC 60 

CCAGGTTGGA GGAGCCCGGA GCCCGCCTTC GGAGCTACGG CCTAACGGCG GCGGCGACTG 120 

CAGTCTGGAG GGTCCACACT TGTGATTCTC AATGGAGAGT GAAAACGCAG ATT CATAATG 180 

AAAGCTAGCC CCCGTCGGCC ACTGATTCTC AAAAGACGGA GGCTGCCCCT TCCTGTTCAA 240 

AATGCCCCAA GTGAAACATC AGAGGAGGAA CCTAAGAGAT CCCCTGCCCA ACAGGAGTCT 3 00 

AAT CAAGC AG AGGCCTCCAA GGAAGTGGCG GAGTCCAACT CTTGCAAGTT TCCAGCTGGG 360 

ATCAAGATTA TTAACCACCC CACCATGCCC AACACGCAAG TAGTGGCCAT CCCCAACAAT 42 0 

GCTAATATTC ACAGCATCAT CACAGCACTG ACTGCCAAGG GAAAAGAGAG TGGCAGTAGT 480 

GGGCCCAACA AATTCAT CCT CATCAGCTGT GGGGGAGCCC CAACTCAGCC TCCAGGACTC 540 

CGGCCTCAAA CCCAAACCAG CTATGATGCC AAAAGGACAG AAGTGACCCT GGAGACCTTG 600 

GGACCAAAAC CTGCAGCTAG GGATGTGAAT CTTCCTAGAC CACCTGGAGC CCTTTGCGAG 660 

CAGAAACGGG AGACCTGTGC AGATGGTGAG GCAGCAGGCT GCACTATCAA CAATAGCCTA 720 

TCCAACATCC AGTGGCTTCG AAAGATGAGT TCTGATGGAC TGGGCTCCCG CAGCATCAAG 780 

CAAGAGATGG AGGAAAAGGA GAATTGTCAC CTGGAGCAGC GACAGGTTAA GGTTGAGGAG 840 

CCTTCGAGAC CATCAGCGTC CTGGCAGAAC TCTGTGTCTG AGCGGCCACC CTACTCTTAC 900 

ATGGCCATGA TACAATTCGC CAT CAACAGC ACTGAGAGGA AGCGCATGAC TTTGAAAGAC 960 

AT CTATACGT GGATTGAGGA CCACTTTCCC TACTTTAAGC ACATTGCCAA GCCAGGCTGG 102 0 

AAGAACTCCA TCCGCCACAA CCTTTCCCTG CACGACATGT TTGTCCGGGA GACGTCTGCC 1080 

AATGGCAAGG TCTCCTTCTG GACCATTCAC CCCAGTGCCA ACCGCTACTT GACATTGGAC 1140 

CAGGTGTTTA AGCCACTGGA CCCAGGGTCT CCACAATTGC CCGAGCACTT GGAATCACAG 12 00 

CAGAAACGAC CGAATCCAGA GCTCCGCCGG AACATGACCA T C AAAACCGA ACTCCCCCTG 12 60 

GGCGCACGGC GGAAGATGAA GCCACTGCTA CCACGGGTCA GCTCATACCT GGTACCTATC 132 0 

CAGTTCCCGG TGAACCAGTC ACTGGTGTTG CAGCCCTCGG TGAAGGTGCC ATTGCCCCTG 1380 

GCGGCTTCCC TCATGAGCTC AGAGCTTGCC CGCCATAGCA AGCGAGTCCG CATTGCCCCC 1440 

AAGGTGCTGC TAGCTGAGGA GGGGATAGCT CCTCTTTCTT CTGCAGGACC AGGGAAAGAG 1500 

GAGAAACTCC TGTTTGGAGA AGGGTTTTCT CCTTTGCTTC CAGTTCAGAC TATCAAGGAG 1560 

GAAGAAATCC AGCCTGGGGA GGAAATGCCA CACTTAGCGA GACCCATCAA AGTGGAGAGC 162 0 

CCTCCCTTGG AAGAGTGGCC CTCCCCGGCC CCATCTTTCA AAGAGG AAT C ATCTCACTCC 1680 

TGGGAGGATT CGTCCCAATC TCCCACCCCA AGACCCAAGA AGTCCTACAG TGGGCTTAGG 1740 

TCCCCAACCC GGTGTGTCTC GGAAATGCTT GTGATTCAAC ACAGGGAGAG GAGGGAGAGG 1800 

AGCCGGTCTC GGAGGAAACA GCATCTACTG CCTCCCTGTG TGGATGAGCC GGAGCTGCTC 1860 

TTCTCAGAGG GGCCCAGTAC TTCCCGCTGG GCCGCAGAGC TCCCGTTCCC AGCAGACTCC 192 0 

TCTGACCCTG CCTCCCAGCT CAGCTACTCC CAGGAAGTGG GAGGACCTTT TAAGACACCC 1980 

ATTAAGGAAA CGCTGCCCAT CTCCTCCACC CCGAGCAAAT CTGTCCTCCC CAGAACCCCT 2 040 

GAATCCTGGA GGCTCACGCC CCCAGCCAAA GTAGGGGGAC TGGATTTCAG CCCAGTACAA 2100 

ACCTCCCAGG GTGCCTCTGA CCCCTTGCCT GACCCCCTGG GGCTGATGGA TCTCAGCACC 2160 

ACTCCCTTGC AAAGTGCTCC CCCCCTTGAA TCACCGCAAA GGCTCCTCAG TTCAGAACCC 2220 

TTAGACCTCA TCTCCGTCCC CTTTGGCAAC TCTTCTCCCT CAGATATAGA CGTCCCCAAG 2280 

CCAGGCTCCC CGGAGCCACA GGTTTCTGGC CTTGCAGCCA ATCGTTCTCT GACAGAAGGC 2340 

CTGGTCCTGG ACACAATGAA TGACAGCCTC AGCAAGATCC TGCTGGACAT CAGCTTTCCT 2400 

GGCCTGGACG AGGACCCACT GGGCCCTGAC AACATCAACT GGTCCCAGTT TATTCCTGAG 2460 

CTACAGTAGA GCCCTGCCCT TGCCCCTGTG CTCAAGCTGT CCACCATCCC GGGCACTCCA 2520 

AGGCTCAGTG CACCCCAAGC CTCTGAGTGA GGACAGCAGG CAGGGACTGT TCTGCTCCTC 2580 

ATAGCTCCCT GCTGCCTGAT TATGCAAAAG TAGCAGTCAC ACCCTAGCCA CTGCTGGGAC 2 640 

CTTGTGTTCC CCAAGAGTAT CTGATTCCTC TGCTGTCCCT GCCAGGAGCT GAAGGGTGGG 2 700 

AACAACAAAG GCAATGGTGA AAAGAGATTA GGAACCCCCC AGCCTGTTTC CATTCTCTGC 2760 

CCAGCAGTCT CTTACCTTCC CTGATCTTTG CAGGGTGGTC CGTGTAAATA GTATAAATTC 282 0 

TCCAAATTAT CCTCTAATTA TAAATGTAAG CTTATTTCCT TAGATCATTA T CCAGAGACT 2880 

GCCAGAAGGT GGGTAGGATG ACCTGGGGTT TCAATTGACT TCTGTTCCTT GCTTTTAGTT 2940 

TTGATAGAAG GGAAGACCTG CAGTGCACGG TTTCTTCCAG GCTGAGGTAC CTGGAT CTTG 3000 

GGTTCTTCAC TGCAGGGACC CAGACAAGTG GATCTGCTTG CCAGAGTCCT TTTTGCCCCT 3 0 60 

CCCTGCCACC TCCCCGTGTT T CCAAGT CAG CTTTCCTGCA AGAAGAAATC CTGGTTAAAA 3120 

AAGTCTTTTG TATTGGGTCA GGAGTTGAAT TTGGGGTGGG AGGATGGATG CAACTGAAGC 3180 

AGAGTGTGGG TGCCCAGATG TGCGCTATTA GATGTTTCTC TGATAATGTC CCCAAT CAT A 3240 

CCAGGGAGAC TGGCATTGAC GAGAACTCAG GTGGAGGCTT GAGAAGGCCG AAAGGGCCCC 33 00 

TGACCTGCCT GGCTTCCTTA GCTTGCCCCT CAGCTTTGCA AAGAGCCACC CTAGGCCCCA 3360 

GCTGACCGCA TGGGTGTGAG CCAGCTTGAG AACACTAACT ACT C AAT AAA AGCGAAGGTG 3420 
GACCNAAAAA AAAAAAAAAA AAAA 
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Seq ID NO: 69 Protein sequence: 
Protein Accession #: NP_068772.1 

5 1 11 21 31 41 51 

I 1 I ! I I 

MKASPRRPLI LKRRRLPLPV QNAPSETSEE EPKRSPAQQE SNQAEASKEV AESNSCKFPA 60 

GIKIINHPTM PNTQWAIPN NANIHSIITA LTAKGKESGS SGPNKFILIS CGGAPTQPPG 12 0 

LRPQTQTSYD AKRTEVTLET LGPKPAARDV NLPRPPGALC EQKRETCADG EAAGCTINNS 180 

10 LSNIQWLRKM SSDGLGSRSI KQEMEEKENC HhEQRQVKVE EPSRPSASWQ NSVSERPPYS 240 

YMAMIQFAIN STERKRMTLK DIYTWIEDHF PYFKHIAKPG WKNSIRHNLS LHDMFVRETS 300 

ANGKVSFWTI HP S ANRYLTL DQVFKPLDPG SPQLPEHLES QQKRPNPELR RNMTIKTELP 360 

LGARRKMKPL LPRVSSYLVP IQFPVNQSLV LQPSVKVPLP LAASLMSSEL ARHSKRVRIA 420 

PKVLLAEEGI APLSSAGPGK EEKLLFGEGF SPLLPVQTIK EEEIQPGEEM PHLARPIKVE 480 

15 SPPLEEWPSP APSFKEESSH SWEDSSQSPT PRPKKSYSGL RSPTRCVSEM LVIQHRERRE 540 

RSRSRRKQHL LPPCVDEPEli LFSEGPSTSR WAAELPFPAD SSDPASQLSY SQEVGGPFKT 600 

PIKETLPISS TPSKSVLPRT PESWRLTPPA KVGGLDFSPV QTSQGASDPL PDPLGLMDLS 660 

TTPLQSAPPL ESPQRLLSSE PLDLISVPFG NSSPSDIDVP KPGSPEPQVS GLAANRSLTE 72 0 
GLVLDTMNDS LSKILLDISF PGLDEDPLGP DNINWSQFIP ELQ 



Seq ID NO: 70 DNA sequence 

Nucleic Acid Accession #: BC006529.1 

Coding sequence: 178-2424 

1 11 21 31 41 51 

I I I 1 1 I 

GGCACGAGGG GGACCCGGCC GGTCCGGCGC GAGCCCCCGT CCGGGGCCCT GGCTCGGCCC 60 

CCAGGTTGGA GGAGCCCGGA GCCCGCCTTC GGAGCTACGG CCTAACGGCG GCGGCGACTG 120 

30 CAGTCTGGAG GGTCCACACT TGTGATTCTC AATGGAGAGT GAAAACGCAG AT T C ATAATG 180 

AAAACTAGCC CCCGTCGGCC ACTGATTCTC AAAAGACGGA GGCTGCCCCT TCCTGTTCAA 240 

AATGCCCCAA GTGAAACATC AGAGGAGGAA CCTAAGAGAT CCCCTGCCCA ACAGGAGTCT 300 

AATCAAGCAG AGGCCTCCAA GGAAGTGGCA GAGTCCAACT CTTGCAAGTT TCCAGCTGGG 360 

ATCAAGATTA TTAACCACCC CACCATGCCC AACACGCAAG TAGTGGCCAT CCCCAACAAT 42 0 

35 GCTAATATTC ACAGCATCAT CACAGCACTG ACTGCCAAGG GAAAAGAGAG TGGCAGTAGT 480 

GGGCCCAACA AATTCATCCT CAT CAGCTGT GGGGGAGCCC CAACTCAGCC TCCAGGACTC 540 

CGGCCTCAAA CCCAAACCAG CTATGATGCC AAAAGGACAG AAGTGACCCT GGAGACCTTG 600 

GGACCAAAAC CTGCAGCTAG GGATGTGAAT CTTCCTAGAC CACCTGGAGC CCTTTGCGAG 660 

CAGAAACGGG AGACCTGTGC AGATGGTGAG GCAGCAGGCT G CACTAT CAA CAATAGCCTA 720 

40 TCCAACATCC AGTGGCTTCG AAAGATGAGT TCTGATGGAC TGGGCTCCCG CAGCAT CAAG 780 

CAAGAGATGG AGGAAAAGGA GAATTGTCAC CTGGAGCAGC GACAGGTTAA GGTTGAGGAG 840 

CCTTCGAGAC CATCAGCGTC CTGGCAGAAC TCTGTGTCTG AGCGGCCACC CTACTCTTAC 900 

ATGGCCATGA TACAATTCGC CATCAACAGC ACTGAGAGGA AGCGCATGAC TTTGAAAGAC 960 

ATCTATACGT GGATTGAGGA CCACTTTCCC TACT TTAAGC ACATTGCCAA GCCAGGCTGG 1020 

45 AAGAACTCCA TCCGCCACAA CCTTTCCCTG CACGACATGT TTGTCCGGGA GACGTCTGCC 1080 

AATGGCAAGG TCTCCTTCTG GACC AT TCAC CCCAGTGCCA ACCGCTACTT GACATTGGAC 1140 

CAGGTGTTTA AG C AGCAG AA ACGACCGAAT CCAGAGCTCC GCCGGAACAT GACCATCAAA 1200 

ACCGAACTCC CCCTGGGCGC ACGGCGGAAG ATGAAGCCAC TGCTACCACG GGTCAGCTCA 1260 

TACCTGGTAC CTATCCAGTT CCCGGTGAAC CAGTCACTGG TGTTGCAGCC CTCGGTGAAG 1320 

50 GTGCCATTGC CCCTGGCGGC TTCCCTCATG AGCTCAGAGC TTGCCCGCCA TAGCAAGCGA 1380 

GTCCGCATTG CCCCCAAGGT GCTGCTAGCT GAGGAGGGGA TAGCTCCTCT TTCTTCTGCA 1440 

GGACCAGGGA AAGAGGAGAA ACTCCTGTTT GGAGAAGGGT TTTCTCCTTT GCTTCCAGTT 1500 

CAGACTATCA AGGAGGAAGA AAT CCAGCCT GGGGAGGAAA TGCCACACTT AG CGAGACCC 1560 

ATCAAAGTGG AGAGCCCTCC CTTGGAAGAG TGGCCCTCCC CGGCCCCATC TTTCAAAGAG 1620 

55 G AAT CAT CTC ACTCCTGGGA GGATTCGTCC CAATCTCCCA CCCCAAGACC CAAGAAGTCC 1680 

TACAGTGGGC TTAGGTCCCC AACCCGGTGT GTCTCGGAAA TGCTTGTGAT TCAACACAGG 1740 

GAGAGGAGGG AG AGG AG CCG GTCTCGGAGG AAACAGCATC TACTGCCTCC CTGTGTGGAT 1800 

GAGCCGGAGC TGCTCTTCTC AGAGGGGCCC AGTACTTCCC GCTGGGCCGC AGAGCTCCCG 1860 

TTCCCAGCAG ACTCCTCTGA CCCTGCCTCC CAGCTCAGCT ACTCCCAGGA AGTGGGAGGA 192 0 

60 CCTTTTAAGA CACCCATTAA GGAAACGCTG CCCATCTCCT CCACCCCGAG CAAATCTGTC 1980 

CTCCCCAGAA CCCCTGAATC CTGGAGGCTC ACGCCCCCAG CCAAAGTAGG GGGACTGGAT 2040 

TTCAGCCCAG TACAAACCCC CCAGGGTGCC TCTGACCCCT TGCCTGACCC CCTGGGGCTG 2100 

ATGGATCTCA GCACCACTCC CTTGCAAAGT GCTCCCCCCC TTGAATCACC GCAAAGGCTC 2160 

CTCAGTTCAG AACCCTTAGA CCTCATCTCC GTCCCCTTTG GCAACTCTTC TCCCTCAGAT 2220 

65 ATAGACGTCC CCAAGCCAGG CTCCCCGGAG CCACAGGTTT CTGGCCTTGC AGCCAATCGT 22 80 

TCTCTGACAG AAGGCCTGGT CCTGGACACA ATGAATGACA GCCTCAGCAA GATCCTGCTG 2340 

GACATCAGCT TTCCTGGCCT GGACGAGGAC CCACTGGGCC CTGACAACAT CAACTGGTCC 240 0 

CAGTTTATTC CTGAGCTACA GTAGAGCCCT GCCCTTGCCC CTGTGCTCAA GCTGTCCACC 2460 

ATCCCGGGCA CTCCAAGGCT CAGTGCACCC CAAGCCTCTG AGTGAGGACA GCAGGCAGGG 2520 

70 ACTGTTCTGC TCCTCATAGC TCCCTGCTGC CTGATTATGC AAAAGTAGCA GTCACACCCT 2580 

AGCCACTGCT GGGACCTTGT GTTCCCCAAG AGTATCTGAT TCCTCTGCTG TCCCTGCCAG 2640 

GAGCTGAAGG GTGGGAACAA CAAAGGCAAT GGTGAAAAGA GATTAGGAAC CCCCCAGCCT 2700 

GTTTCCATTC TCTGCCCAGC AGTCTCTTAC CTTCCCTGAT CTTTGCAGGG TGGTCCGTGT 27 60 

AAATAGTATA AATTCTCCAA ATTATCCTCT AATTATAAAT GTAAGCTTAT TTCCTTAGAT 2 820 

75 CATTAT CCAG AGACTGCCAG AAGGTGGGTA GGATGACCTG GGGTTTCAAT TGACTTCTGT 2880 

TCCTTGCTTT TAGTTTTGAT AGAAGGGAAG ACCTGCAGTG CACGGTTTCT TCCAGGCTGA 2940 

GGTACCTGGA TCTTGGGTTC TTCACTGCAG GGACCCAGAC AAGTGGATCT GCTTGCCAGA 3000 

GTCCTTTTTG CCCCTCCCTG CCACCTCCCC GTGTTTCCAA GTCAGCTTTC CTGCAAGAAG 3060 

AAATCCTGGT TAAAAAAGTC TTTTGTATTG GGTCAGGAGT TGAATTTGGG GTGGGAGGAT 3120 

80 GGATGCAACT GAAGCAGAGT GTGGGTGCCC AGATGTGCGC TATTAGATGT TTCTCTGATA 3180 

ATGTCCCCAA TCATACCAGG GAGACTGGCA TTGACGAGAA CTCAGGTGGA GGCTTGAGAA 3240 

GGCCGAAAGG GCCCCTGACC TGCCTGGCTT CCTTAGCTTG CCCCTCAGCT TTGCAAAGAG 3300 

CCACCCTAGG CCCCAGCTGA CCGCATGGGT GTGAGCCAGC TTGAGAACAC TAACT ACT CA 3 3 60 
AT AAAAG CGA AGGTGGAAAA AAAAAAAAAA AAAAAAA 

b5 

Seq ID NO: 71 Protein sequence: 
Protein Accession #: AAH06529.1 
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1 11 21 31 41 51 

I I I I I I 

MKTSPRRPLI LKRRRLPLPV QNAPSETSEE EPKRSPAQQE SNQAEASKEV AESNSCKFPA 60 

5 GIKIINHPTM PNTQWAIPN NANIHSIITA LTAKGKESGS SGPNKFILIS CGGAPTQPPG 120 

LRPQTQTSYD AKRTEVTLET LGPKPAARDV NLPRPPGALC EQKRETCADG EAAGCTINNS 180 

LSNIQWLRKM SSDGLGSRSI KQEMEEKENC HLEQRQVKVE EPSRPSASWQ NSVSERPPYS 240 

YMAMIQFAIN STERKRMTLK DIYTWIEDHF PYFKHIAKPG WKNSIRHNLS LHDMFVRETS 3 00 

ANGKVSFWTI HPSANRYLTL DQVFKQQKRP NPELRRNMTI KTELPLGARR KMKPLLPRVS 360 

10 SYLVPIQFPV NQSLVLQPSV KVPLPLAASL MSSELARHSK RVRIAPKVLL AEEGIAPLSS 420 

AGPGKEEKLL FGEGFSPLLP VQTIKEEEIQ PGEEMPHLAR PIKVESPPLE EWPSPAPSFK 480 

EESSHSWEDS SQSPTPRPKK SYSGLRSPTR CVSEMLVIQH RERRERSRSR RKQHLLPPCV 540 

DEPELLFSEG PSTSRWAAEL PFPADSSDPA SQLSYSQEVG GPFKTPIKET LPISSTPSKS 600 

VLPRTPESWR LTPPAKVGGL DFSPVQTPQG ASDPLPDPLG LMDLSTTPLQ SAPPLESPQR 660 

15 LLSSEPLDLI SVPFGNSSPS DIDVPKPGSP EPQVSGLAAN RSLTEGLVLD TMNDSLSKIL 720 
LDISFPGLDE DPLGPDNINW SQFIPELQ 



Seq ID NO: 72 DNA sequence 
20 Nucleic Acid Accession #: U74612.1 
Coding sequence: 178-2583 

1 11 21 31 41 51 

o« I 1 > I 1 1 

A J GGCACGAGGG GGACCCGGCC GGTCCGGCGC GAGCCCCCGT CCGGGGCCCT GGCTCGGCCC 60 

CCAGGTTGGA GGAGCCCGGA GCCCGCCTTC GGAGCTACGG CCTAACGGCG GCGGCGACTG 120 

CAGTCTGGAG GGTCCACACT TGTGATTCTC AATGGAGAGT GAAAACGCAG ATTCATAATG 180 

AAAACTAGCC CCCGTCGGCC ACTGATTCTC AAAAGACGGA GGCTGCCCCT TCCTGTTCAA s 240 

AATGCCCCAA GTGAAAC AT C AGAGGAGGAA CCTAAGAGAT CCCCTGCCCA ACAGGAGTCT 30 0 

30 AATCAAGCAG AGGCCTCCAA GGAAGTGGCA GAGTCCAACT CTTGCAAGTT TCCAGCTGGG 3 60 

ATCAAGATTA TTAACCACCC CACCATGCCC AACACGCAAG TAGTGGCCAT CCCCAACAAT 420 

GCTAATATTC ACAGCATCAT CACAGCACTG ACTGCCAAGG GAAAAGAGAG TGGCAGTAGT 480 

GGGCCCAACA AATTCATCCT CATCAGCTGT GGGGGAGCCC CAACTCAGCC TCCAGGACTC 540 

CGGCCTCAAA CCCAAACCAG CTATGATGCC AAAAGGACAG AAGTGACCCT GGAGACCTTG 60 0 

35 GGACCAAAAC CTGCAGCTAG GGATGTGAAT CTTCCTAGAC CACCTGGAGC CCTTTGCGAG 660 

CAGAAACGGG AGACCTGTGC AGATGGTGAG GCAGCAGGCT GCACTATCAA CAATAGCCTA 720 

TCCAACATCC AGTGGCTTCG AAAGATGAGT TCTGATGGAC TGGGCTCCCG CAGCATCAAG 7 80 

CAAGAGATGG AGGAAAAGGA GAATTGT CAC CTGGAGCAGC GACAGGTTAA GGTTGAGGAG 840 

CCTTCGAGAC CATCAGCGTC CTGGCAGAAC TCTGTGTCTG AGCGGCCACC CTACTCTTAC 900 

40 ATGGCCATGA TACAATTCGC CATCAACAGC ACTGAGAGGA AGCGCATGAC TTTGAAAGAC 960 

ATCTATACGT GGATTGAGGA CCACTTTCCC TACTTTAAGC ACATTGCCAA GCCAGGCTGG 1020 

AAGAACTCCA TCCGCCACAA CCTTTCCCTG CACGACATGT TTGTCCGGGA GACGTCTGCC 1080 

AATGGCAAGG TCTCCTTCTG GACCATTCAC CCCAGTGCCA ACCGCTACTT GACATTGGAC 1140 

CAGGTGTTTA AGCCACTGGA CCCAGGGTCT CCACAATTGC CCGAGCACTT GGAATCACAG 120 0 

45 CAGAAACGAC CGAAT CCAGA GCTCCGCCGG AACATGACCA TCAAAACCGA ACTCCCCCTG 12 60 

GGCGCACGGC GGAAGATGAA GCCACTGCTA CCACGGGTCA GCTCATACCT GGTACCTATC 1320 

CAGTTCCCGG TGAACCAGTC ACTGGTGTTG CAGCCCTCGG TGAAGGTGCC ATTGCCCCTG 13 80 

GCGGCTTCCC TCATGAGCTC AGAGCTTGCC CGCCATAGCA AGCGAGTCCG CATTGCCCCC 1440 

AAGGTTTTTG GGGAACAGGT GGTGTTTGGT TACATGAGTA AGTTCTTTAG TGGCGATCTG 1500 

50 CGAGATTTTG GTACACCCAT CACCAGCTTG TTTAATTTTA TCTTTCTTTG TTTATCAGTG 1560 

CTGCTAGCTG AGGAGGGGAT AGCTCCTCTT TCTTCTGCAG GACCAGGGAA AGAGGAGAAA 1620 

CTCCTGTTTG GAGAAGGGTT TTCTCCTTTG CTTCCAGTTC AGACTATCAA GGAGGAAGAA 1680 

ATCCAGCCTG GGGAGGAAAT GCCACACT TA GCGAGACCCA TCAAAGTGGA GAGCCCTCCC 1740 

TTGGAAGAGT GGCCCTCCCC GGCCCCATCT TTCAAAGAGG AATCATCTCA CTCCTGGGAG 1800 

55 GATTCGTCCC AATCTCCCAC CCCAAGACCC AAGAAGTCCT ACAGTGGGCT TAGGTCCCCA 1860 

ACCCGGTGTG TCT CGGAAAT GCTTGTGATT CAACACAGGG AGAGGAGGGA GAGGAGCCGG 1920 

TCTCGGAGGA AACAGCATCT ACTGCCTCCC TGTGTGGATG AGCCGGAGCT GCTCTTCTCA 1980 

GAGGGGCCCA GTACTTCCCG CTGGGCCGCA GAGCTCCCGT TCCCAGCAGA CTCCTCTGAC 2040 

CCTGCCTCCC AGCTCAGCTA CTCCCAGGAA GTGGGAGGAC CTTTTAAGAC ACCCATTAAG 2100 

60 GAAACGCTGC CCATCTCCTC CACCCCGAGC AAATCTGTCC TCCCCAGAAC CCCTGAATCC 2160 

TGGAGGCTCA CGCCCCCAGC CAAAGTAGGG GGACTGGATT TCAGCCCAGT ACAAACCTCC 2220 

CAGGGTGCCT CTGACCCCTT GCCTGACCCC CTGGGGCTGA TGGATCTCAG CACCACTCCC 2280 

TTGCAAAGTG CTCCCCCCCT TGAATCACCG CAAAGGCTCC TCAGTTCAGA ACCCTTAGAC 2340 

CTCATCTCCG TCCCCTTTGG C AACTCT TCT CCCTCAGATA TAGACGTCCC CAAGCCAGGC 2400 

65 TCCCCGGAGC CACAGGTTTC TGGCCTTGCA GCCAATCGTT CTCTGACAGA AGGCCTGGTC 2460 

CTGGACACAA TGAATGACAG CCT C AGCAAG ATCCTGCTGG ACATCAGCTT TCCTGGCCTG 2520 

GACGAGGACC CACTGGGCCC TGACAACATC AACTGGTCCC AGTTTATTCC TGAGCTACAG 2580 

TAGAGCCCTG CCCTTGCCCC TGTGCTCAAG CTGTCCACCA TCCCGGGCAC TCCAAGGCTC 2640 

AGTGCACCCC AAGCCTCTGA GTGAGGACAG CAGGCAGGGA CTGTTCTGCT CCTCATAGCT 2700 

70 CCCTGCTGCC TGATTATGCA AAAGTAGCAG TCACACCCTA GCCACTGCTG GGACCTTGTG 2 760 

TTCCCCAAGA GTATCTGATT CCTCTGCTGT CCCTGCCAGG AGCTGAAGGG TGGGAACAAC 2 82 0 

AAAGGCAATG GTGAAAAGAG ATTAGGAACC CCCCAGCCTG TTTCCATTCT CTGCCCAGCA 2880 

GTCTCTTACC TTCCCTGATC TTTGCAGGGT GGTCCGTGTA AATAGTATAA ATTCTCCAAA 2940 

TTATCCTCTA ATTATAAATG TAAGCTTATT TCCTTAGATC ATTATCCAGA GACTGCCAGA 3000 

75 AGGTGGGTAG GATGACCTGG GGTTTCAATT GACTTCTGTT CCTTGCTTTT AGTTTTGATA 3060 

GAAGGGAAGA CCTGCAGTGC ACGGTTT CTT CCAGGCTGAG GTACCTGGAT CTTGGGTTCT 3120 

TCACTGCAGG GACCCAGACA AGTGGATCTG CTTGCCAGAG TCCTTTTTGC CCCTCCCTGC 3180 

CACCTCCCCG TGTTTCCAAG TCAGCTTTCC TGCAAGAAGA AATCCTGGTT AAAAAAGTCT 3240 

TTTGTATTGG GTCAGGAGTT GAATTTGGGG TGGGAGGATG GATGCAACTG AAGCAGAGTG 3300 

80 TGGGTGCCCA GATGTGCGCT ATTAGATGTT TCTCTGATAA TGTCCCCAAT CATACCAGGG 3360 

AGACTGGCAT TGACGAGAAC TCAGGTGGAG GCTTGAGAAG GCCGAAAGGG CCCCTGACCT 3420 

GCCTGGCTTC CTTAGCTTGC CCCTCAGCTT TGCAAAGAGC CACCCTAGGC CCCAGCTGAC 3480 

CGCATGGGTG TGAGCCAGCT TGAGAACACT AACTACTCAA TAAAAGCGAA GGTGGACAAA 3540 
AAAAAAAAAA AAAAA 

o5 

Seq ID NO: 73 Protein sequence: 
Protein Accession #: AAC51128.1 
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WO 02/086443 



1 11 21 

I 1 I 

MKTSPRRPLI LKRRRLPLPV QNAPSETSEE 
GIKI INHPTM PNTQWAI PN NANIHSIITA 
LRPQTQTSYD AKRTEVTLET LGPKPAARDV 
LSNIQWLRKM SSDGLGSRSI KQEMEEKENC 
YMAMIQFAIN STERKRMTLK DIYTWIEDHF 
ANGKVSFWTI HPSANRYLTL DQVFKPLDPG 
LGARRKMKPL LPRVSSYLVP IQFPVNQSLV 
PKVFGEQWF GYMSKFFSGD LRDFGTPITS 
KLLFGEGFSP LLPVQTIKEE EIQPGEEMPH 
EDSSQSPTPR PKKSYSGLRS PTRCVSEMLV 
SEGPSTSRWA AELPFPADSS DPASQLSYSQ 
SWRLTPPAKV GGLDFSPVQT SQGASDPLPD 
DLISVPFGNS SPSDIDVPKP GSPEPQVSGL 
LDEDPLGPDN INWSQFIPEL Q 



31 41 51 

1 I I 

EPKRSPAQQE SNQAEASKEV AESNSCKFPA 
LTAKGKESGS SGPNKFILIS CGGAPTQPPG 
NLPRPPGALC EQKRETCADG EAAGCT INNS 
HLEQRQVKVE EPSRPSASWQ NSVSERPPYS 
PYFKHIAKPG WKNSIRHNLS LHDMFVRETS 
SPQLPEHLES QQKRPNPELR RNMT I KTELP 
LQPSVKVPLP LAASLMSSEL ARHSKRVRIA 
LFNFIFLCLS VLLAEEGIAP LSSAGPGKEE 
LARPIKVESP PLEEWPSPAP SFKEESSHSW 
IQHRERRERS RSRRKQHLLP PCVDEPELLF 
EVGGPFKTP I KETLPISSTP SKSVLPRTPE 
PLGLMDLSTT PLQSAPPLES PQRLLSSEPL 
AANRSLTEGL VLDTMNDSLS KILLDISFPG 



Seq ID NO: 74 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 111-416 



1 11 

I 1 
GGGAAGAGCC AGGCTGAGCC 
TCATCCTTCT ACTCGTGACG 
CTCAAGCTGA GAGGTCCATA 
ATGACAAGAT TGAGAAGCCA 
TTAGTGCCTG TGACAAAAAG 
AGAATGAGGA TAAGAAGATT 
CAGACTACCA CAAGCAGAGC 
GCCCCACCAA TGGGCCTCCA 



21 31 

I I 
TTATAAAGGA CTGCTCTTTG 
CTTCCCAGCT CTGGCTTTTT 
ATAGGCATGA TCGACATGTT 
AGCCTGCTGA CGATGATGAA 
GGCACAAATT ACCTCGCCGA 
GATTTTTCTG AGTTTCTGTC 
CATGGAGCAG CGCCCTGTTC 
GAGACCCCAG GAACAATAAA 



41 51 

I I 
TCCAAACACA CACATCTCAC 
GAAAGCAAAG ATGAGCAACA 
TCACAAATAC ACCAGACGTG 
GGAGAACTTC CCCAACTTCC 
TGTCTTTGAG AAAAAGGACA 
CTTGCTGGGA GACATAGCCA 
CGGGGGCAGC CAGTGACCCA 
ATGTCTTCTC CCACCAGA 



Seq ID NO: 75 Protein sequence: 
Protein Accession # : Eos sequence 



1 11 21 31 41 51 

1 I I I 1 1 

MSNTQAERSI IGMIDMFHKY TRRDDKIEKP SLLTMMKENF PNFLSACDKK GTNYLADVFE 

KKDKNEDKKI DFSEFLSLLG DIATDYHKQS HGAAPCSGGS Q 



Seq ID NO: 76 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 111-416 

1 11 21 31 41 51 

I l l l l I 

GGGAAGAGCC AGGCTGAGCC TTATAAAGGA CTGCTCTTTG TCCAAACACA CACATCTCAC 

TCATCCTTCT ACTCGTGACA CTTCCCAGTT CTGGCTTTTT GAAAGCAAAG ATGAGCAACA 

CTCAAGCTGA GAGGTCCATA ATAGGCATGA TCGACATGTT TCACAAATAC ACCGGACGTG 

ATGGCAAGAT TGAGAAGCCA AGCCTGCTGA CGATGATGAA GGAGAACTTC CCCAATTTCC 

TCAGTGCCTG TGACAAAAAG GGCATACATT ACCTCGCCAC TGTCTTTGAG AAAAAGGACA 

AGAATGAGGA TAAGAAGATT GATTTTTCTG AGTTTCTGTC CTTGCTGGGA GACATAGCCG 

CAGACTACCA CAAGCAGAGC CATGGAGCGG CGCCCTGTTC TGGGGGAAGC CAGTGATCCA 

GCCCCACCAA GGGGCCTCCA GAGACCCCAG GAACAATAAG TGTCTCCTCC CACCAGA 

Seq ID NO: 77 Protein sequence: 
Protein Accession #: XP_048124.1 

1 11 21 31 41 51 

I I I i I I 

MSNTQAERSI IGMIDMFHKY TGRDGKIEKP SLLTMMKENF PNFLSACDKK GIHYLATVFE 
KKDKNEDKKI DFSEFLSLLG DIAADYHKQS HGAAPCSGGS Q 



Seq ID NO: 78 DNA sequence 
Nucleic Acid Accession #: Z73678.1 
Coding sequence: 253-2433 



1 11 

I I 

GGGGTGGTGC AGGGCAGGGG 
CAGAGAGGGA CGAACCAGGG 
CCTCGCACTC TATGGCCGTA 
CGCTGCACCG CACCTCGCCT 
CCTCCCGCCA CCATGAACCA 
GACCAGGACA ACTCCACGTT 
GGCAGGCAGC GCGTGCAGGA 
TCCCAGTCGT CCACCCTGAG 
AATTACAACT ATGGGACCAC 
GGCTCATGGG GATATCCGAT 
TTCAGCTCCT ACAGCCAGAT 
ACCACCGGCG CAGGCAGCGA 
CCCGACCTCT ACTGTGACCC 
CAGAAGACCA CCCAGAACCG 
AAGAAGTGCC CTGTGCGCCC 



21 31 

I I 
TGGTATATCC TGTCTGACGG 
TGGAAGCGCC AGGAGCAGCT 
GGGAGCCGCT GAGAGCGAGA 
CGCCTCTCTG CTCTCCTAGG 
CTCGCCGCTC AAGACCGCCT 
GGCTTTGCCG TCGGACCAAA 
GCAGGTGATG ATGACCGTCA 
CCACTCCAAT CGAGGTTCCA 
CAGCAGGAGC AGCTACTACT 
CTACAATGGA ACCCTCAAGC 
GGAGAACTGG AGCCGGCACT 
CATCTGCTTC ATGCAGAAAA 
ACGGGGCACC CTGCGCAAGG 
CTACAGCTTT TACAGCACCT 
GCCCTCTTGT GCCTCCAAGC 



41 51 

1 I 
AGGGCGGGCC TCGCCAGTGC 
GCAGGGAGCC CTCACGCGGA 
AGAGCACGCT CCTGCCCGCC 
CCCCGGCCGC GCGCCACCCG 
TGGCGTACGA ATGCTTCCAG 
AGATGAAAAC AGGCACGTCT 
AGCGGCAGAA GTCCAAGTCT 
TGTATGATGG CTTGGCTGAC 
CCAAGTTCCA GGCAGGGAAT 
GGGAGCCTGA CAACAGGCGC 
ACCCCCGGGG CAGCTGTAAC 
TCAAGGCGAG CCGCAGTGAG 
GCACGCTGGG CAGCAAGGGC 
GCAGTGGTCA GAAGGCCATA 
AGGACCCTGT GTATATCCCG 
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CCCATCTCCT GCAACAAGGA CCTGTCCTTT GGCCACTCTA GGGCCAGCTC CAAGATCTGC 960 

AGTGAGGACA TCGAGTGCAG TGGGCTGACC ATCCCCAAGG CTGTGCAGTA CCTGAGCTCC 1020 

CAGGATGAGA AGTACCAGGC CATTGGGGCC TATTACATCC AGCATACCTG CTT CCAGGAT 1080 

GAATCTGCCA AGCAACAGGT CTATCAGCTG GGAGGCATCT GCAAGCTGGT GGACCTCCTC 1140 

CGCAGCCCCA ACCAGAACGT CCAGCAGGCC GCGGCAGGGG CCCTGCGCAA CCTGGTGTTC 12 00 

AGGAGCACCA CCAACAAGCT GGAGACCCGG AGGCAGAATG GGATCCGCGA GGCAGTCAGC 1260 

CTCCTGAGGA GAACCGGGAA CGCCGAGATC CAGAAGCAGC TGACTGGGCT GCTCTGGAAC 13 20 

CTGTCTTCCA CTGACGAGCT GAAGGAGGAA CTCATTGCCG ACGCCCTGCG TGTTCTGGCC 1380 

GACCGCGTCA TCATTCCCTT CTCTGGCTGG TGCGATGGCA ATAGCAACAT GTCCCGGGAA 1440 

GTGGTGGACC CTGAGGTCTT CTTCAATGCC ACAGGCTGCT TGAGGAACCT GAGCTCGGCC 1500 

GATGCAGGCC GCCAGACCAT GCGTAACTAC TCAGGGCTCA TTGATTCCCT CATGGCCTAT 1560 

GTCCAGAACT GTGTAGCGGC CAGCCGCTGT GACGACAAGT CTGTGGAAAA CTGCATGTGT 162 0 

GTTCTGCACA ACCTCTCCTA CCGCCTGGAC GCCGAGGTGC CCACCCGCTA CCGCCAGCTG 1680 

GAGTATAACG CCCGCAACGC CTACACCGAG AAGTCCTCCA CTGGCTGCTT CAGCAACAAG 1740 

AG CG AC AAG A TGATGAACAA CAACTATGAC TGCCCCCTGC CTGAGGAAGA GACCAACCCC 1800 

AAGGGCAGCG GCTGGTTGTA CCATT CAGAT GCCATCCGCA CCTACCTGAA CCTCATGGGC 1860 

AAGAGCAAGA AAGATGCTAC CCTGGAGGCC TGTGCTGGTG CCCTGCAGAA CCTGACAGCC 1920 

AGCAAGGGGC TGATGTCCAG TGGCATGAGC CAGTTGATTG GGCTGAAGGA AAAGGGCCTG 1980 

CCACAAATTG CCCGCCTCCT GCAATCTGGC AACTCTGATG TGGTGCGGTC CGGAGCCTCC 2040 

CTCCTGAGCA ACATGTCCCG CCACCCTCTG CTGCACAGAG TGATGGGGAA CCAGGTGTTC 2100 

CCGGAGGTGA CCAGGCTCCT CACCAGCCAC ACTGGCAATA CCAGCAACTC CGAAGACATC 2160 

TTGTCCTCGG CCTGCTACAC TGTGAGGAAC CTGATGGCCT CGCAGCCACA ACTGGCCAAG 222 0 

CAGTACTTCT CCAGCAGCAT GCTCAACAAC ATCATCAACC TGTGCCGAAG CAGTGCCTCA 22 80 

CCCAAGGCCG CAGAAGCTGC CCGGCTTCTC CTGTCTGACA TGTGGTCCAG CAAGGAACTG 2340 

CAGGGTGTCC TCAGACAGCA AGGTTTCGAT AGGAACATGC TGGGAACCTT AGCTGGGGCC 240 0 

AACAGCCTCA GGAACTTCAC CTCCCGATTC TAAGAAGAGA CTGTCCAAGC AAGTTAGGCT 2460 

TGCAGGAAGA TATGACCCAG CTGAGAAGCC CTCAGGCCTC GCTGGATGGG GTTTTCTGTC 2520 

CATCCTGTGC AGTATTTGGG AAAGTTCACA AGAAACTGAG AAGAAACCTA AAAACTGTGG 2580 

ATAGTGGAAA GATTTTTAGA TTTTTTT ttt CCTTGGGGAA ACTGGCAGGC AATGGGGGTT 2 640 

AGGGAGGTTG GGGCGGGGGG GGCTTTCTTG AGTTAAAGGG GCTTATATGT GATGTCAATA 2700 

TTTCTTCCTC TGAGAAATGG TATATATATG TGTCTAATGT AAGTGTGTGC ATGCATGTGC 2760 

GCGTGCATGT GTGTGTGTGT GAGTGTCTTA AAGCATAACC ACAAACTGCA AAAAGCTAGG 2820 

TAAGCTATTT TGTTGCAGCT CATAAGGTGG TGAAAAGGAC TCTCCTGTGT TTCTTACTCA 2 880 

TAGGCAAGGA CAACATGTGC TTTTTGGTGA GCTGCTCATA ATTCCTGAAA TGTGTGGTGC 2 940 

CAGGGCAAGG GGGCCATCAC TGCAGTCAGG CCCTCAGAGG AGTCCTGCAG GCTTCCTACC 3 000 

AGTGGTCTCC AAGGGTGCAG GAGTAACTGG GGCTGGGCCA GCCTCCCCCC TTACAAGGCT 30 60 

GCTTTCCACG AAGGGAGGTC TGGTGTATCT CATGGGAGAA TCTGGGGTGT CTGTAGTGTC 312 0 

ACCCCTCCAG CAGCGCCACA AGGACTGAGG TTGGGTAGGT GTGAGGTTCC AGAGGACAGC 3180 

AGGACACTCT CGCATACTTT GCCAAATGAG GCCTGCTCAG AGGAGTAGGA GCTGAAAGAT 3240 

GGTGCCTTCC ACCCTCTTGG GCTGTGTGCC CATCAGAGCA GGCTCAGCCT GCAAAGGCCC 3 3 00 

TGCATTCAGA GGTCTTGTAA TCTACTTGTT GCAGGAGAAA GAAGGTAAAA AATGATTTTT 33 60 

T T AAG AAAAG CTATTTTATT GCAGCTCTTT CCCAAGAGCT GTTCTGGGAA TGGCTGGTCT 342 0 

TCATATTCCC AGTGGAGAGG GGAACAAGTG GGGCTGGGCA TATACCTATT CCGGCTTCTA 3480 

GTGGGATGGA GTTGGGGTAT AGAAATTAAC CAGGAAGATG TTTCCACCAA GCCTGCTGTG 3540 

AGTCAATTGA GGGAGTGTTT GGGTCCCAGG AGACT TGGAG GGGGGGAGTT TGGGTAGACT 3 60 0 

AGGAAAGGAA AGTGCCATAT CAGGGTACCG GTACCGGCAA GCTCACATCT CAGCCAGGGG 3 660 

CCATGCCCCA CTTCCCCTGA CCCCAGCTGT CTTGTCTCCA CTCTGTGAAA CCCACAGGGG 3720 

ATGTGATAAA CAGGGCTATT AGGGGTATCA GCCACGT CGA GCCCCCAGAC TCTGTGCACT 3 780 

TCAGACCAGC AG CAG CAGGA GGGCTCCCGA GGGCCTTATG AGAAAACCTG TGTGGACATC 3 840 

CCTTGGTGTA CACTAAGACA GAGCAGAGCC CAGCGCTCCC AAGCCTTCCT CCTTCCAGCT 3 90 0 

TCTACCTCCA TGCTAGCATT GCTGGTGTTA GAGAGGAATT AACTTCCTGG TCTGTGCCCT 3 960 

TCTCTAGAAG AATATAAGAT GCTCCTCCTC CTCACCCCTT CTCAGCCTCC TCCCAAGTCT 4020 

TCCTCTTCTG CACCACCCCC GAGTCCAAAC CCACCTCTTG CCCCAGCATT CAGGCTGGAA 40 80 

AACACTGATG TGGACTCAGT ATGACAACTG AGATGGGGGA AGCCAGACAT GTGAGGACGC 4140 

TGTCCTCCGA GAGGTGTCCC CGGCTGTTAG CCAGCTGTGC TGTGGTGCTG TGGGTCTGTC 42 00 

ATACCCTCCC TTGCTTCTGT TCACACTGGG AGGCCCACTC CTGGCTCACC TCTCCCTCTC 42 60 

AGGGACCCAC GTGGGAGCCT GGATCCCTGG ACTGTCCTGG GCATAGGTTT CAGGGGCCTC 432 0 

CTTTGTTGTC ATCAGAACCC AGAGGAATTC TTCTCCTAAA AAATACGTAT GGCATACCAA 43 80 

TCTGTGCGGG GCAGTGTCCT AAGCACTTAG ACTA CAT CAG GGAAGAACAC AGACCACATC 4440 

CCCGTCCTCA TGCGGCTTAT GTTTTCTGGA GGAAAGTGGA GACACAAGTC CTTGGCTTTA 450 0 

GGGCTCCCCC GGCTGGGGGC TGTGCAGTCC GGTCAGGGCG GGAGGGGAAA TGCACCGCTG 4560 

CATGTGAACC TTACCAGCCC AGGCGGATGC CCCTTCCCCT TAGCACTACC CTGGCCTCCT 4620 

GCATCCCCTC GCCTCATGTT CCTCCCACCT TCAAAGAATG AAGAGCCCCA TGGGCCCAGC 4680 

CCCTGCCCTG GGAACCAGGC AGCCTTCCAG ACCTCAGGGG CTGAGGCAGA CTATTAGGGC 4740 

AGGGCTGACT TTGGTGACAC TGCCCATTCC CTCTCAGGCC AGCTCAGGTC ACCCGGGCCT 4800 

CTGACCCAGG CCTGTCACTT TGAGAGGGGC AAAAC TGAGA GGGGCTTTTC CTAGAGAAAG 4860 

AGAACAAGGA GCTTGCCAGG CTTCATGTAG CCGACACACG TCTCAGGATT TTAAGT CCAC 4 92 0 

ATTGGCCTCA CACTAGCCTA GGCCAATGCC CAAAATAAGG AGTTCCAATT TGGGGCCAAA 4980 

TGAGGAAGGA CACAGACTCT GCCCTGGGAT CTCCTGTGCT AGCGGCCAAT GACAAATCCA 5040 

GTCATTGGCC ACCAGCCACC TCTGCAGTGG GGACCACACT AGCAGCCCTG ACTCCACACT 5100 

CCTCCTGGGG ACCCAAGAGG CAGTGTTGCT GTCTGCGTGT CCACCTTGGA ATCTGGCTGA 5160 

ACTGGCTGGG AGGACCAAGA CTG CGGCTGG GGTGGGCAGG GAAGGGAAGC CGGGGGCTGC 5220 

TGTGAGGGAT CTTGGAGCTT CCCTGTAGCC CACCTTCCCC TTGCTTCATG TTTGTAGAGG 5280 

AACCTTGTGC CGGCCAGGCC CAGTTTCCTT GTGTGATACA CTAATGTATT TGCTTTTTTT 5340 
GGAAATAGAG AAAATCAATA AATTGCTAGT GTTTCTTTGA AAAAAAAAA 

Seq ID NO: 79 Protein sequence: 
Protein Accession #: CAA98022.1 

1 11 21 31 41 51 

1 1 I I I 1 

MNHSPLKTAL AYECFQDQDN STIaALPSDQK MKTGTSGRQR VQEQVMMTVK RQKSKSSQSS 60 

TL SHSNRGSM YDGLADNYNY GTTSRSSYYS KFQAGNGSWG YPIYNGTLKR EPDNRRFSSY 12 0 

SQMENWSRHY PRGSCNTTGA GSDICFMQKI KASRSEPDLY CDPRGTLRKG TLGSKGQKTT 180 

QNRYSFYSTC SGQKAIKKCP VRPPSCASKQ DPVYIPPISC NKDLSFGHSR ASSKICSEDI 240 

ECSGLTIPKA VQYLSSQDEK YQAIGAYYIQ HTCFQDESAK QQVYQLGGIC KLVDLLRSPNT 300 

QNVQQAAAGA LRNLVFRSTT NKLETRRQNG IREAVSLLRR TGNAEIQKQL TGLLWNLSST 3 60 
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10 
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DELKEELIAD ALPVLADRVI IPFSGWCDGN SNMSREWDP 
QTMRNYSGLI DSLMAYVQNC VAASRCDDKS VENCMCVLHN 
RNAYTEKSST GCFSNKSDKM MNNNYDCPLP EEETNPKGSG 
DATLEACAGA LQNLTASKGL MSSGMSQLIG LKEKGLPQIA 
MSRHPLLHRV MGNQVFPEVT RDLTSHTGNT SNSEDILSSA 
SSMLNNIINL CRSSASPKAA EAARLLLSDM WSSKELQGVL 
NFTSRF 



Seq ID NO: 80 DNA sequence 

Nucleic Acid Accession #: NM_006516.1 

Coding sequence: 180-1658 



PCT/US02/12476 



EVFFNATGCL 
LSYRLDAEVP 
WLYHSDAIRT 
RLLQSGNSDV 
CYTVRNLMAS 
RQQGFDRNML 



RNLSSADAGR 
TRYRQLEYNA 
YLNLMGKSKK 
VRSGASLLSN 
QPQLAKQYFS 
GTLAGANSLR 



420 
480 
540 
600 
660 
720 



TAGTCGCGGG 
GTCAGAGTCG 
CGCACGCCCG 
TGGAGCCCAG 
TTGGCTCCCT 
AGGAGTT CTA 
TCACCACGCT 
TCTCTGTGGG 
TGCTGGCCTT 
TGCTGATCCT 
CCATGTATGT 
AGCTGGGCAT 
GCAACAAGGA 
GCATCGTGCT 
AGAACCGGGC 
TGCAGGAGAT 
AGCTGTTCCG 
CCCAGCAGCT 
CGGGGGTGCA 
CTGTCGTGTC 
TCGCTGGCAT 
TACCCTGGAT 
TGGGTCCTGG 
CAGCTGCCAT 
GCTTCCAGTA 
TGGTTCTGTT 
ATGAGATCGC 
AGCTGTTCCA 
GCCTGCTCCC 
AACCTGACAG 
CCAG AAGAAT 
AAATCTATTC 
ATATCAGCCT 
GAGGGTGGAG 
CTGGACCTAT 
GAGGTGGCTA 
CATTAGGATT 
CCTGAGACCA 
GCCGGGTTCT 
GGGAGCCTGC 
TGCAAGATAT 
ATATCTGGAC 
TATAAATGGC 
TTTGGATGGG 
GACT CAGGAT 
TTTGATCCCT 
ATCACATATT 
AGGCTTGAAA 



11 
I 

TCCCCGAGTG 
CAGTGGGAGT 
TCGCCACCCG 
CAGCAAGAAG 
GCAGTTTGGC 
CAACCAGACA 
CTGGTCCCTC 
CCTTTTCGTT 
CGTGTCCGCC 
GGGCCGCTTC 
GGGTGAAGTG 
CGTCGTCGGC 
CCTGTGGCCC 
GCCCTTCTGC 
CAAGAGTGTG 
GAAGGAAGAG 
CTCCCCCGCC 
GTCTGGCATC 
GCAGCCTGTG 
GCTGTTTGTG 
GGCGGGTTGT 
GTCCTATCTG 
CCCCATCCCA 
TGCCGTTGCA 
TGTGGAGCAA 
CTTCATCTTC 
TTCCGGCTTC 
TCCCCTGGGG 
AGCAGCCCTA 
ATGTCAGCCG 
ATTCAGGACT 
AGACAAGCAA 
GAGTCTCCTG 
ACTAAGCCCT 
GTCCTAAGGA 
TGGCCACCCG 
TGCCCCTTCC 
GTTGGGAGCA 
AGTCTCCTTT 
AAACTCACTG 
TTATATATAT 
AAGCCAACTT 
TGGTTTTTAG 
AGTGAGACAG 
CCAGTCCCTT 
GTTACCCAGA 
TGATAGTTGG 
TCGCATTATT 



21 

I 

AGCACGCCAG 
CCCCGGACCG 
CGTACCCGGC 
CTGACGGGTC 
TACAACACTG 
TGGGTCCACC 
TCAGTGGCCA 
AACCGCTTTG 
GTGCTCATGG 
ATCATCGGTG 
TCACCCACAG 
ATCCTCATCG 
CTGCTGCTGA 
CGCGAGAGTC 
CTAAAGAAGC 
AGTCGGCAGA 
TACCGCCAGC 
AACGCTGTCT 
TATGCCACCA 
GTGGAGCGAG 
GCCATACTCA 
AGCATCGTGG 
TGGTTCATCG 
GGCTTCTCCA 
CTGTGTGGTC 
ACCTACTTCA 
CGGCAGGGGG 
GCTGATTCCC 
AGGATCTCTC 
AGCCGGGCCT 
TAACGGCTCC 
CAGGTTTTAT 
TGCCCACATC 
GTCGAGACAC 
CACACTAATC 
TTCTGCTGGC 
CATCTCTTCC 
CTGGAGTGCA 
GCACTGAGGG 
CTCAAGAAGA 
TTTTGGTTGT 
GTAAATACAC 
AAACATGGTT 
AAGTAAGTGG 
ACACGTACCT 
GAATATATAC 
TGTTCAAAAA 
TTGAATGTGA 



31 

i 

GGAGCAGGAG 
GAGCACGAGC 
GCAGCCAGAG 
GCCTCATGCT 
G AGT CAT CAA 
GCTATGGGGA 
TCTTTTCTGT 
GCCGGCGGAA 
GCTTCTCGAA 
TGTACTGCGG 
CCTTTCGTGG 
CCCAGGTGTT 
GCATCATCTT 
CCCGCTTCCT 
TGCGCGGGAC 
TGATGCGGGA 
CCATCCTCAT 
TCTATT ACT C 
TTGGCTCCGG 
CAGGCCGGCG 
TGACCAT CGC 
CCATCTTTGG 
TGGCTGAACT 
ACTGGACCTC 
CCTACGTCTT 
AAGTTCCTGA 
GAGCCAGCCA 
AAGTGTGAGT 
AGGAGCACAG 
GGGGCTCCTT 
AGGATTTTAA 
AATTTTTTTA 
CCAGGCTTCA 
TTGCCTTCTT 
GAACTATGAA 
CTGGATCTCC 
TACCCAACCA 
GGGAGGAGAG 
CCACACTATT 
CATGGAGACT 
CAATATTAAA 
CACCTCACTC 
TTGAAATGCT 
GGT TGCAACC 
CTCATCAGTG 
ATTCTTTATC 
AACACTAGTT 
AGGGAA 



41 

1 

ACCAAACGAC 
CTGAGCGGGA 
CCACCAGCGC 
GGCTGTGGGA 
TGCCCCCCAG 
GAGCATCCTG 
TGGGGGCATG 
TTCAATGCTG 
ACTGGGCAAG 
CCTGACCACA 
GGCCCTGGGC 
CGGCCTGGAC 
CATCCCGGCC 
GCTCATCAAC 
AGCTGACGTG 
GAAGAAGGTC 
CGCTGTGGTG 
CACGAGCATC 
TATCGTCAAC 
GACCCTGCAC 
GCTAGCACTG 
CTTTGTGGCC 
CTTCAGCCAG 
AAATTTCATT 
CATCATCTTC 
GACTAAAGGC 
AAGTGATAAG 
CGCCCCAGAT 
GCAGCTGGAT 
TCTCCAGCCA 
CAAAAGCAAG 
TTACTGATTT 
CCCTGAATGG 
CACCCAGCTA 
CTACAAAGCT 
CCACTCTAGG 
CTCAAATTAA 
GGGAAGGGCC 
ACCATGAGAA 
CCTGCCCTGT 
TACAGACACT 
CTGTTACTTA 
TGTGGATTGA 
ACTGCAACGG 
TCCTCTTGCT 
TTGACATTCA 
TTGTGCCAGC 



51 

I 

GGGGGTCGGA 
GAGCGCCGCT 
AGCGCTGCCA 
GGAGCAGTGC 
AAGGTGATCG 
CCCACCACGC 
ATTGGCTCCT 
ATGATGAACC 
TCCTTTGAGA 
GGCTTCGTGC 
ACCCTGCACC 
TCCATCATGG 
CTGCTGCAGT 
CGCAACGAGG 
ACCCATGACC 
ACCATCCTGG 
CTGCAGCTGT 
TTCGAGAAGG 
ACGGCCTTCA 
CTCATAGGCC 
CTGGAGCAGC 
TTCTTTGAAG 
GGTCCACGTC 
GTGGGCATGT 
ACTGTGCTCC 
CGGACCTTCG 
ACACCCGAGG 
CACCAGCCCG 
GAGACTTCCA 
GCAATGATGT 
ACTGTTGCTC 
TGTTATTTTT 
TTCCATGCCT 
AT CTGTAGGG 
TCTATCCCAG 
GGTCAGGCTC 
TCTTTCTTTA 
AGTCTGGGCT 
GAGGGCCTGT 
TGTGTATAGA 
AAGTTATAGT 
CCTAAACAGA 
GGGTAGGAGG 
CTTAGACTTC 
CAAAAATCTG 
AGGCATTTCT 
CGTGATGCTC 



Seq ID NO: 81 Protein sequence: 
Protein Accession #: NP 006507.1 



MEPSSKKLTG 
LTTLWSLSVA 
MLILGRFIIG 
GNKDLWPLLL 
LQEMKEESRQ 
AGVQQPVYAT 
DPWMSYLSIV 
CFQYVEQLCG 
ELFHPLGADS 



11 
I 

RLMLAVGGAV 
IFSVGGMIGS 
VYCGLTTGFV 
SIIFIPALLQ 
MMREKKVTIL 
IGSGIVNTAF 
AIFGFVAFFE 
PYVFIIFTVL 
QV 



21 

I 

LGSLQFGYNT 
FSVGLFVNRF 
PMYVGEVSPT 
CIVLPFCPES 
ELFRSPAYRQ 
TWSLFWER 
VGPGPIPWFI 
LVLFFIFTYF 



31 
I 

GVINAPQKVI 
GRRNSMLMMN 
AFRGALGTLH 
PRFLLINRNE 
PILIAWLQL 
AGRRTLHLIG 
VAELFSQGPR 
KVPETKGRTF 



41 

I 

EEFYNQTWVH 
LLAFVSAVLM 
QLGIWGILI 
ENRAKSVLKK 
SQQLSGINAV 
LAGMAGCAIL 
PAAIAVAGFS 
DEIASGPRQG 



51 

I 

RYGESILPTT 
GFSKLGKSFE 
AQVFGLDSIM 
LRGTADVTHD 
FYYSTSIFEK 
MTIALALLEQ 
NWTSNFIVGM 
GASQSDKTPE 



Seq ID NO: 82 DNA sequence 
Nucleic Acid Accession #: BC001291 
Coding sequence: 44-541 

1 11 21 31 41 51 

I 1 I I I I 

GGGGGCGCCG CGCGCTGACC CTCCCTGGGC ACCGCTGGGG ACGATGGCGC TGCTCGCCTT 
GCTGCTGGTC GTGGCCCTAC CGCGGGTGTG GACAGACGCC AACCTGACTG CGAGACAACG 



60 
120 
180 
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60 
120 
180 
240 
300 
360 
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60 
120 
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AGATCCAGAG GACTCCCAGC GAACGGACGA 
TGAGAGAGAA AACACTTTCG AGTGCCAGAA 
CTGCGTTATA GCGGCCGTGA AAATATTTCC 
CGCTGGTTGT GCAGCGATGG AGAGACCCAA 
GCCCATGCCC TTCTTTTACC TCAAGTGTTG 
ACCTATCAAC TCATCAGTGT TCAAAGAATA 
GCTGTGGCTG GCCATCCTCC TGCTGCTGGC 
AGCCACGGGA CTGCCACAGA CTGAGCCTTC 
ACCTGTTGCA TTAAACTTGT TTTCTGTTGA 
GGGATGGGAG AGTGGGGATC AGGTGCAGTT 
ACATTCAGAG GAAGTCCAGA TCTCCTGAGT 
AAATCAAACC TTGTAACTCA TTTATTGCTG 
CCTCTGAGGG CTTCAGTATT GATGGGGAGG 
TGCTGAGATG CTTCCGACCT TTCAGGTGAC 
GGGTGAAGAC ATCCCTGGAG TGAAGGACTC 
AGGGCTGCCC CCATTCCAGT GGTGGAGGCG 
CTACCAGATT CCAGGAGGCA GAAGATAACT 
ACCAGCTGGC ACAGGTGCAC AGATTCATAA 
ACTTAGGCCA AGTAGAGAGC ATCAGGGTAA 
CATCCATGGG GAGCTGAGAA ATCAGACTCA 
TTCAAAAGTT CACGAAAAAA AAAAAAAAAA 

Seq ID NO: 83 Protein sequence: 
Protein Accession #: AAH01291 



PCT/US02/12476 



GGGTGACAAT 
CCCAAGGAGG 
ACGTTTTTTC 
GCCAGAGGAG 
TAAAATTCGC 
TGCTGGGAGC 
CTCCATTGCA 
CGGAGCATGG 
TTACCTCTTG 
GGCTCTTAAC 
AGTGATTTTG 
ATGGCCACTC 
GAGGCCTAAG 
GCAGGAACAC 
CTCAGCATGG 
CTGTGGATGG 
AATTGTGTTG 
ATTCCCACAC 
ATGGCGTTCA 
AAGTTCCACC 
AAAAAAAAAA 



AGAGTGTGGT 
TGCAAATGGA 
ATGGTTGCGA 
AAGCGGTTTC 
TACTGCAATT 
ATGGGTGAGA 
GCCGGCCTCA 
ACTCGCTCCA 
GTTTGACTTC 
CCT CAAGGGT 
GTGACAAGTT 
TTTTCCTTGA 
TACCACTCAT 
TGGGGGAGTC 
GGGGCAGTGG 
CTGCTTTTCC 
AAGAAACT T A 
GTGTGTGTTC 
TTTCTCTGTT 
AAAAACAAAT 
AAAAAAAAAA 



GTCATGTTTG 
CAGAGCCATA 
AGCAGTGCTC 
TCCTGGAAGA 
TAGAGGGGCC 
GCTGTGGTGG 
GCCTGTCTTG 
GACCGTTGTC 
CCAGGGTCTT 
TCTTTAACTC 
TTTCTCTTTG 
CTCCCCTCTG 
GGAGAGTATG 
TGAATGATTG 
GGCACACGTT 
TCAACCTTTC 
GACTTCACCC 
AACAT CTGAA 
AAGATGCAGC 
ACAAGGGGAC 
AAA 



11 



21 



51 



31 41 

I 1 i 

MALLALLLW ALPRVWTDAN LTARQRDPED SQRTDEGDNR VWCHVCEREN TFECQNPRRC 
KWTEPYCVIA AVKIFPRFFM VAKQCSAGCA AMERPKPEEK RFLLEEPMPF FYLKCCKIRY 
CNLEGPPINS SVFKEYAGSM GESCGGLWLA ILLtLLASIAA GLSLS 

Seq ID NO: 84 DNA sequence 

Nucleic Acid Accession #: NM__022893.1 

Coding sequence; 229-2 726 



TGCGCCATCT 
TTTTCTCTGG 
CGCCCGCCGC 
AAGCAAGGCA 
ATTCTTACAG 
CTCCTCACCT 
GAGCACAAAC 
CCTTCCCCTT 
GTCACG CCAG 
GAACACATAG 
GGAGCTCTAA 
GATGAGCCCA 
CTCTTGCAAC 
AGTCCCCTGA 
CCACCTCTCC 
GGATCAGTAT 
CTGTTTAGTC 
GAAGAAATGG 
CCAATGGCTA 
AACACGTCTA 
CCATTCCAGC 
TCCGCCCCTC 
ACGTTCAAAT 
TACAAGTGCA 
AAGACGCACA 
GCCAGCTCCC 
TCCGTGGTGG 
GAGGAGGAAG 
CTGACGGAGA 
CACGAGAACA 
GACGTCATGC 
GTCCTGGGCG 
TGCGACGAAG 
CGCGGCTGCT 
AGCCCCAGCT 
CCCCCGGCCA 
GCCTCCAGGC 
GCCTCCTCGT 
GAGCTGGACG 
ATTAGTGGTC 
GAGTACTGTG 
ACGGGCGAAA 
CTCACCAGGC 
TGTAAGATGC 
GATCGAGTGT 
CTCCCACCTG 
CCTGTAGGAT 
ACGAAGCTAA 
TTCTTTTTTC 



11 

1 

TTTTTTGCTT 
TTGTATTATT 
AGTCTCCTTC 
CGCCGCCGCC 
AACCCCAGCA 
ATGATGAACC 
GTGGGCAGTG 
GGAAACAATG 
CACCAATCGA 
AAGATGACGA 
CAGATAAACT 
TCCCCACGCC 
GCAGCTACAC 
ACGCACAGAA 
CCCCGCGGGT 
ATGGGATTCA 
CGAGAGAGGC 
CACCACCGAG 
CCCTGGCCAC 
TGGAGCCTCC 
GCCCACCGCT 
CAGGTAGCAA 
CTCCCTCCCA 
TTCAGAGCAA 
ACCTGTGCGA 
TGCACAAATC 
CGGAACCCGG 
CCAAGTTCAA 
AGGAGGACGA 
GCGAGAGGGT 
GCTCGCGGGG 
AGGGCATGGT 
AGAAGCATAA 
ACTCGGTGGC 
CCCCGGGCGA 
CGCTGAGCCC 
CGATGCCCAA 
AGCTCAAAGA 
CGGAGCACTC 
GAGGGATCTC 
CGGGCACGGG 
GGAAAGTCTT 
GGCCTTATAA 
ACATGAAAAC 
CTTTTAGCGT 
TGAATAATGA 
ACACCCCCTT 
TTTTTTCTAG 
GAATATGAGA 
TTTTTCCTTT 



21 
I 

AAAAAAAAGC 
TCTAATTTAT 
TTTCTAACCC 
GCCGCCGCCG 
CTTAAGCAAA 
AGACCACGGC 
CCAGATGAAC 
CAATGGCAGC 
GATGAAAAAA 
TTGTTTATCA 
TCTGCACTGG 
TGGGATGAGT 
ATGTACAACT 
CACTCATGGA 
TGGTATCCCT 
TATTGCAGAC 
TTCCGGCCTG 
ACATCACTTG 
CCATCACCCG 
CGCCATGGAT 
GTCCCCAGGC 
GCCGCCCTTC 
GCCCCCGGTC 
CCTGGTGGTG 
CCACGCGTGC 
GTCCCCCATG 
CACCAGCGAC 
GAGCGAGAAC 
CGAGGAAGAG 
GGACTACGGC 
CGCGGTCGTG 
GCTCAGCTCC 
GCGCGGCCAC 
CGGCGAGTCG 
GTCGGCCTCG 
CTTCTCTAAG 
CACGGAGAAC 
TCCCTTCCTT 
CTCGGAGAAC 
GGGGCGCAGC 
CAGGCCCAGC 
CAAGAACTGT 
ATGCGAGCTG 
GCATGGCCAG 
GTACAGTACC 
TATAAAAACT 
TTTCACCACT 
TCCCATGTGA 
GTGCTTGTCA 



31 
I 

CATGACGGCT 
TTTGGATGTC 
GGCTCTCCCG 
CCCGCCCCGC 
CGGGAATTCT 
CCGTTGGGAG 
TTCCCATTGG 
CTCTGCTTAG 
GCATCCAATC 
ACGTCAT CT A 
AGGGGCCTCT 
GCAGAATATG 
TGCAAACAGC 
TTAAGAATCT 
TCAGGACTAG 
AATAACCCCT 
GCAGAAGGGC 
GACCCCCACC 
AGTGCCTTTG 
TTCTCTAGGA 
CGGCCCAGCC 
CTGGCGACGC 
AAGTCCAAGT 
CACCGGCGCA 
ACCCAGGCCA 
ACGGTCAAGT 
TTGGTGGGCA 
GACCCCAACC 
GAAGAAGAGG 
TTCGGGCTGA 
GGCGTGGGCG 
ATGCAGCACT 
CTGGCCGAGG 
GACCGCATAG 
GGGGGCCTGT 
CGCATCAAGC 
GTGTACTCGC 
AGCTTCGGAG 
GGGAGCTTGC 
GGCACGGGAA 
TCAAAAGAGG 
AGCAATCTCA 
TGCAACTATG 
GTGGGGAAGG 
CTGGAGAAAC 
GAATAGAGGT 
CCCTTTCCCC 
TTTAAACAAA 
CCAGCACACC 
TCCTTTATGT 



41 
I 

CTCCCACAAT 
AAAAGGCACT 
ATGTGAACCG 
AGCCCACCAT 
CGCCCGAGCC 
CTCCAGAAGG 
GGGACATTCT 
AAAAAGCTGT 
CCGTGGAGGT 
GAAGAATTTG 
CCTCCCCTCG 
CCCCGCAGGG 
CATTCACCAG 
ACTTAGAAAG 
GTGCAGAATG 
TTAACCTGCT 
GCTTTCCACC 
GCATAGAGCG 
ACAGGGTGCT 
GACTTAGAGA 
CTATGCAAAG 
CCCCCCTCCC 
CATGCGAGTT 
GCCACACGGG 
GCAAGCTGAA 
CCGACGACGG 
GCGCCAGCAG 
TGATCCCGGA 
AGGAAGAGGA 
GCCTGGAGGC 
ACGAGAGCCG 
TCAGCGAGGC 
CCGAGGGCCA 
ACGATGGCAC 
CCAAAAAGCT 
TCGAGAAGGA 
AGTGGCTCGC 
ACTCCAGACA 
GCTTCTCCAC 
GTGGAGGGAG 
GCAGACGCAG 
CTGT C CACAG 
CCTGTGCCCA 
ACGTTTACAA 
ACATGAAAAA 
ATATTAATAC 
ATCGCCCTCC 
CAAACAAACA 
TGTTTTTTTT 
TCTCACCGTT 



51 
I 

TCATCTTCCC 
GATGAAGATA 
AGCCGTCGTC 
GTCTCGCCGC 
TCTTGAAGCC 
GGAT CATGAC 
TATTTTTATC 
GGATAAGCCA 
TGGCATCCAG 
CCCCAAACAG 
TTCTGCACAT 
TATTTGTAAA 
TGCATGGTTT 
CGAACACGGA 
TCCTTCCCAG 
AAGAATACCA 
CACTCCCCCC 
CCTGGGGGCG 
GCGGTTGAAT 
GCTGGCAGGG 
GTTACTGCAA 
TCCTCTGCAA 
CTGCGGCAAG 
CGAGAAGCCC 
GCGCCACATG 
TCTCTCCACC 
CGCGCTCAAG 
GAACGGGGAC 
GGAGGAGGAG 
GGCGCGCCAC 
CGCCCTGCCC 
CTTCCACCAG 
CAGGGACACT 
TGTTAATGGC 
GCTGCTGGGC 
GTTCGACCTG 
CGGCTACGCG 
ATCGCCTTTT 
ACCGCCCGGG 
CACGCCCCAT 
CGACACT TGT 
GAGAAGCCAC 
GAGTAGCAAG 
ATGTGAAATT 
ATGGCACAGT 
CCCTCCCTCA 
AGCCCCACTC 
AACAGAAGTA 
CTTTTTCTTT 
TGAATGCATG 



180 
240 
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60 
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ATCTGTATGG GGCAATACTA TTGCATTTTA CGCAAACTTT GAGCCTTTCT CTTGTGCAAT 3060 

AATTTACATG TTGTGTATGT TTTTTTTTAA ACTTAGACAG CATGTATGGT ATGTTATGGC 3120 

TATTTTAAAT TGTCCCTAAT TCGTTGCTGA GCAAACATGT TGCTGTTTCC AGTTCCGTTC 3180 

TGAGAGAAAA AGAGAGAGAG AGAGAAAAAG ACCATGCTGC ATACATT CTG TAATACATAT 3240 

5 CATGTACAGT TTTATTTTAT AACGTGAGGA GGAAAAACAG TCTTTGGATT AACCCTCTAT 33 00 

AGACAGAATA GATAGCACTG AAAAAAAATC TCTATGAGCT AAATGTCTGT CTCTAAAGGG 33 60 

TTAAATGTAT CAATTGGAAA GGAAGAAAAA AGGCCTTGAA TTGACAAATT AACAGAAAAA 3420 

CAGAACAAGT TTATTCTATC ATTTGGTTTT AAAATATGAG TGCCTTGGAT CTATTAAAAC 3480 

CACAT CGATG GTTCTTTCTA CTTGTTATAA ACTTGTAGCT TAATTCAGCA TTGGGTGAGG 3540 

10 TAATAAACCT TAGGAACTAG CATATAATTC TATATTGTAT TTCTCACAAC AATGGCTACC 3600 

TAAAAAGATG ACCCATTATG TCCTAGTTAA TCATCATTTT TCCTTTAGTT TAATTTTATA 3660 

AACAAAACTG ATTATACCAG TATAAAAGCT ACTTTGCTCC TGGTGAGAGC TTAAAAGAAA 372 0 

TGGGCTGTTT TGCCCAAAGT TTTATTTTTT TTAAACAATG ATTAAATTGA ATGTGTAATG 3780 

TGCAAAAGCC CTGGAACGCA ATTAAATACA CTAGTAAGGA GTTCATTTTA TGAAGATATT 3840 

15 TGCTTTAATA ATGTCTTTTT AAAAATACTG GCACCAAAAG AAATAGATCC AGATCTACTT 3900 

GGTTGTCAAG TGGACAATCA AATGATAAAC TTTAAGACCT TGTATACCAT ATTGAAAGGA 3 960 

AGAGGCTGAC AATAAGGTTT GACAGAGGGG AACAGAAGAA AATAATATGA TTTATTAGCA 4020 

CAACGTGGTA CTATTTGCCA TTTAAAACTA GAACAGGTAT ATAAGCTAAT ATTGATACAA 4080 

TGATGATTAA CTATGAATTC TTAAGACTTG CATTTAAATG TGACATTCTT AAAAAAAGAA 4140 

20 GAGAAAGAAT TTTAAGAGTA GCAGTATATA TGTCTGTGCT CCCTAAAAGT TGTACTTCAT 4200 

TTCTTTTCCA TACACTGTGT GCTATTTGTG TTAACATGGA AGAGGATTCA TTGTTTTTAT 4260 

TTTTATTTTT TTAATTTTTT CTTTTTTATT AAGCTAGCAT CTGCCCCAGT TGGTGTTCAA 4320 

ATAGCACTTG ACTCTGCCTG TGATATCTGT ATCTTTTCTC TAATCAGAGA TACAGAGGTT 4380 

GAGTATAAAA TAAACCTGCT CAGATAGGAC AATTAAGTGC ACTGTACAAT TTTCCCAGTT 4440 

25 TACAGGTCTA TACTTAAGGG AAAAGT TGC A AGAATGCTGA AAAAAAATTG AACACAATCT 4500 

CATTGAGGAG CATTTTTTAA AAACTAAAAA AAAAAAAACT TTGCCAGCCA TTTACTTGAC 4560 

TATTGAGCTT ACTTACTTGG ACGCAACATT GCAAGCGCTG TGAATGGAAA CAGAATACAC 4620 

TTAACATAGA AATGAATGAT TGCTTTCGCT TCTACAGTGC AAGGATTTTT TTGTACAAAA 4680 

CTTTTTTAAA TATAAATGTT AAGAAAAATT TTTTTTAAAA AACACTTCAT TATGTTTAGG 4740 

30 GGGGAACTGC ATTTTAGGGT TCCATTGTCT TGGTGGTGTT ACAAGACTTG TTATCCATTT 4800 

AAAAATGGTA GTGGAAATTC TATGCCTTGG ATACACACCG CTGTTCAGGT TGTAAAAAAA 4860 

AAAAACATAC ATTGGGGAAA GGTTTAAGAT TATATAGTAC TTAAATATAG GAAAATGCAC 4920 

ACTCATGTTG ATTCCTATGC TAAAATACAT TTATGGTCTT TTTTCTGTAT TTCTAGAATG 4980 

GTATTTGAAT TAAATGTTCA T CTAGTGTT A GGCACTATAG TATTTATATT GAAGCTTGTA 5040 

35 TTTTTAACTG TTGCTTGTTC TCTTAAAAGG TATCAATGTA CCTTTTTTGG TAGTGGAAAA 5100 

AAAAAAGACA GGCTGCCACA GTATATTTTT TTAATTTGGC AGGATAATAT AGTGCAAATT 5160 

ATTTGTATGC TTCAAAAAAA AAAAAAAGAG AGAAACAAAA AAGTGTGACA TTACAGATGA 5220 

GAAGCCATAT AATGGCGGTT TGGGGGAGCC TGCTAGAATG TCACATGGAT GGCTGTCATA 5280 

GGGGTTGTAC ATATCCTTTT TTGTTCCTTT TTCCTGCTGC CATACTGTAT GCAGTACTGC 5340 

40 AAGCTAATAA CGTTGGTTTG TTATGTAGTG TGCTTTTTGT CCCTTTCCTT CTATCACCCT 54 00 

ACATTCCAGC ATCTTACCTT CATATGCAGT AAAAGAAAGA AAGAAAAAAA AAGGAAAAAA 5460 

AAAAAAAAAC CAATGTTTTG CAGTTTTTTT CATTGCCAAA AACTAAATGG TGCTTTATAT 5520 

TTAGATTGGA AAGAATTTCA TATGCAAAGC ATATTAAAGA GAAAGCCCGC TTTAGTCAAT 5580 

ACTTTTTTGT AAATGGCAAT GCAGAATATT TTGTTATTGG CCTTTTCTAT TCCTGTAATG 5640 

45 AAAGCTGTTT GTCGTAACTT GAAATTTTAT CTTTTACTAT GGGAGTCACT ATTTATTATT 5700 

GCTTATGTGC CCTGTTCAAA ACAGAGGCAC TTAATTTGAT CTTTTATTTT TCTTTGTTTT 5760 

TATTTTTTTT TTTATTTAGA TGACCAAAGG TCATTACAAC CTGGCTTTTT ATTGTATTTG 582 0 

TTTCTGGTCT TTGTTAAGTT CTATTGGAAA AACCACTGTC TGTGTTTTTT TGGCAGTTGT 5880 

CTGCATTAAC CTGTTCATAC ACCCATTTTG TCCGTTTATT GAAAAAATAA AAAAAATTAA 5940 

50 a 

Seq ID NO: 85 Protein sequence: 
Protein Accession #•• NP_075044.1 

55 1 11 21 31 41 51 

I I I I I I 

MSRRKQGKPQ HLSKREFSPE PLEAILTDDE PDHGPLGAPE GDHDLLTCGQ CQMNFPLGDI 60 

LIFIEHKRKQ CNGSLCLEKA VDKPPSPSPI EMKKASNPVE VGIQVTPEDD DCLSTSSRRI 120 

^ _ CPKQEHIADK LLHWRGLSSP RSAHGALIPT PGMSAEYAPQ GICKDEPSSY TCTTCKQPFT 180 

60 SAWFLLQHAQ NTHGLRIYLE SEHGSPLTFR VGIPSGLGAE CPSQPPIiHGI HIADNNPFNL 240 

LRIPGSVSRE ASGLAEGRFP PTPPLFSPPP RHHLDPHRIE RLGAEEMALA THHPSAFDRV 3 00 

LRLNPMAMEP PAMDFSRRLR ELAGNTSSPP LSPGRPSPMQ RLLQPFQPGS KPPFLATPPL 360 

PPLQSAPPPS QPPVKSKSCE FCGKTFKFQS NLWHRRSHT GEKPYKCNLC DHACTQASKL 42 0 

KRHMKTHMHK SSPMTVKSDD GLSTASSPEP GTSDLVGSAS SALKSWAKF KSENDPNLIP 4 80 

65 ENGDEEEEED DEEEEEEEEE EEEEL.TESER VDYGFGLSLE AARHHENSSR GAWGVGDES 540 

RALPDVMQGM VLSSMQHFSE AFHQVLGEKH KRGHLAEAEG HRDTCDEDSV AGESDRIDDG 600 

TVNGRGCSPG ESASGGLSKK LLLGSPSSLS PFSKRIKLEK EFDLPPATMP NTENVYSQWI* 660 

AGYAASRQLK DPFLSFGDSR QSPFASSSEH SSENGSLRFS TPPGELDGGI SGRSGTGSGG 720 

STPHISGPGT GRPSSKEGRR SDTCEYCGKV FKNCSNLTVH RRSHTGERPY KCELCNYACA 780 
70 QSSKLTRHMK THGQVGKDVY KCEICKMPFS VYSTLEKHMK KWHSDRVLNN DIKTE 



Seq ID NO: 86 DNA sequence 
Nucleic Acid Accession #: XM_0352 92.2 
75 Coding sequence: 53-1576 

1 11 21 31 41 51 

1)11)1 

GCTCGCTGGG CCGCGGCTCC CGGGTGTCCC AGGCCCGGCC GGTGCGCAGA GCATGGCGGG 60 

80 TGCGGGCCCG AAGCGGCGCG CGCTAGCGGC GCCGGCGGCC GAGGAGAAGG AAGAGGCGCG 120 

GGAGAAGATG CTGGCCGCCA AGAGCGCGGA CGGCTCGGCG CCGGCAGGCG AGGGCGAGGG 180 

CGTGACCCTG CAGCGGAACA TCACGCTGCT CAACGGCGTG GCCATCATCG TGGGGACCAT 240 

TATCGGCTCG GGCATCTTCG TGACGCCCAC GGGCGTGCTC AAGGAGGCAG GCTCGCCGGG 300 

GCTGGCGCTG GTGGTGTGGG CCGCGTGCGG CGTCTTCTCC ATCGTGGGCG CGCTCTGCTA 3 60 

85 CGCGGAGCTC GGCACCACCA TCTCCAAATC GGGCGGCGAC TACGCCTACA TGCTGGAGGT 420 

CTACGGCTCG CTGCCCGCCT TCCTCAAGCT CTGGATCGAG CTGCTCATCA TCCGGCCTTC 480 

ATCGCAGTAC AT CGTGGCCC TGGTCTTCGC CACCTACCTG CTCAAGCCGC TCTTCCCCAC 540 
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80 
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CTGCCCGGTG CCCGAGGAGG CAGCCAAGCT CGTGGCCTGC CTCTGCGTGC TGCTGCTCAC 600 

GGCCGTGAAC TGCTACAGCG TGAAGGCCGC CACCCGGGTC CAGGATGCCT TTGCCGCCGC 660 

CAAGCTCCTG GCCCTGGCCC TGATCATCCT GCTGGGCTTC GTCCAGATCG GGAAGGGTGA 720 

TGTGTCCAAT CTAGATCCCA ACTTCTCATT TGAAGGCACC AAACTGGATG TGGGGAACAT 780 

TGTGCTGGCA TTATACAGCG GCCTCTTTGC CTATGGAGGA TGGAATTACT TGAATTTCGT 840 

CACAGAGGAA ATGATCAACC CCTACAGAAA CCTGCCCCTG G C CAT CAT C A TCTCCCTGCC 900 

CATCGTGACG CTGGTGTACG TGCTGACCAA CCTGGCCTAC TTCACCACCC TGTCCACCGA 960 

GCAGATGCTG TCGTCCGAGG CCGTGGCCGT GGACTT CGGG AACTATCACC TGGGCGT CAT 1020 

GTCCTGGATC ATCCCCGTCT TCGTGGGCCT GTCCTGCTTC GGCTCCGTCA ATGGGTCCCT 1080 

GTTCACATCC TCCAGGCTCT TCTTCGTGGG GTCCCGGGAA GGCCACCTGC CCTCCATCCT 1140 

CTCCATGATC CACCCACAGC TCCTCACCCC CGTGCCGTCC CTCGTGTTCA CGTGTGTGAT 12 00 

GACGCTGCTC TACGCCTTCT CCAAGGACAT CTTCTCCGTC ATCAACTTCT TCAGCTTCTT 12 60 

CAACTGGCTC TGCGTGGCCC TGGCCATCAT CGGCATGATC TGGCTGCGCC ACAGAAAGCC 1320 

TGAGCTTGAG CGGCCCATCA AGGTGAACCT GGCCCTGCCT GTGTTCTTCA TCCTGGCCTG 1380 

CCTCTTCCTG ATCGCCGTCT CCTTCTGGAA GACACCCGTG GAGTGTGGCA TCGGCTTCAC 1440 

CATCATCCTC AGCGGGCTGC CCGTCTACTT CTTCGGGGTC TGGTGGAAAA ACAAGCCCAA 150 0 

GTGGCTCCTC CAGGGCATCT TCTCCACGAC CGTCCTGTGT CAGAAGCTCA TGCAGGTGGT 1560 
CCCCCAGGAG ACATAGCCAG GAGGCCGAGT GGCTGCCGGA GGAGCATGC 

Seq ID NO: 87 Protein sequence: 
Protein Accession #: XP_0352 92.2 



PCT/US02/12476 



MAGAGPKRRA 
GTIIGSGIFV 
LEVYGSLPAF 
LLTAVNCYSV 
GNIVLALYSG 
STEQMLSSEA 
SILSMIHPQL 
RKPELERPIK 
KPKWLLQGIF 



11 
i 

LAAPAAEEKE 
TPTGVLKEAG 
LKLWIELDII 
KAATRVQDAF 
LFAYGGWNYL 
VAVDFGNYHL 
LTPVPSLVFT 
VNLALPVFFI 
STTVLCQKLM 



21 

I 

EAREKMLAAK 
SPGLALiWWA 
RPSSQYIVAL 
AAAKLLALAIi 
NFVTEEMINP 
GVMSWIIPVF 
CVMTLLYAFS 
LACLFLIAVS 
QWPQET 



31 

SADGSAPAGE 
ACGVFSIVGA 
VFATYLLKPL 
IILLGFVQIG 
YRNLPLAIII 
VGLSCFGSVN 
KDIFSVINFF 
FWKTPVECGI 



41 
I 

GEGVTLQRNI 
LCYAELGTTI 
FPTCPVPEEA 
KGDVSNLDPN 
SLPIVTLVYV 
GSLFTSSRLF 
SFFNWLCVAL 
GFTIILSGLP 



51 
I 

TLLNGVAI IV 
SKSGGDYAYM 
AKLVACL CVL 
FSFEGTKLDV 
LTNLAYFTTL 
FVGSREGHLP 
AIIGMIWLRH 
VYFFGVWWKN 



Seq ID NO: 88 DNA sequence 

Nucleic Acid Accession #: NM_005268.1 

Coding sequence: 168-98 9 



TAAAAAGCAA 
TCTGGATATG 
AGCCCTGAGG 
TCTTTGAGGG 
TGTCTCTGGT 
GTGATGACCA 
TTGATGAGTT 
CATGCCCCTC 
ACCGAGAAGC 
GTGGGCTCTG 
TTCTCTATGT 
ACGCAGATCC 
TTTTCACCCT 
TCATCTACCT 
TGTGCACAGG 
CGGGTGACCT 
GAGACCATGT 
CCTGGATGGG 
CATGAGGTAG 
TCAACTCCAG 
GCTCGGTTTC 



11 

i 

AAGAATTCGC 
AAATTCAAGC 
AGTAGTCACT 
ACTCCTGAGT 
CTTCATCTTC 
CAAGGACTTC 
CTTCCCTGTG 
ACTGCTCGTG 
CCATGGGGAG 
GTGGACATAT 
GTTCCACTCA 
ATGTCCCAAT 
CTTCATGGTG 
GGTGAGCAAG 
TCATCACCCC 
CATCTTTCTG 
GAAGAAAACC 
GAGGCTCTAG 
GGGCAGG CAA 
CCACCTGCCC 
CTTTTCTAGA 



21 
I 

GGCCGCGTCG 
TGCTTGCTGA 
CAGTAGCAGC 
GGGGTCAACA 
CGCGTGCTGG 
GACTGCAATA 
TCCCATGTGC 
GTCATGCACG 
AACAGTGGGC 
GTCTGCAGCC 
TTCTACCCCA 
ATAGTGGACT 
GCCACAGCTG 
AGATGCCACG 
CACGGTACCA 
GGCTCAGACA 
ATCTTGTGAG 
CATCTCTCAT 
GAGAGAGGAT 
CAGCTCGACG 
ATGGAAATAG 



31 
I 

ACACGGGCTT 
GTCCTATTGC 
TGACGCGTGG 
AGTACTCCAC 
TGTACCTGGT 
CTCGCCAGCC 
GCCTCTGGGC 
TGGCCTACCG 
GCCTCTACCT 
TAGTGTTCAA 
AATATATCCT 
GCTTCATCTC 
CCATCTGCAT 
AGTGCCTGGC 
CCTCTTCCTG 
GTCATCCTCC 
GGGCTGCCTG 
AGGTGCAACC 
TCAGACG CTC 
GCACTGGGCC 
TGAGGGCCAA 



41 
I 

CCCCGAAAAC 
CGGCTGCTGG 
GTCCACCATG 
AGCCTTTGGG 
GACGGCCGAG 
CGGCTGCTCC 
CCTGCAGCTT 
GGAGGTT GAG 
GAACCCCGGC 
GGCGAGCGTG 
CCCTCCTGTG 
CAAGCCCTCA 
CCTGCTCAAC 
AG CAAGGAAA 
CAAACAAGAC 
TCTCTTACCA 
GACTGGTCTG 
TGAGAGTGGG 
TGGGAGCCAG 
AGTTCCCCCT 
TGC 



51 

1 

CTTCCCCGCT 
GAGCCAGGAG 
AACTGGAGTA 
CGCATCTGGC 
CGTGTGTGGA 
AACGTCTGCT 
ATCCTGGTGA 
GAGAAGAGGC 
AAGAAGCGGG 
GACATCGCCT 
GTCAAGTGCC 
GAGAAGAACA 
CTCGTGGAGC 
GCTCAAGCCA 
GACCTCCTTT 
GACCGCCCCC 
GCAGGTTGGG 
GGAGCTAAGC 
TTCCTAGTCC 
CTGCTCTGCA 



Seq ID NO: 89 Protein sequence: 
Protein Accession #: NP_005259.1 



MNWSIFEGLL 
SNVCFDEFFP 
GKKRGGLWWT 
SEKNIFTLFM 
DDLLSGDLIF 



11 

I 

SGVNKYSTAF 
VSHVRLWALQ 
YVCSLVFKAS 
VATAAICILL 
LGSDSHPPLL 



21 

i 

GRIWLSLVFI 
LILVTCPSLL 
VD I AFLYVFH 
NLVELIYLVS 
PDRPRDHVKK 



31 

! 

FRVLVYLVTA 
WMHVAYREV 
SFYPKYILPP 
KRCHECLAAR 
TIL 



41 

I 

ERVWSDDHKD 
QEKRHREAHG 
WKCHADPCP 
KAQAMCTGHH 



51 
I 

FDCNTRQPGC 
ENSGRLYLNP 
NIVDCFISKP 
PHGTTSSCKQ 



Seq ID NO: 90 DNA sequence 

Nucleic Acid Accession #: NM_0023 91.1 

Coding sequence: 26-457 



CGGGCGAAGC 
CGCCCTGCTG 
CCCGGGGAGC 
CGGCGTGGGT 
GCCCTGCAAC 
TGCGTGTGAT 



11 
I 

AGCGCGGGCA 
GCGCTCACCT 
GAGTGCGCTG 
TTCCGCGAGG 
TGGAAGAAGG 
GGGGGCACAG 



21 
I 

GCGAGATGCA 
CCGCGGTCGC 
AGTGGGCCTG 
GCACCTGCGG 
AGTTTGGAGC 
GCACCAAAGT 



31 

1 

GCACCGAGGC 
CAAAAAGAAA 
GGGGCCCTGC 
GGCCCAGACC 
CGACTGCAAG 
CCGCCAAGGC 



41 



51 



TTCCTCCTCC TCACCCTCCT 
GATAAGGTGA AGAAGGGCGG 
ACCCCCAGCA GCAAGGATTG 
CAGCGCATCC GGTGCAGGGT 
TACAAGTTTG AGAACTGGGG 
ACCCTGAAGA AGGCGCGCTA 



60 
120 
180 
240 
300 
360 
420 
480 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



60 
120 
180 
240 



60 
120 
180 
240 
300 
360 



222 



10 



WO 02/086443 

CAATGCTCAG TGCCAGGAGA CCATCCGCGT 
AAAGGCCAAA GCCAAGAAAG GGAAGGGAAA 
GCCCCTGGTG TCACATGGGG CCTGGCCACG 
CACCAGTGCC TTCTGTCTGC TCGTTAGCTT 
ACTCCCCAGC CCCACCCCTA AGTGCCCAAA 
TGAGCCTCCC CCAAAGCAAT GTGAGTCCCA 
ATTACTAAGA AACACATCAA ATAAACTGAC 
TAATAT 



PCT/US02/12476 



CACCAAGCCC 
GGACTAGACG 



TAATCAATCA 
GTGGGGAGGG 
GAGCCCGCTT 
TTTTTCCCCC 



TGCACCCCCA 
CCAAGCCTGG 
CCCAGGCCCG 
TGCCCTGCCT 
ACAAGGGATT 
TTGTTCTTCC 
CAATAAAAGC 



AGACCAAAGC 
ATGCCAAGGA 
AGATGTGACC 
TGTCCCTCTC 
CTGGGAAGCT 
CCACAATTCC 
TCTTCTTTTT 



Seq ID NO: 91 Protein sequence: 
Protein Accession #: NP_002382.1 



420 
480 
540 
600 
660 
720 
780 



1 11 21 31 41 51 

t< I I I 1 I I 

ID MQHRGFLLLT LLALLALTSA VAKKKDKVKK GGPGSECAEW AWGPCTPSSK DCGVGFREGT 

CGAQTQR IRC RVPCNWKKEF GADCKYKFEN WGACDGGTGT KVRQGThKKA RYNAQCQETI 
RVTKPCTPKT KAKAKAKKGK GKD 



60 
120 



20 Seq ID NO: 92 DNA sequence 

Nucleic Acid Accession #: NM_005130.1 
Coding sequence: 98-802 



25 
30 
35 
40 
45 



CTCTACCTGA 
CGTGTGCTCA 
GCTCTCCTTC 
GAATGGACTT 
TAAGCAGAAA 
CAGATGGGCT 
GGACCATGAA 
TGAGAGAGTC 
ATATTCCAAG 
TAAGCTAGTC 
GTCCCCCAGG 
GACCATGGCC 
GACTGCCCTG 
AGTGCAGGAC 
TGTCGTAAGT 
TGTGCTTAGT 
TGGAATTTGC 
TTCCATGGCC 
GAGTGATAAT 
TTTTTCAAAA 



11 

1 

CACAGCTGCA 
GAACAAGGTG 
CTCCTACTGG 
CACAGCAAAG 
AGCAGGCCCG 
GCTACTGAGC 
TTTTCCTGTG 
TATTGGAAAC 
ACAGCTGTGA 
AGCTCCACTC 
GAGCACATCA 
ACCAAAGCTC 
GAGTTCTGTG 
ACGTCATGCT 
CCCTCTGTAT 
GAGTGCAACG 
CTTATTTTTC 
CACACAGCTA 
TTCAGTGCAA 
AAAAAAAAAA 



21 

1 

GCCTGCAATT 
AACGCCCAGC 
CTGCTCAGGT 
TGGT CTCAGA 
GGAACAAAGG 
AGGAGGAGGG 
TCTTTGCTGG 
AAGTTGCCCG 
AAACCAGAGT 
TATTTGGGAA 
AGGGCAAAGA 
CCGAGTGTGT 
GAGAGACTTG 
AATGAGGTCA 
ACTTTAAAGC 
AAATATTTAA 
TTGGATGCGA 
TGTGTTTGAG 
CGAACTTTCT 
AAA 



31 
I 

CACTCCCACT 
TGCAGCCATG 
GCTCCTGGTG 
ACAAAAGGAC 
CAAGTTTGTC 
CATCTCTCTC 
CAATCCAACC 
GAATCTGCGC 
GTGCAGAAAG 
CACAAAGCCC 
GACCACCCCC 
GGAGGACCCA 
GAGCTCTCTC 
AAAGAGAACG 
TCTCTACAGT 
ACAAGTTTTG 
TGTT CAGAGG 
CAGCGAAGAG 
GCTGAATTAA 



41 

I 

GCCTGGGATT 
AAGATCTGTA 
GAGGGGAAAA 
ACTCTGGGCA 
ACCAAAGACC 
AAGGTTGAGT 
TCATGCCTAA 
TCACAGAAAG 
GATTTTCCAG 
AGGAAGGAGA 
TCTAGCCTAG 
GATATGGCAA 
TGCACATTCT 
GGTTCCTTTA 
CCCCCCAAAA 
TATTTTTTGC 
CTGTTTCCTG 
TCTTTGAGCT 
TGGTAATAAA 



51 
I 

GCACTGGATC 
GCCTCACCCT 
AAAAAGTGAA 
ACACCCAGAT 
AAGCCAACTG 
GCACTCAATT 
AGCTCAAGGA 
ACAT CTGTAG 
AATCCAGTCT 
AAACAGAGAT 
CAGTGACCCA 
ACCAGAGGAA 
TCCT CAGCAT 
AGAGATGTCA 
TATGAACTTT 
TTTTGTGTTT 
CAGCATGTAT 
GAATGAGCCA 
ACTCTGGGTG 



Seq ID NO: 93 Protein sequence: 
Protein Accession #: NP_005121.1 

50 1 11 21 31 41 51 

1 I I I 1 I 

MKICSLTLLS FLLLAAQVLL VEGKKKVKNG LHSKWSEQK DTLGNTQIKQ KSRPGNKGKF 
VTKDQANCRW AATEQEEGIS LKVECTQLDH EFSCVFAGNP TSCLKLKDER VYWKQVARNL 
RSQKDICRYS KTAVKTRVCR KDFPESSLKL VSSTLFGNTK PRKEKTEMSP REH I KGKETT 
55 PSSLAVTQTM ATKAPECVED PDMANQRKTA IiEFCGETWSS LCTFFLSIVQ DTSC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



60 
12 0 
180 



60 



Seq ID NO: 94 DNA sequence 

Nucleic Acid Accession #: NM__012101 

Coding sequence: 125-1891 



11 



21 



31 



41 



51 



65 
70 
75 
80 
85 



CTCCTCACAG 
TGCCAGAAAG 
TGCGATGGAA 
CCGGAGCCCG 
TGCCAAGACC 
CCTGAAGCCA 
CATCCAGTTT 
GGAAGGCAAG 
TACCTTTGCC 

GGTGTCCATC 
CCTTTTTTCA 

CAAGCAGAAG 
CAAGCCCCAC 
CTTTGAGGCC 
CCAGACCTGC 
AGTGGAGGAG 
GCTCAAGATC 
CAAGAG CTTC 
GGACCTGGAG 
TGTGGACCAA 
GGACAAGCAG 
ATTTGGTGCA 
GCTGGAGGGG 
ATGCATGCGC 
GAACCACATG 



GTGTGTCTCT 
GTCACCTATC 
GCTG CAGATG 
TCGGGCCCCA 
ACCAACGGGC 
GGGGAAGGTA 
GTCGAGTCCG 
AGGTCGCCGT 
GAAAAGGGCG 
ATGGAGCCCG 
CGGTCCAAGT 
GCGGTCAAGT 
CTGGAGGGCG 
CGCAAGTGTC 
ATCTGCTACC 
GCCAAGGCCG 
ATTGAGATTG 
ACCACCAATG 
AAGCAAAAGG 
GTGAAGGTGA 
ACCCGGGAGC 
TTGATGAGCA 
GAGGGCCTGG 
CACGTTGAGA 
GAGAACGGTG 



AGTCCTCGTG 
CTGAACCCCA 
CCTCCAGGAG 
GTGGCAGCCT 
ACGGCGGGGA 
GGAGCGCCCT 
GGGACGACAA 
ACGCAGGGCT 
ACGTGCGCAA 
GGGAGACCCG 
CCGGCTCCGA 
CCTGCCTGGT 
CCGCCTTCCG 
CCGTGCATGG 
TTTGCATGTT 
AGAAGGAGAC 
AGGATGAAGC 
AGAAGGCCAT 
AGGAAGTGAG 
TCATGGATGC 
AGCTGCATAG 
ATTACTCTCT 
GACAGTCACT 
AGATGTGCAA 
GTGACCATCG 



GTTGCCTGCC 
GCAAGCCTGA 
CAACGGGTCG 
GGAGAATGGC 
GGCAGCTGAG 
GTTCGCGGGC 
GAACTCCAAC 
CCAGCTGGGG 
GTCCATTTTC 
GCGGAACAGC 
GGAGGTGCTG 
GTGCCAGGCC 
AGACCACCAG 
CAAGACGATG 
CCAGGAGCAC 
GGAGCTGTCA 
TGAGAAGTGG 
CCTGGAGCAG 
GGCTGCGCTG 
TCTGGATGAG 
CATCAGCGAC 
CCCCCCACCC 
AGGCAACTTC 
GGCGGACCTG 
CTATGTGAAC 



CCACTCCCTG 
AACAGCTCAG 
AGCCCAGAAG 
ACCAAGGCTG 
GGCAAGAGCC 
AATGAGTGGC 
TACTTCAGCA 
GCTGCCAAGA 
TCGGAGTCCC 
TACCCCCGGG 
TGCGACTCCT 
TCCTTCTGCG 
CTGCTCGAGC 
GAGCTCTTCT 
AAGAATCATA 
CTGCAAAAGG 
CAGAAGGAGA 
AACTTCCGGG 
GAGCAGCGGG 
AGAGCCAAGG 
TCTGTGTTGT 
CTGCCCACCT 
AAGGACGACC 
AGCCGTAACT 
AACTACACGA 



CCGAGACGCC 
CCAAGCACCC 
CCAGGGATGC 
ACGGCAAGGA 
TGGGCAGCGC 
GGCGACCCAT 
TGGACTCTAT 
AGCCACCCGT 
GGAAGCCCAC 
CCGACACGGG 
GCATCGGCAA 
AGCTGCATCT 
CCATCCGGGA 
GCCAGACCGA 
GCACCGTGAC 
AGCAGCTGCA 
AGGACCGCAT 
ACCTGGTGCG 
AGCAGGATGC 
TGCTGCATGA 
TTCTGCAGGA 
ATCATGTCCT 
TGCTCAATGT 
TCATTGAGAG 
ACAGCTTCGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



223 
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GGGTGAGTGG AGTGCACCGG ACACCATGAA GAGATACTCC ATGTACCTGA CACCCAAAGG 1560 

TGGGGTCCGG ACATCATACC AGCCCTCGTC TCCTGGCCGC TTCACCAAGG AGACCACCCA 1620 

GAAGAATTTC AACAATCTCT ATGGCACCAA AGGTAACTAC ACCTCCCGGG T CTGGGAGTA 1680 

CTCCTCCAGC ATTCAGAACT CTGACAATGA CCTGCCCGTC GTCCAAGGCA GCTCCTCCTT 1740 

CTCCCTGAAA GGCTATCCCT CCCTCATGCG GAGCCAAAGC CCCAAGGCCC AGCCCCAGAC 1800 

TTGGAAATCT GGCAAGCAGA CTATGCTGTC TCACTACCGG CCATTCTACG TCAACAAAGG 1860 

CAACGGGATT GGGTCCAACG AAGCCCCATG AGCTCCTGGC GGAAGGAACG AGGCGCCACA 1920 

CCCCTGCTCT TCCTCCTGAC CCTGCTGCTC TTGCCTTCTA AGCTACTGTG CTTGTCTGGG 1980 

TGGGAGGGAG CCTGGTCCTG CACCTGCCCT CTGCAGCCCT CTGCCAGCCT CTTGGGGGCA 2040 

GTTCCGGCCT CTCCGACTTC CCCACTGGCC ACACTCCATT CAGACTCCTT TCCTGCCTTG 2100 

TGACCTCAGA TGGTCACCAT CATTCCTGTG CTCAGAGGCC AACCCATCAC AGGGGTGAGA 2160 

TAGGTTGGGG CCTGCCCTAA CCCGCCAGCC TCCTCCTCTC GGGCTGGATC TGGGGGCTAG 2220 

CAGTGAGTAC CCGCATGGTA TCAGCCTGCC TCTCCCGCCC ACGCCCTGCT GTCTCCAGGC 22 80 

CTATAGACGT TTCTCTCCAA GGCCCTATCC CCCAATGTTG TCAGCAGATG CCTGGACAGC 2340 

ACAGCCACCC ATCTCCCATT CACATGGCCC ACCTCCTGCT TCCCAGAGGA CTGGCCCTAC 2400 

GTGCTCTCTC TCGTCCTACC TATCAATGCC CAGCATGGCA GAACCTGCAG TGGCCAAGGG 2460 

CTGCAGATGG AAACCTCTCA GTGTCTTGAC ATCACCCTAC CCAGGCGGTG GGTCTCCACC 2520 

ACAGCCACTT TGAGTCTGTG GTCCCTGGAG GGTGGCTTCT CCTGACTGGC AGGATGACCT 2580 

TAGCCAAGAT ATTCCTCTGT TCCCTCTGCT GAGATAAAGA ATTCCCTTAA CATGATATAA 2640 

TCCACCCATG CAAATAGCTA CTGGCCCAGC TACCATTTAC CATTTGCCTA CAGAATTTCA 2 700 

TTCAGTCTAC ACTTTGGCAT TCTCTCTGGC GATGGAGTGT GGCTGGGCTG ACCGCAAAAG 2760 

GTGCCTTACA CACTGCCCCC ACCCTCAGCC GTTGCCCCAT CAGAGGCTGC CTCCTCCTTC 2820 

TGATTACCCC CCATGTTGCA TAT CAGGGTG CTCAAGGATT GGAGAGGAGA CAAAACCAGG 2 880 

AGCAGCACAG TGGGGACATC TCCCGTCTCA ACAGCCCCAG GCCTATGGGG GCTCTGGAAG 2 940 

GATGGGCCAG CTTGCAGGGG TTGGGGAGGG AGACATCCAG CTTGGGCTTT CCCCTTTGGA 3 000 
ATAAACCATT GGTCTGTC 

Seq ID NO: 95 Protein sequence : 
Protein Accession #: NP_036233.1 



1 11 21 31 41 51 

III 

MEAADASRSN GSSPEARDAR SPSGPSGSLE NGTKADGKDA KTTNGHGGEA AEGKSLGSAL 60 

KPGEGRSALF AGNEWRRPII QFVESGDDKN SNYFSMDSME GKRSPYAGLQ LGAAKKPPVT 120 

FAEKGDVRKS IFSESRKPTV SIMEPGETRR NSYPRADTGL FSRSKSGSEE VIiCDSCIGNK 180 

QKAVKSCLVC QASFCELHLK PHLEGAAFRD HQLLEPIRDF EARKCPVHGK TMELFCQTDQ 240 

TCICYLCMFQ EHKNHSTVTV EEAKAEKETE LSLQKEQLQL KIIEIEDEAE KWQKEKDRIK 300 

SFTTNEKA.IL EQNFRDLVRD LEKQKEEVRA ALEQREQDAV DQVKVIMDAL DERAKVLHED 3 60 

KQTREQLHSI SDSVLFLQEF GALMSNYSLP PPLPTYHVLL EGEGLGQSLG NFKDDLLNVC 420 

MRHVEKMCKA DIiSRNFIERN HMENGGDHRY VNNYTNSFGG EWSAPDTMKR YSMYLTPKGG 480 

VRTSYQPSSP GRFTKETTQK NFNNLYGTKG NYTSRVWEY S SSIQNSDNDLi PWQGSSSFS 540 
LKGYPSLMRS QSPKAQPQTW KSGKQTMLSH YRPFYVNKGN GIGSNEAP 



Seq ID NO: 96 DNA sequence 

Nucleic Acid Accession # : NM_080668.1 

Coding sequence: 83-841 

1 11 21 31 41 51 

GGCACGAGGG CAGCGAGTGG CCTTCCCGGT TGGCGCGCGC CCGGGGCGGC GGCGCTGGAG 60 

GAGCTCGAGA CGGAGCCTAG TTATGTCTGG GAGGCGAACG CGGTCCGGAG GAGCCGCTCA 120 

GCGCTCCGGG CCAAGGGCCC CATCTCCTAC TAAGCCTCTG CGGAGGTCCC AGCGGAAATC 180 

AGGCTCTGAA CTCCCGAGCA TCCTCCCTGA AATCTGGCCG AAGACACCCA GTGCGGCTGC 240 

AGTCAGAAAG CCCATCGTCT TAAAGAGGAT CGTGGCCCAT GCTGTAGAGG TCCCAGCTGT 300 

CCAATCACCT CGCAGGAGCC CT AGGATTT C CTTTTTCTTG GAGAAAGAAA ACGAGCCCCC 360 

TGGCAGGGAG CTTACTAAGG AGGACCTTTT CAAGACACAC AGCGTCCCTG CCACCCCCAC 420 

CAGCACTCCT GTGCCGAACC CTGAGGCCGA GTCCAGCTCC AAGGAAGGAG AGCTGGACGC 480 

CAGAGACTTG GAAATGTCTA AGAAAGTCAG GCGTTCCTAC AGCCGGCTGG AGACCCTGGG 540 

CTCTGCCTCT ACCTCCACCC CAGGCCGCCG GTCCTGCTTT GGCTTCGAGG GGCTGCTGGG 600 

GGCAGAAGAC TTGTCCGGAG TCTCGCCAGT GGTGTGCTCC AAACTCACCG AGGTGCCCAG 660 

GGTTTGTGCA AAGCCCTGGG CCCCAGACAT GACTCTCCCT GGAATCTCCC CACCACCCGA 72 0 

GAAACAGAAA CGTAAGAAGA AGAAAATGCC AGAGATCTTG AAAACGGAGC TGGATGAGTG 78 0 

GGCTGCGGCC ATGAATGCCG AGTTTGAAGC TGCTGAGCAG TTTGATCTCC TGGTTGAATG 840 

AGATGCAGTG GGGGGTGCAC CTGGCCAGAC TCTCCCTCCT GTCCTGTACA TAGCCACCTC 900 

CCTGTGGAGA GGACACTTAG GGTCCCCTCC CCTGGTCTTG TTACCTGTGT GTGTGCTGGT 960 

GCTGCGCATG AGGACTGTCT GCCTTTGAGG GCTTGGGCAG CAGCGGCAGC CATCTTGGTT 1020 

TTAGGAAATG GGGCCGCCTG GCCCAGCCAC TCACTGGTGT CCTGTCTCTT GTCGTCCTGT 1080 

CCTTCCTATC TCCCCAAAGT ACCATAGCCA GTTTCCAGAT GGGCCACAGA CTGGGGAGGA 1140 

GAATCAGTGG CCCAGCCAGA AGTTAAAGGG CTGAGGGTTG AGGTGAGAGG CACCTCTGCT 1200 

CTTGTTGGGA GGGGTGGCTG CTTGGAAATA GGCCCAGGGG CTCTGCCAGC CTCGGCCTCT 1260 

CCCTCCTGAG TTGCCTTCTG TTGGTGGCTT TCTTCTTGAA CCCACCTGTG TAAAGAGGTT 1320 

TTCAGTTCCG TGGGTTTCCC CTTTGATTCT GTAAATAGTC CCAGAGAGAA TTCGTGGGCT 1380 

GAGGGCAATT CTGTCTTGGA GGAAGAAGCT GGACATTCAG CCTGTGGAGT CTGAGTTTTG 1440 

AAGGATGTAG GGAGCCTTAG TTGGGTCTCA GACCATAAGT GTGTACTACA CAGAAGCTGT 1500 

GTTTTCTAGT TCTGGTCTGC TGTTGAGATG TTTGGTAAAT GCCAGGTTGA TAGGGCGCTG 1560 

GCTGCTTGGA GCAAAGGGTG CATTTCAGGG TGTGGCCACC AGGTGCTGTG AGTTTCTGTG 162 0 

GCTCATGGCC TCTGGGCTGG TCCCTTGCAC AGGGCCCACG CTGGAGTCTT ACCACTCTGC 1680 

TGCAGGGGTG GAAGGTGGCC CCTCTTGTCA CCCATACCCA TTTCTTACAA AATAAGTTAC 1740 

ACCGAGTCTA CTTGGCCCTA GAAGAGAAAG TTGAAGAGTC CCAGACCTAC TAGCATTTTG 1800 

CAACTATGCT TGTAAAGTCC TCGGAAAGTT TCCTCGCGTA CCAGACAGCG GCGGGGGCTG I860 

ATAGCAATTT TAGTTTTTGG CCTCCCTATC CTCTCACATG AGAACACTGC CTGGATGCAT 1920 

CTCATGATCT CTGGAGAATT TCCCCATCTT TCTCTTCTTT CCATCGTGTG GATTCAATAG 1980 

TTTGGATTTG AAGGCTGCCC TGCCCCCGAC TCTCCTGCCG CACCCCTGGC CATTGTACCT 2 040 

TTTGATGTTT AGAAGTTCGT GGAAGTAGAC GCTGAGGTGT GCAGAGGAGC TGGTGGATAA 2100 

CAGAGAATGC CAGGGAAGAT GAGTGCTGGG TCAGGGTACT TGGATGAAAC GGTGCAGGCC 2160 

AGGCGGGCCC TAATAAAACC CTCTGCCAGG TCTGGGAGTC CCAGGCCATC TGCTCAACGC 222 0 
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TCTGTGGTTT GTCAGACCTG 
CTGAACCGCA CTGAAGAACT 
GTCTTAGTCC TGCAGAATCA 
AGTTCTCTGT TCCTGAGGAA 
AAAAAAAGCC TGATTAAAGA 
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CAAGCAAGCC CCCTGCTGGG GAAGCCTAGG TGTCCTTGAG 
CTTGTCCTCA CTGGCTGATG CAGCAGAACT CTTGGGAAAT 
GGAGT CACCA GATGATGCAG AGTTGAGATC ATCATTGCAA 
CTAAATTTAA GGAAAAAATG GGATTTTGTT TTAGAGTTGG 
GTTTCTGCCT GTTAAAAAAA AAAAAAAAAA AAAAAA 
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Seq ID NO: 97 Protein sequence: 
Protein Accession #: NP_5423 99.1 
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I I I I I I 

MSGRRTRSGG AAQRSGPRAP SPTKPLRRSQ RKSGSELPSI LPEIWPKTPS AAAVRKP I VL 
KR I VAHAVE V PAVQSPRRSP RISPFLEKEN EPPGRELTKE DLFKTHSVPA TPTSTPVPNP 
EAESSSKEGE LDARDLEMSK KVRRSYSRLE TLGSASTSTP GRRSCFGFEG LLGAEDLSGV 
SPWCSKLTE VPRVCAKPWA PDMTLPGISP PPEKQKRKKK KMPEILKTEL DEWAAAMNAE 
FEAAEQFDLL VE 

Seq ID NO: 98 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 58-12444 



GGGGCATTTC 
GCGGGCTCCG 
GCGGACCGCT 
TGCGT CCTGA 
GATTTCGGTT 
AGAGAAGAAA 
CCTTACTCTG 
AAATGTAAAA 
AGACTCATGG 
TTGAAAAAAA 
GGTGAAGTTC 
GGTGAACTTA 
GGATGTCTGA 
CCCCAGACTT 
CTGAAGAGAT 
TTTAGCACCT 
GCCCACACAA 
CAGGTTTCTA 
ATGGAGCAGT 
GCTATCCGTG 
GACTTCATGT 
ACTGGTGACG 
CTGTACCTTG 
CAGATAGACA 
GTGAAGGTGT 
GTGGT G CAT C 
GAGT CTGAAT 
CCCACATACA 
GATTCTATTT 
CATTTACTTT 
ACACTTGAAA 
ATGATCCCAA 
GCTTTCATTA 
TTTGAACCAT 
CTCATCAGTG 
TATTTCGAGG 
TCTTGCTTTG 
AAAGATGAAC 
GAACTCGATG 
TATACCCCCT 
AGACATGTAA 
ACTTCAGCCT 
GCCCAGAAAG 
TCAAACGAAG 
CTAGGAGGAC 
AGCTATGTGG 
AAACCTGTCA 
AGTGACAGAC 
TTGGGCAAAG 
TATAAG CGG A 
CAACTGTATG 
GAAAGTCAGG 
GACAGTACTT 
AAGCAAATAA 
CGACTTTATA 
TTTAATAATA 
GAAGCCTTGG 
GGTACAATTC 
CATGTTT CTT 
TCATTGTGTT 
GAATGTCGAC 
AGATCCCCTA 
AACACCTTTG 
TTGTACCTTC 



11 
I 

CGGGTCCGGG 
GAGCCGGTGT 
GCGGTGCTGC 
GCAGCAGCCC 
TGCTTGTATT 
TCCTAAAGTT 
TTGAAATTAA 
TTCCAGCCCT 
ATGAATTTAA 
AAATACCAGA 
ATCCTAGTGA 
AGACCCAGAT 
AGGGGTTGTC 
CAAGGGAGAT 
ATGCTGTGCC 
GCCTTCTGGA 
ATGTAGAATT 
ATATGGTGGC 
TTTATGGAAT 
GATATGGACT 
ACGTTGAGCT 
ACCGTGTTTA 
ACACAGTTCC 
GTTTCCCACA 
TCCTAGCTTT 
AGGGTTTAAT 
CTGAAGACCA 
AAGACTACGT 
TAGCAGATGA 
ATGATGAATT 
TACAGACTGT 
CTTCAGATCC 
ACCTGGTGGA 
GGGTGTACTC 
GTTTCTACAA 
GAGTTAGTCC 
CTTTATTTGT 
TTTTGGCCTC 
TTAGAGCCTA 
TGGCAGAAGT 
TGCAGCCTTA 
TGT CAGATG A 
GATTTAATAA 
CAATATCCTT 
AAATAAACAA 
CCTGGGACAG 
TTTTCCTGGA 
AAACTAAAGT 
CCACGCAGAT 
CGTTTCCTGT 
AGCCACTAGT 
ATACTGTTGC 
TAAGAGATTT 
CACCACAGCA 
GCCTTGCGCT 
TCTACAGGGA 
TGATATACAT 
AACAGTGTTG 
TAAATAAAGC 
TATTGGATCT 
ACAAATCCAT 
ATTTGTGGCT 
AGGGGGGTGG 
GGGGGCCATT 



21 
I 

CCGAGCGGGC 
GCGTTGCTCC 
CCTGGCCGGT 
CGCGGTGCTG 
TGT CCGGAAG 
TTTATGTATT 
GAACACTTGT 
GGACCTTCTT 
AATTGGAGAA 
TACAGTTTTA 
GATGATAAAT 
GACATCAGCA 
CTCACTTCTG 
TTTTAATTTT 
CTCAGCTGGC 
CAACTACGTG 
GAAAAAAGCT 
GAAAAATGCA 
CATCAGAAAT 
TTTTGCAGGA 
CATTCAGCGC 
TCAGATGCCA 
TGAGGTGTAT 
GTACAGTCCA 
GGCAGCAAAA 
CAGAATATGT 
CCGTGCTTCA 
GGATCTCTTC 
AGCATTTTTC 
TGTAAAATCC 
TGGGGAACAA 
AGCGGCTAAC 
ATTTTGCAGA 
ATTTTCATAT 
ATTGCTTTCT 
AAAGAGTCTG 
GAAATTTGGC 
TTGTTTGACC 
CGTTCCTGCA 
AGGCCTGAAT 
TTACAAAGAC 
GACCAAGAAT 
AGTGGTGTTA 
AGAAGAAATA 
AAATCTTCTG 
AGAGAAGCGG 
TGTGTTCCTG 
TGCAGCCTGT 
GCCAGAAGGG 
GCTGCTTCGA 
TATGCAGCTG 
CTTACTAGAA 
TTGTGGTCGG 
GCAGGAGAAG 
TCACCCCAAT 
ATTCAGGGAA 
GGAGAGTCTG 
TGATGCCATT 
AAAGAAACGA 
GGT CAAGTGG 
TGAACTCTTT 
GAAAGATGTT 
CTGTGGCCAG 
CAGCCTGCAG 
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GCACGCGCGG 
CTGCTGCGGC 
CATCAACTGA 
GCATTACAGA 
TCACTCAACA 
TTCTTAGAAA 
ACCAGTGTTT 
ATTAAGTTAC 
TTATTTAGTA 
GAAAAAGTAT 
AATGCAGAAA 
GTAAGAGAGC 
TGCAACTTCA 
GTACTAAAGG 
TTGCGCCTAT 
TCTCTATTTG 
GCACTTTCAG 
GAAATGCATA 
GTGGATTCGA 
CCGTGCAAGG 
TGCAAGCAGA 
AGCTTCCTCC 
ACTCCAGTTC 
AAAATG CAGC 
GGGCCAGTTC 
TCTAAACCAG 
GGGGAAGTCA 
AGACATCTCC 
TCTGTGAATT 
GTTTTGAAGA 
GAGAATGGAG 
TTGCATCCAG 
GAGATTCTCC 
GAATTAATTT 
ATTACAGTAA 
AAACACTCTC 
AAAGAGGTGG 
TTTCTTCTGT 
CTGCAGATGG 
GCTCTAGAAG 
ATTCTCCCCT 
AACTGGGAAG 
AAGCATCTGA 
AGAATTAGAG 
ACAGTCACGT 
CTGAGCTTTG 
CCTCGAGTCA 
GAACTTTTAC 
GGACAGGGAG 
CTTGCGTGTG 
ATT CACTGGT 
GCTATATTGG 
TGTATTCGAG 
AGTCCAGTAA 
GCTTTCAAGA 
GAAGAGTCTC 
GCCTTAGCAC 
GATCACCTAT 
CGTTTGCCGC 
CTTTTAGCTC 
TATAAATTCG 
CTCAAGGAAG 
CCCTCGGGCA 
GCCACGCTAT 
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GAGCGGGACT 
TGCAGGAGAC 
TCCGCGGCCT 
CATCTTTAGT 
GTATTGAATT 
AAATGGGCCA 
ATACAAAAGA 
TTCAGACTTT 
AATTCTATGG 
ATGAGCTCCT 
ACCTGTTCCG 
CCAAACTACC 
CTAAGTCCAT 
CAATTCGTCC 
TTGCCCTGCA 
AAGTCTTGTT 
CCCTGGAATC 
AAAATAAACT 
ACAACAAGGA 
TTATAAACGC 
TGTTCCTCAC 
AGTCTGTTGC 
TGGAGCACCT 
TGGTGTGTTG 
TCAGGAATTG 
TGGTCCTTCC 
GAACTGGCAA 
TGAGCTCTGA 
CCTCCAGTGA 
TTGTTGAGAA 
ATGAGGCGCC 
CTAAACCTAA 
CTGAGAAACA 
TGCAATCTAC 
GAAATGCCAA 
CTGAAGACCC 
CAGTTAAAAT 
CCTTGCCACA 
CTTTCAAACT 
AATGGTCAAT 
GCCTGGATGG 
TGTCAGCTCT 
AGAAGACAAA 
TAGTACAAAT 
CCTCAGATGA 
CAGTGCCCTT 
CAGAATTAGC 
ATAGCATGGT 
CCCCACCCAT 
ATGTTGATCA 
TCACTAACAA 
ATGGAATTGT 
AATTCCTTAA 
ACACCAAATC 
GGCTGGGAGC 
TGGTGGAACA 
ATGCAGATGA 
GCCGCATCAT 
GAGGATTTCC 
AT TGTGGGAG 
TTCCTTTATT 
AAGGTGTCrC 
TCCTGGCCCA 
GCTGGCTGGA 
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CGGCGGCATG 
CTTGTCCGCT 
GGGG CAGGAA 
TTTTTCCAGA 
TCGTGAATGT 
GAAGATCGCA 
TAGAGCTGCT 
TAGAAGTTCT 
AGAACTTGCA 
AGGATTATTG 
CGCTTTTCTG 
TGTTCTGGCA 
GGAAGAAGAT 
TCAGATTGAT 
TGCATCT CAG 
AAAGTGGTGT 
CTTTCTGAAA 
GCAGTACTTT 
GTTATCTATT 
AAAAGATGTT 
CCAGACAGAC 
AAGCGTCTTG 
CGTGGTGATG 
CAGAGCCATA 
CATTAGTACT 
AAAGGGCCCT 
ATGGAAGGTG 
CCAGATGATG 
AAGTCTGAAT 
ATTGGATCTT 
TGGTGTTTGG 
AGATTTTTCG 
AG CAGAATTT 
AAGGTTGCCC 
GAAAATAAAA 
AGAAAAGTAT 
GAAGCAGTAC 
CAACATCATT 
GGGCCTGAGC 
TTATATTGAC 
ATACCTGAAG 
TTCTCGGGCT 
GAACCTTTCA 
GCTTGGATCT 
GATGATGAAG 
TAGAGAGATG 
GCTCACAGCC 
TATGTTTATG 
GTACCAGCTC 
GGTGACAAGG 
CAAGAAATTT 
GGACCCTGTT 
ATGGTCCATT 
GCTTTTCAAG 
ATCACTTGCC 
GTTTGTGTTT 
GAAGTCCTTA 
TGAAAAGAAG 
ACCTTCCGCA 
GCCCCAGACA 
GCCAGGCAAC 
TTTTCTCATC 
GCCCACCCTC 
CCTGCTCCTG 



60 
120 
180 
240 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
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GCCGCGTTGG AGTGCTACAA CACGTTCATT GGCGAGAGAA CTGTAGGAGC GCTCCAGGTC 3900 

CTAGGTACTG AAGCCCAGTC TTCACTTTTG AAAGCAGTGG CTTTCTTCTT AGAAAGCATT 3 960 

GCCATGCATG ACAT TAT AG C AGCAGAAAAG TGCTTTGGCA CTGGGGCAGC AGGTAACAGA 4020 

ACAAGCCCAC AAGAGGGAGA AAGGTACAAC TACAGCAAAT GCACCGTTGT GGTCCGGATT 4080 

ATGGAGTTTA CCACGACTCT GCTAAACACC TCCCCGGAAG GATGGAAGCT CCTGAAGAAG 4140 

GACTTGTGTA ATACACACCT GATGAGAGTC CTGGTGCAGA CGCTGTGTGA GCCCGCAAGC 42 00 

ATAGGTTTCA ACATCGGAGA CGTCCAGGTT ATGGCTCATC TTCCTGATGT TTGTGTGAAT 42 60 

CTGATGAAAG CTCTAAAGAT GTCCCCATAC AAAGATATCC TAGAGACCCA TCTGAGAGAG 4320 

AAAATAACAG CAC AG AG CAT TGAGGAGCTT TGTGCCGTCA ACTTGTATGG CCCTGACGCG 43 80 

CAAGTGGACA GGAGCAGGCT GGCTGCTGTT GTGTCTGCCT GTAAACAGCT TCACAGAGCT 4440 

GGGCTTCTGC ATAATATATT ACCGTCTCAG TCCACAGATT TGCATCATTC TGTTGGCACA 4500 

GAACTTCTTT CCCTGGTTTA TAAAGGCATT GCCCCTGGAG ATGAGAGACA GTGTCTGCCT 4560 

TCTCTAGACC TCAGTTGTAA GCAGCTGGCC AGCGGACTTC TGGAGTTAGC CTTTGCTTTT 4620 

GGAGGACTGT GTGAGCGCCT TGTGAGTCTT CTCCTGAACC CAGCGGTGCT GTCCACGGCG 4680 

TCCTTGGGCA GCTCACAGGG CAGCGTCATC CACTTCTCCC ATGGGGAGTA TTTCTATAGC 4740 

TTGTTCTCAG AAACG AT CAA CACGGAATTA TTGAAAAATC TGGATCTTGC TGTATTGGAG 4800 

CTCATGCAGT CTTCAGTGGA TAATACCAAA ATGGTGAGTG CCGTTTTGAA CGGCATGTTA 4860 

GACCAGAGCT TCAGGGAGCG AGCAAACCAG AAACACCAAG GACTGAAACT TGCGACTACA 4920 

ATTCTGCAAC ACTGGAAGAA GTGTGATTCA TGGTGGGCCA AAGATTCCCC TCTCGAAACT 4980 

AAAATGGCAG TGCTGGCCTT ACTGGCAAAA ATTTTACAGA TTGATTCATC TGTATCTTTT 5040 

AATACAAGTC ATGGTTCATT CCCTGAAGTC TTTACAACAT ATATTAGTCT ACTTGCTGAC 510 0 

ACAAAGCTGG ATCTACATTT AAAGGGCCAA GCTGTCACTC TTCTTCCATT CTTCACCAGC 5160 

CTCACTGGAG GCAGTCTGGA GGAACTTAGA CGTGTTCTGG AGCAGCTCAT CGTTGCTCAC 5220 

TTCCCCATGC AGTCCAGGGA ATTTCCTCCA GGAACTCCGC GGTT CAATAA TTATGTGGAC 52 80 

TGCATGAAAA AGTTTCTAGA TGCATTGGAA TTATCTCAAA GCCCTATGTT GTTGGAATTG 5340 

ATGACAGAAG TTCTTTGTCG GGAACAGCAG CATGTCATGG AAGAATTATT TCAAT CCAGT 540 0 

TTCAGGAGGA TTGCCAGAAG GGGTTCATGT GTCACACAAG TAGGCCTTCT GGAAAGCGTG 54 60 

TATGAAATGT T C AGGAAGG A TGACCCCCGC CTAAGTTTCA CACGCCAGTC CTTTGTGGAC 552 0 

CGCTCCCTCC TCACTCTGCT GTGGCACTGT AGCCTGGATG CTTTGAGAGA ATTCTTCAGC 5580 

ACAATTGTGG TGGATGCCAT TGATGTGTTG AAGTCCAGGT TTACAAAGCT AAATGAATCT 5640 

ACCTTTGATA CTCAAATCAC CAAGAAGATG GGCTACTATA AGATTCTAGA CGTGATGTAT 5700 

TCTCGCCTTC CCAAAGATGA TGTTCATGCT AAGGAATCAA AAATTAATCA AGTTTTCCAT 5760 

GGCTCGTGTA TTACAGAAGG AAATGAACTT ACAAAGACAT TGATTAAATT GTGCTACGAT 5820 

GCATTTACAG AGAACATGGC AGGAGAGAAT CAGCTGCTGG AGAGGAGAAG ACTTTACCAT 5880 

TGTGCAGCAT ACAACTGCGC CAT ATCTGT C ATCTGCTGTG TCTTCAATGA GTTAAAATTT 5940 

TACCAAGGTT TTCTGTTTAG TGAAAAACCA GAAAAGAACT TGCTTATTTT TGAAAATCTG 6000 

ATCGACCTGA AGCGCCGCTA TAATTTTCCT GTAGAAGTTG AGGTTCCTAT GGAAAGAAAG 6060 

AAAAAGTACA TTGAAATTAG GAAAGAAGCC AGAGAAGCAG CAAATGGGGA TTCAGATGGT 6120 

CCTTCCTATA TGTCTTCCCT GTCATATTTG GCAGACAGTA CCCTGAGTGA GGAAATGAGT 6180 

CAATTTGATT TCTCAACCGG AGTT CAGAGC TATTCATACA GCTCCCAAGA CCCTAGACCT 6240 

GCCACTGGTC GTTTTCGGAG ACGGGAGCAG CGGGACCCCA CGGTGCATGA TGATGTGCTG 6300 

GAGCTGGAGA TGGACGAGCT CAATCGGCAT GAGTGCATGG CGCCCCTGAC GGCCCTGGTC 6360 

AAGCACATGC ACAGAAGCCT GGGCCCGCCT CAAGGAGAAG AGGATTCAGT GCCAAGAGAT 6420 

CTTCCTTCTT GGATGAAATT CCTCCATGGC AAACTGGGAA ATCCAATAGT ACCATTAAAT 6480 

ATCCGTCTCT TCTTAGCCAA GCTTGTTATT AATACAGAAG AGGTCTTTCG CCCTTACGCG 6540 

AAGCACTGGC TTAGCCCCTT GCTGCAGCTG GCTGCTTCTG AAAACAATGG AGGAGAAGGA 660 0 

ATTCACTACA TGGTGGTTGA GATAGTGGCC ACTATTCTTT CATGGACAGG CTTGGCCACT 6660 

CCAACAGGGG TCCCTAAAGA TGAAGTGTTA GCAAAT CGAT TGCTTAATTT CCTAATGAAA 672 0 

CATGTCTTTC ATCCAAAAAG AGCTGTGTTT AG ACACAAC C TTGAAATTAT AAAGACCCTT 6780 

GTCGAGTGCT GGAAGGATTG TTTATCCATC CCTTATAGGT TAATATTTGA AAAGTTTTCC 6840 

GGTAAAGATC CTAATTCTAA AGACAACTCA GTAGGGATTC AATTGCTAGG CATCGTGATG 6900 

GCCAATGACC TGCCTCCCTA TGACCCACAG TGTGGCATCC AGAGTAGCGA ATACTTCCAG 6 960 

GCTTTGGTGA ATAATATGTC CTTTGTAAGA TATAAAGAAG TGTATGCCGC TGCAGCAGAA 7020 

GTT CTAGGAC TTATACTTCG ATATGTTATG GAGAGAAAAA ACATACTGGA GGAGTCTCTG 7080 

TGTGAACTGG TTGCGAAACA ATTGAAGCAA CATCAGAATA CTATGGAGGA CAAGTTTATT 7140 

GTGTGCTTGA ACAAAGTGAC CAAGAGCTTC CCTCCTCTTG CAGACAGGTT CATGAATGCT 7200 

GTGTTCTTTC TGCTGCCAAA ATTTCATGGA GTGTTGAAAA CACTCTGTCT GGAGGTGGTA 72 60 

CTTTGTCGTG TGGAGGGAAT GACAGAGCTG TACTTCCAGT TAAAGAGCAA GGACTTCGTT 7320 

CAAGTCATGA GACATAGAGA TGATGAAAGA CAAAAAGTAT GTTTGGACAT AATTTATAAG 7380 

ATGATGCCAA AGTTAAAACC AGTAGAACTC CGAGAACTTC TGAACCCCGT TGTGGAATTC 7440 

GTTTCCCATC CTTCTACAAC ATGTAGGGAA CAAATGTATA ATATTCTCAT GTGGATTCAT 7500 

GATAATTACA GAGATCCAGA AAGTGAGACA GATAATGACT CCCAGGAAAT ATTTAAGTTG 7560 

GCAAAAGATG TGCTGATTCA AGGATTGATC GATGAGAACC CTGGACTTCA ATTAATTATT 7620 

CGAAATTTCT GGAGCCATGA AACTAGGTTA CCTTCAAATA CCTTGGACCG GTTGCTGGCA 7680 

CTAAATTCCT TATATTCTCC TAAGATAGAA GTGCACTTTT TAAGTTTAGC AACAAATTTT 7740 

CTGCTCGAAA TGACCAGCAT GAGCCCAGAT TATCCAAACC CCATGTTCGA GCATCCTCTG 7800 

TCAGAATGCG AATTT CAGGA ATATACCATT GATTCTGATT GGCGTTTCCG AAGTACTGTT 7860 

CTCACTCCGA TGTTTGTGGA GACCCAGGCC TCCCAGGGCA CTCTCCAGAC CCGTACCCAG 7920 

GAAGGGTCCC TCTCAGCTCG CTGGCCAGTG GCAGGGCAGA TAAGGGCCAC CCAGCAGCAG 7980 

CATGACTTCA CACTGACACA GACTGCAGAT GGAAGAAGCT CATTTGATTG GCTGACCGGG 8040 

AGCAGCACTG ACCCGCTGGT CGACCACACC AGTCCCTCAT CTGACTCCTT GCTGTTTGCC 8100 

CACAAGAGGA GTGAAAGGTT ACAGAGAGCA CCCTTGAAGT CAGTGGGGCC TGATTTTGGG 8160 

AAAAAAAGGC TGGGCCTTCC AGGGGACGAG GTGGATAACA AAGTGAAAGG TGCGGCCGGC 8220 

CGGACGGACC TACTACGACT GCGCAGACGG TTTATGAGGG ACCAGGAGAA GCTCAGTTTG 82 80 

ATGTATGCCA GAAAAGGCGT TGCTGAGCAA AAACGAGAGA AGGAAATCAA GAGTGAGTTA 8340 

AAAATGAAGC AGGATGCCCA GGTCGTTCTG TACAGAAGCT ACCGGCACGG AGACCTTCCT 8400 

GACATTCAGA TCAAGCACAG CAGCCTCATC ACCCCGTTAC AGGCCGTGGC CCAGAGGGAC 8460 

CCAATAATTG CAAAACAGCT CTTTAGCAGC TTGTTTTCTG GAATTTTGAA AGAGATGGAT 8520 

AAATTTAAGA CACTGTCTGA AAAAAACAAC ATCACTCAAA AGTTGCTTCA AGACTTCAAT 8580 

CGTTTTCTTA ATACCACCTT CTCTTTCTTT CCACCCTTTG TCTCTTGTAT TCAGGACATT 8640 

AGCTGTCAGC ACGCAGCCCT GCTGAGCCTC GACCCAGCGG CTGTTAGCGC TGGTTGCCTG 8700 

GCCAGCCTAC AGCAGCCCGT GGGCAT CCGC CTGCTAGAGG AGGCTCTGCT CCGCCTGCTG 8760 

CCTGCTGAGC TGCCTGCCAA GCGAGTCCGT GGGAAGGCCC GCCTCCCTCC TGATGTCCTC 882 0 

AGATGGGTGG AGCTTGCTAA GCTGTATAGA TCAATTGGAG AATACGACGT CCTCCGTGGG 88 80 

ATTTTTACCA GTGAGATAGG AACAAAGCAA AT CACT GAG A GTGCATTATT AGCAGAAGCC 8940 

AGAAGTGATT ATT CTGAAGC TGCTAAGCAG TATGATGAGG CTCTCAATAA ACAAGACTGG 9000 

GTAGATGGTG AGCCCACAGA AGCCGAGAAG GATTTTTGGG AACTTGCATC CCTTGACTGT 9060 
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TACAACCACC TTGCTGAGTG GAAATCACTT GAATACTGTT CTACAGCCAG TATAGACAGT 912 0 

GAGAACCCCC CAGACCTAAA TAAAATCTGG AGTGAACCAT TTTATCAGGA AACATATCTA 9180 

CCTTACATGA TCCGCAGCAA GCTGAAGCTG CTGCTCCAGG GAGAGGCTGA CCAGTCCCTG 9240 

CTGACATTTA TTGACAAAGC TATGCACGGG GAGCTCCAGA AGGCGATTCT AGAGCTTCAT 9300 

5 TACAGTCAAG AGCTGAGTCT GCTTTACCTC CTGCAAGATG ATGTTGACAG AGCCAAATAT 9360 

TACATTCAAA ATGGCATTCA GAGTTTTATG CAGAATTATT CTAGTATTGA TGTCCTCTTA 9420 

CACCAAAGTA GACTCACCAA ATTGCAGTCT GTACAGGCTT TAACAGAAAT TCAGGAGTTC 9480 

ATCAGCTTTA TAAGCAAACA AGGCAATTTA TCATCTCAAG TTCCCCTTAA GAGACTTCTG 9540 

AACACCTGGA CAAACAGATA TCCAGATGCT AAAATGGACC CAATGAACAT CTGGGATGAC 9600 

10 ATCATCACAA ATCGATGTTT CTTTCTCAGC AAAATAGAGG AGAAGCTTAC CCCTCTTCCA 9660 

GAAGATAATA GTATGAATGT GGATCAAGAT GGAGACCCCA GTGACAGGAT GGAAGTGCAA 972 0 

GAGCAGGAAG AAGATATCAG CTCCCTGATC AGGAGTTGCA AGTTTTCCAT GAAAATGAAG 9780 

ATGATAGACA GTGCCCGGAA GCAGAACAAT TTCTCACTTG CTATGAAACT ACTGAAGGAG 9840 

CTGCATAAAG AGTCAAAAAC CAGAGACGAT TGGCTGGTGA GCTGGGTGCA GAGCTACTGC 9900 

15 CGCCTGAGCC ACTGCCGGAG CCGGTCCCAG GGCTGCTCTG AGCAGGTGCT CACTGTGCTG 9960 

AAAACAGTCT CTTTGTTGGA TGAGAACAAC GTGTCAAGCT ACTTAAGCAA AAATATTCTG 1002 0 

GCTTTCCGTG ACCAGAACAT TCTCTTGGGT ACAACTTACA GGATCATAGC GAATGCTCTC 10080 

AGCAGTGAGC CAGCCTGCCT TGCTGAAATC GAGGAGGACA AGGCTAGAAG AATCTTAGAG 10140 

CTTTCTGGAT CCAGTTCAGA GGATTCAGAG AAGGTGATCG CGGGTCTGTA CCAGAGAGCA 10200 

20 TTCCAGCACC TCTCTGAGGC TGTGCAGGCG GCTGAGGAGG AGGCCCAGCC TCCCTCCTGG 10260 

AGCTGTGGGC CTGCAGCTGG GGTGATTGAT GCTTACATGA CGCTGGCAGA TTTCTGTGAC 10320 

CAACAGCTGC GCAAGGAGGA AGAGAATGCA TCAGTTATTG ATTCTGCAGA ACTGCAGGCG 10380 

TATCCAGCAC TTGTGGTGGA GAAAATGTTG AAAGCTTTAA AATTAAATTC CAATGAAGCC 10440 

AGAT TGAAGT TTCCTAGATT ACTTCAGATT ATAGAACGGT ATCCAGAGGA GACTTTGAGC 10500 

25 CTCATGACAA AAGAGATCTC TTCCGTTCCC TGCTGGCAGT TCATCAGCTG GATCAGCCAC 10560 

ATGGTGGCCT TACTGGACAA AGACCAAGCC GTTGCTGTTC AGCACTCTGT GGAAGAAATC 10620 

ACTGATAACT ACCCGCAGGC TATTGTTTAT CCCTTCATCA TAAGCAGCGA AAGCTATTCC 10680 

TTCAAGGATA CTTCTACTGG T CAT AAGAAT AAGGAGTTTG TGGCAAGGAT TAAAAGTAAG 10740 

TTGGATCAAG GAGGAGTGAT TCAAGATTTT ATTAATGCCT TAGATCAGCT CTCTAATCCT 10800 

30 GAACTGCTCT TTAAGGATTG GAGCAATGAT GTAAGAGCTG AAC TAG CAAA AACCCCTGTA 10860 

AATAAAAAAA ACATTGAAAA AATGTATGAA AGAATGTATG CAGCCTTGGG TGACCCAAAG 10 920 

GCTCCAGGCC TGGGGGCCTT TAGAAGGAAG TTTATTCAGA CTTTTGGAAA AGAATTTGAT 10980 

AAACATTTTG GGAAAGGAGG TTCTAAACTA CTGAGAATGA AGCTCAGTGA CTTCAACGAC 11040 

ATTACCAACA TGCTACTTTT AAAAATGAAC AAAGACT CAA AGCCCCCTGG GAATCTGAAA 11100 

35 GAATGTTCAC CCTGGATGAG CGACTTCAAA GTGGAGTTCC TGAGAAATGA GCTGGAGATT 11160 

CCCGGTCAGT ATGACGGTAG GGGAAAGCCA TTGCCAGAGT ACCACGTGCG AATCGCCGGG 11220 

TTTGATGAGC GGGTGACAGT CATGGCGTCT CTGCGAAGGC CCAAGCGCAT CATCATCCGT 11280 

GGCCATGACG AGAGGGAACA CCCTTTCCTG GTGAAGGGTG GCGAGGACCT GCGGCAGGAC 11340 

CAGCGCGTGG AGCAGCT CTT CCAGGTCATG AATGGGATCC TGGCCCAAGA CTCCGCCTGC 11400 

40 AGCCAGAGGG CCCTGCAGCT GAGGACCTAT AGCGTTGTGC CCATGACCTC CAGGTTAGGA 11460 

TTAATTGAGT GGCTTGAAAA TACTGTTACC TTGAAGGACC TTCTTTTGAA CACCATGTCC 11520 

CAAGAGGAGA AGGCGGCTTA CCTGAGTGAT CCCAGGGCAC CGCCGTGTGA ATATAAAGAT 11580 

TGGCTGACAA AAATGTCAGG AAAACATGAT GTTGGAGCTT ACATGCTAAT GTATAAGGGC 11640 

GCTAATCGTA CTGAAACAGT CACGTCTTTT AGAAAACGAG AAAGTAAAGT GCCTGCTGAT 11700 

45 CTCTTAAAGC GGGCCTTCGT GAGGATGAGT ACAAGCCCTG AGGCTTTCCT GGCGCTCCGC 11760 

TCCCACTTCG CCAGCTCTCA CGCTCTGATA TGCATCAGCC ACTGGATCCT CGGGATTGGA 11820 

GACAGACATC TGAACAACTT TATGGTGGCC ATGGAGACTG GCGGCGTGAT CGGGATCGAC 11880 

TTTGGGCATG CGTTTGGATC CGCTACACAG TTTCTGCCAG TCCCTGAGTT GATGCCTTTT 11940 

CGGCTAACTC GCCAGTTTAT CAAT CTGATG TTACCAATGA AAGAAACGGG CCTTATGTAC 12000 

50 AGCAT CATGG TACACGCACT CCGGGCCTTC CGCTCAGACC CTGGCCTGCT CACCAACACC 12060 

ATGGATGTGT TTGTCAAGGA GCCCTCCTTT GATTGGAAAA ATTTTGAACA GAAAATGCTG 12120 

AAAAAAGGAG GGTCATGGAT TCAAGAAATA AATGTTGCTG AAAAAAATTG GTACCCCCGA 12180 

CAGAAAATAT GTTACGCTAA GAGAAAGTTA GCAGGTGCCA ATCCAGCAGT CATTACTTGT 12240 

GATGAGCTAC TCCTGGGTCA TGAGAAGGCC CCTGCCTTCA GAGACTATGT GGCTGTGGCA 12300 

55 CGAGGAAGCA AAGATCACAA CATTCGTGCC CAAGAACCAG AGAGTGGGCT TTCAGAAGAG 12360 

ACTCAAGTGA AGTGCCTGAT GGACCAGGCA ACAGACCCCA ACATCCTTGG CAGAACCTGG 12420 

GAAGGATGGG AGCCCTGGAT GTGAGGTCTG TGGGAGTCTG CAGATAGAAA GCATTACATT 12480 

GTTTAAAGAA TCTACTATAC TTTGGTTGGC AGCATTCCAT GAGCTGATTT TCCTGAAACA 12540 

CTAAAGAGAA ATGTCT TTTG TGCTACAGTT TCGTAGCATG AGTTTAAATC AAGATTATGA 12 600 

60 TGAGTAAATG TGTATGGGTT AAATCAAAGA TAAGGT TATA GTAACATCAA AGATTAGGTG 12660 

AGGTTTATAG AAAGATAGAT ATCCAGGCTT ACCAAAGTAT TAAGTCAAGA ATATAATATG 1272 0 

TG AT CAG CTT T C AAAG CAT T TACAAGTGCT GCAAGTTAGT GAAACAGCTG TCTCCGTAAA 12780 

TGGAGGAAAT GTGGGGAAGC CTTGGAATGC CCTTCTGGTT CTGGCACATT GGAAAGCACA 12840 

CTCAGAAGGC TTCATCACCA AGATTTTGGG AGAGTAAAGC TAAGTATAGT TGATGTAACA 12 900 

65 TTGTAGAAGC AGCAT AGGAA CAATAAGAAC AATAGGTAAA GCTATAATTA TGGCTTATAT 12960 

TTAGAAATGA CTGCATTTGA TATTTTAGGA TATTTTTCTA GGTTTTTTCC TTTCATTTTA 1302 0 

TTCTCTTCTA GTTTTGACAT TTTATGATAG ATTTGCTCTC TAGAAGGAAA CGTCTTTATT 13080 

TAGGAGGGCA AAAATTTTGG TCATAGCATT CACTTTTGCT ATTCCAATCT ACAACTGGAA 13140 

GATACATAAA AGTGCTTTGC ATTGAATTTG GGATAACTTC AAAAATCCCA TGGTTGTTGT 132 00 

70 TAGGGATAGT ACTAAGCATT TCAGTTCCAG GAGAATAAAA GAAATTCCTA TTTGAAATGA 13260 

ATTCCTCATT TGGAGGAAAA AAAGCATGCA TTCTAGCACA ACAAGATGAA ATTATGGAAT 13320 

ACAAAAGTGG CTCCTTCCCA TGTGCAGTCC CTGTCCCCCC CCGCCAGTCC TCCACACCCA 133 80 

AACTGTTTCT GATTGGCTTT TAGCTTTTTG TTGTTTTTTT TTTTCCTTCT AACACTTGTA 13440 

TTTGGAGGCT CTTCTGTGAT TTTGAGAAGT ATACTCTTGA GTGTTTAATA AAGTTTTTTT 13500 
75 CCAAAAGTA 

Seq ID NO: 99 Protein sequence: 
Protein Accession #; NP_008835.5 

80 1 11 21 31 41 51 

1 I I I I I 

MAGSGAGVRC SLLRLQETLS AADRCGAALA GHQLIRGLGQ ECVLSSSPAV LALQTSLVFS 60 

RDFGLLVFVR KSLNSIEFRE CREE I LKFLC IFLEKMGQKI APYSVEIKNT CTSVYTKDRA 120 

AKCKI PALDL LIKLLQTFRS SRLMDEFKI G ELFSKFYGEL ALKKKI PDTV LEKVYELLGL 180 

85 LGEVHPSEMI NNAENLFRAF LGELKTQMTS AVREPKLPVL AGCLKGLSSL LCNFTKSMEE 240 

DPQTSREIFN FVLKAIRPQI DLKRYAVP S A GIiRLFALHAS QFSTCLLDNY VSLFEVLLKW 300 

CAHTNVELKK AALSALESFXj KQVSNMVAKN AEMHKNKLQY FMEQFYGIIR NVDSNNKELS 360 
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IAIRGYGLFA GPCKVINAKD VDFMYVELIQ RCKQMPLTQT DTGDDRVYQM PSFLQSVASV 420 

LLYLDTVPEV YTPVLEHLW MQIDSFPQYS PKMQLVCCRA IVKVFLALAA KGPVLRNCIS 480 

TWHQGLIRI CSKPWLPKG PESESEDHRA SGEVRTGKWK VPTYKDYVDL FRHLLSSDQM 540 

MDSIIiADEAF FSVNSSSESL NHLLYDEFVK SVLKIVEKLD LTLEIQTVGE QENGDEAPGV 600 

WMIPTSDPAA NLHPAKPKDF SAFINLVEFC REILPEKQAE FFEPWVYSFS YELILQSTRL 660 

PLISGFYKLL S I T VRNAKK I KYFEGVSPKS LKHSPEDPEK YSCFALFVKF GKEVAVKMKQ 720 

YKDELLASCL TFLLSLPHNI IELDVRAYVP ALQMAFKLGL SYTPLAEVGL NALEEWSIYI 780 

DRHVMQPYYK DILPCLDGYL KTSALSDETK NNWEVSALSR AAQKGFNKW LKHLKKTKNL 840 

SSNEAISLEE IRIRWQMLG SLGGQINKNL LTVTSSDEMM KSYVAWDREK RLSFAVPFRE 900 

MKPVIFLDVF LPRVTELALT ASDRQTKVAA CELLHSMVMF MLGKATQMPE GGQGAPPMYQ 960 

LYKRTFPVIiL RLACDVDQVT RQLYEPLVMQ LIHWFTNNKK FESQDTVALL EAILDGIVDP 1020 

VDSTLRDFCG RCIREFLKWS IKQITPQQQE KSPVNTKSLF KRLYSLAL.HP NAFKRLGASL 1080 

AFNNIYREFR EEESLVEQFV FEALVIYMES LALAHADEKS LGTIQQCCDA IDHLCRIIEK 1140 

KHVSLNKAKK RRLPRGFPPS ASLCLLDLVK WLLAHCGRPQ TECRHKSIEL FYKFVPLLPG 1200 

NRSPNLWLKD VLKEEGVSFL INTFEGGGCG QPSGILAQPT LLYLRGPFSL QATLCWLDLL 1260 

LAAIiEC YNT F I GERT VGALQ VLGTEAQSSL LKAVAFFLES IAMHDIIAAE KCFGTGAAGN 1320 

RTSPQEGERY NYSKCTWVR IMEFTTTLLN TSPEGWKLLK KDLCNTHLMR VLVQTLCEPA 1380 

SIGFNIGDVQ VMAHLPDVCV NLMKALKMSP YKDILETHLR EKITAQSIEE LCAVNLYGPD 1440 

AQVDRSRLAA WSACKQLHR AGLLHNILPS QSTDLHHSVG TELLSLVYKG IAPGDERQCIi 150 0 

PSLDLSCKQL ASGLLELAFA FGGLCERLVS LLLNPAVLST ASLGSSQGSV IHFSHGEYFY 1560 

SLFSETINTE LLKNLDLAVL ELMQSSVDNT KMVSAVLNGM LDQSFRERAN QKHQGLKLAT 162 0 

TILQHWKKCD SWWAKDSPLE TKMAVLALLA KILQIDSSVS FNTSHGSFPE VFTTYISLLA 1680 

DTKLDLHLKG QAVTLLPFFT SLTGGSLEEL RRVLEQLIVA HFPMQSREFP PGTPRFNNYV 1740 

DCMKKFLDAL ELSQSPMLLE LMTEVLCREQ QHVMEELFQS SFRRIARRGS CVTQVGLLES 1800 

VYEMFRKDDP RLSFTRQSFV DRSLLTLLWH CSLDALREFF STIWDAIDV LKSRFTKLNE 1860 

STFDTQITKK MGYYKILDVM YSRLPKDDVH AKESKINQVF HGSCITEGNE LTKTLIKLCY 1920 

DAFTENMAGE NQXjLERRRLY HCAAYNCAIS VICCVFNELK FYQGFLFSEK PEKNLLIFEN 1980 

LIDLKRRYNF PVEVEVPMER KKKYIEIRKE AREAANGDSD GPSYMSSDSY LADSTLSEEM 2040 

SQFDFSTGVQ SYSYSSQDPR PATGRFRRRE QRDPTVHDDV LELEMDELNR HECMAPLTAL 2100 

VKHMHR SLGP PQGEEDSVPR DLPSWMKFLH GKLGNPIVPL NIRLFLAKLV INTEEVFRPY 2160 

AKHWLSPLLQ LAASENNGGE GIHYMWEIV ATILSWTGLA TPTGVPKDEV LANRLLNFLM 2220 

KHVFHPKRAV FRHNLE 1 1 KT LVECWKDCLS IPYRLIFEKF SGKDPNSKDN SVGIQLLGIV 2280 

MANDLPPYDP QCGIQSSEYF QALVNNMSFV RYKEVYAAAA EVLGLILRYV MERKNILEES 2340 

LCELVAKQLK QHQNTMEDKF IVCLNKVTKS FPPLADRFMN AVFFLLPKFH GVLKTLCLEV 2400 

VLCRVEGMTE LYFQLKSKDF VQVMRHRDDE RQKVCLDI I Y KMMPKLKPVE LRELLNPWE 2460 

FVSHPSTTCR EQMYNILMWI HDNYRDPESE TDNDSQEIFK LAKDVLIQGL IDENPGLQLI 2520 

IRNFWSHETR LPSNTLDRLL ALNSLYSPKI EVHFLSLATN FLLEMTSMSP DYPWPMFEHP 2580 

LSECEFQEYT IDSDWRFRST VLTPMFVETQ ASQGTLQTRT QEGSLSARWP VAGQIRATQQ 2 640 

QHDFTLTQTA DGRSSFDWLT GSSTDPLVDH TSPSSDSLLF AHKRSERLQR APLKSVGPDF 2700 

GKKRLGLPGD EVDNKVKGAA GRTDLLRLRR RFMRDQEKLS LMYARKGVAE QKREKEIKSE 2760 

LKMKQDAQW LYRSYRHGDL PDIQIKHSSL ITPLQAVAQR DPIIAKQDFS SLFSGILKEM 2820 

DKFKTLSEKN NITQKLLQDF NRFLNTTFSF FPPFVSCIQD ISCQHAALLS LDPAAVSAGC 2880 

LASLQQPVGI RI/LEEALLRL LPAELPAKRV RGKARLPPDV LRWVELAKtiY RSIGEYDVLR 2 940 

GIFTSEIGTK QITQSALLAE ARSDYSEAAK QYDEALNKQD WVDGEPTEAE KDFWELASLD 3 000 

CYNHLAEWKS LEYCSTASID SENPPDLNKI WSEPFYQETY LPYMIRSKLK LLLQGEADQS 3060 

LLTFIDKAMH GELQKAILEL HYSQELSLLY LLQDDVDRAK YYIQNGIQSF MQNYSSIDVL 3120 

LHQSRLTKLQ SVQALTEIQE FISFISKQGN LSSQVPLKRL LNTWTNRYPD AKMDPMNIWD 3180 

DIITNRCFFIi SKIEEKLTPL PEDNSMNVDQ DGDPSDRMEV QEQEEDISSL IRSCKFSMKM 3240 

KMIDSARKQN NFSLAMKLLK ELHKESKTRD DWLVSWVQSY CRLSHCRSRS QGCSEQVLTV 33 00 

LKTVSLLDEN NVSSYLSKNI LAFRDQNILL GTTYRI I ANA LSSEPACLAE IEEDKARRIL 33 60 

ELSGSSSEDS EKVIAGLYQR AFQHLSEAVQ AAEEEAQPPS WSCGPAAGVI DAYMTLADFC 3420 

DQQLRKEEEN ASVIDSAELQ AYPALWEKM LKALKLNSNE ARLKFPRLLQ IIERYPEETL 3 480 

SLMTKEISSV PCWQFISWIS HMVALLDKDQ AVAVQHSVEE ITDNYPQAIV YPFIISSESY 3540 

SFKDTSTGHK NKEFVARIKS KLDQGGVIQD FINALDQLSN PELLFKDWSN DVRAELAKTP 3600 

VNKKN I EKMY ERMYAALGDP KAPGLGAFRR KFIQTFGKEF DKHFGKGGSK LLRMKLSDFN 3660 

DITNMLLLKM NKDSKPPGNL KECSPWMSDF KVEFLRNELE I PGQYDGRGK PLPEYHVRIA 3720 

GFDERVTVMA SLRRPKRIII RGHDEREHPF LVKGGEDLRQ DQRVEQLFQV MNGILAQDSA 3 780 

CSQRALQLRT YSWPMTSRL GLIEWLENTV TLKDLLLNTM SQEEKAAYLS DPRAPPCEYK 3840 

DWLTKMSGKH DVGAYMLMYK GANRTETVTS FRKRESKVPA DLLKRAFVRM STSPEAFLAL 3 900 

RSHF AS SHAL ICISHWILGI GDRHLNNFMV AMBTGGVIGI DFGHAFGSAT QFLPVPELMP 3 960 

FRLTRQFIWL MLPMKETGLM YSIMVHALRA FRSDPGI1I1TN TMDVFVKEPS FDWKNFEQKM 4 020 

LKKGGSWIQE INVAEKNWYP RQKICYAKRK LAGANPAVIT CDELLLGHEK APAFRDYVAV 4080 
ARGSKDHNIR AQEPESGLSE ETQVKCLMDQ ATDPNILGRT WEGWEPWM 

Seq ID NO: 100 DNA sequence 
Nucleic Acid Accession #: NM_000673 
Coding sequence: 101-1225 

1 11 21 31 41 51 

I I I I I I 

ATGTGAAGGC ACAAGCTGCT GTTATATACA ACAGAGTGAA CTGAG CATCA GTCAGAAAAA 60 

GTCTATGTTT GCAGAAATAC AGATCCAAGA CAAAGACAGG ATGGGCACTG CTGGAAAAGT 120 

TATTAAATGC AAAGCAGCTG TGCTTTGGGA GCAGAAGCAA CCCTTCTCCA TTGAGGAAAT 180 

AGAAGTTGCC CCACCAAAGA CTAAAGAAGT TCGCATTAAG ATTTTGGCCA CAGGAATCTG 240 

TCGCACAGAT GACCATGTGA TAAAAGGAAC AATGGTGTCC AAGTTTCCAG TG AT TGTGGG 300 

ACATGAGGCA ACTGGGATTG TAGAGAGCAT TGGAGAAGGA GTGACTACAG TGAAACCAGG 360 

TGACAAAGTC ATCCCTCTCT TTCTGCCACA ATGTAGAGAA TGCAATGCTT GTCGCAACCC 420 

AGATGGCAAC CTTTGCATTA GGAGCGATAT TACTGGTCGT GGAGTACTGG CTGATGGCAC 480 

CACCAGATTT ACATGCAAGG GCAAACCAGT ACACCACTTC ATGAACACCA GTACATTTAC 540 

CGAGTACACA GTGGTGGATG AATCTTCTGT TGCTAAGATT GATGATGCAG CTCCTCCTGA 600 

GAAAGTCTGT TTAATTGGCT GTGGGTTTTC CACTGGATAT GGCGCTGCTG TTAAAACTGG 660 

CAAGGTCAAA CCTGGTTCCA CTTGCGTCGT CTTTGGCCTG GGAGGAGTTG GCCTGTCAGT 720 

CATCATGGGC TGTAAGTCAG CTGGTGCATC TAGGATCATT GGGATTGACC TCAACAAAGA 780 

CAAATTTGAG AAGGCCATGG CTGTAGGTGC CACTGAGTGT ATCAGTCCCA AGGACTCTAC 840 

CAAACCCATC AGTGAGGTGC TGTCAGAAAT GACAGGCAAC AACGTGGGAT ACACCTTTGA 900 

AGTTATTGGG CATCTTGAAA CCATGATTGA TGCCCTGGCA TCCTGCCACA TGAACTATGG 960 

GACCAGCGTG GTTGTAGGAG TTCCTCCATC AGCCAAGATG CTCACCTATG ACCCGATGTT 1020 
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GCTCTTCACT GGACGCACAT GGAAGGGATG TGTCTTTGGA GGTTTGAAAA GCAGAGATGA 1080 

TGTCCCAAAA CTAGTGACTG AGTTCCTGGC AAAGAAATTT GACCTGGACC AGTTGATAAC 1140 

TCATGTTTTA CCATTTAAAA AAAT CAGTGA AGGATTTGAG CTGCTCAATT CAGGACAAAG 1200 

CATTCGAACG GTCCTGACGT TTTGAGATCC AAAGTGGCAG GAGGTCTGTG TTGTCATGGT 1260 

GAACTGGAGT TTCTCTTGTG AGAGTTCCCT CATCTGAAAT CATGTATCTG TCTCACAAAT 1320 

ACAAGCATAA GTAGAAGATT TGTTGAAGAC ATAGAACCCT TATAAAGAAT TATTAACCTT 13 80 

TATAAACATT TAAAGTCTTG TGAGCACCTG GGAATTAGTA TAATAACAAT GTTAATATTT 1440 

TTGATTTACA TTTTGTAAGG CTATAATTGT ATCTTTTAAG AAAACATACA CTTGGATTTC 1500 

TATGTTGAAA TGGAGATTTT TAAGAGTTTT AACCAGCTGC TGCAGATATA TAACTCAAAA 1560 

CAGATATAGC GTATAAAGAT ATAGTAAATG CATCTCCCAG AGTAATATTC ACTTAACACA 1620 

TTGAAACTAT TATTTTTTAG ATTTGAATAT AAATGTATTT TTTAAACACT TGTTATGAGT 1680 

TAACTTGGAT TACATTTTGA AATCAGTTCA TTCCATGATG CATATTACTG GATTAGATTA 1740 

AGAAAGACAG AAAAGATTAA GGGACGGGCA CATTTTTCAA CGATTAAGAA TCATCATTAC 1800 

ATAACTTGGT GAAACTGAAA AAGTATATCA TATGGGTACA CAAGGCTATT TGCCAGCATA 1860 

TATTAATATT TTAGAAAATA TTCCTTTTGT AATACTGAAT ATAAACATAG AGCTAGAGTC 1920 

ATATTATCAT ACTT AT CAT A ATGTTCAATT TGATACAGTA GAATTGCAAG TCCCTAAGTC 1980 

CCTATTCACT GTGCTTAGTA GTGACTCCAT TTAATAAAAA GTGTTTTTAG TTTTTAACAA 2 040 
CTAAACCG 



Seq ID NO: 101 Protein sequence: 
Protein Accession #: NP_00 0 664 

1 11 21 31 41 51 

I I I I 1 I 

MGTAGKVIKC KAAVLWEQKQ PFSIEEIEVA PPKTKEVRIK ILATGICRTD DHVIKGTMVS 60 
KFPVIVGHEA TGIVESIGEG VTTVKPGDKV IPLFLPQCRE CNACRNPDGN LCIRSDITGR 120 
GVLADGTTRF TCKGKPVHHF MNTSTFTEYT WDESSVAKX DDAAPPEKVC LIGCGFSTGY 180 
GAAVKTGKVK PGSTCWFGL GGVGL.SVIMG CKSAGASRII GIDLNKDKFE KAMAVGATEC 240 
ISPKDSTKPI SEVLSEMTGN NVGYTFEVIG HLETM I DAL A SCHMNYGTSV WGVPPSAKM 300 
LTYDPMLLFT GRTWKGCVFG GLKSRDDVPK LVTEFLAKKF DLDQLITHVL PFKKISEGFE 360 
LLNSGQSIRT VLTF 

Seq ID NO: 102 DNA sequence 

Nucleic Acid Accession # : NM_006783.1 

Coding sequence: 1..786 

1 11 21 31 41 51 

i I i I I I 

ATGGATTGGG GGACGCTGCA CACTTTCATC GGGGGTGTCA ACAAACACTC CACCAGCATC 60 

GGGAAGGTGT GGATCACAGT CATCTTTATT TTCCGAGTCA TGATCCTAGT GGTGGCTGCC 120 

CAGGAAGTGT GGGGTGACGA GCAAGAGGAC TTCGTCTGCA ACACACTGCA ACCGGGATGC 180 

AAAAATGTGT GCTATGACCA CTTTTTCCCG GTGTCCCACA TCCGGCTGTG GGCCCTCCAG 240 

CTGATCTTCG TCTCCACCCC AGCGCTGCTG GTGGCCATGC ATGTGGCCTA CTACAGGCAC 300 

GAAACCACTC GCAAGTT C AG GCGAGGAGAG AAGAGGAATG ATTTCAAAGA CATAGAGGAC 360 

ATTAAAAAGC ACAAGGTTCG GATAGAGGGG TCGCTGTGGT GGACGTACAC CAGCAGCATC 420 

TTTTTCCGAA TCATCTTTGA AGCAGCCTTT ATGTATGTGT TTTACTTCCT TTACAATGGG 480 

TACCACCTGC CCTGGGTGTT GAAATGTGGG ATTGACCCCT GCCCCAACCT TGTTGACTGC 540 

TTTATTTCTA GGCCAACAGA GAAGACCGTG TTTACCATTT TTATGATTTC TGCGTCTGTG 600 

ATTTGCATGC TGCTTAACGT GGCAGAGTTG TGCTACCTGC TGCTGAAAGT GTGTTTTAGG 660 

AGATCAAAGA GAGCACAGAC GCAAAAAAAT CACCCCAATC ATGCCCTAAA GGAGAGTAAG 720 

CAGAATGAAA TGAATGAGCT GATTTCAGAT AGTGGTCAAA ATGCAAT CAC AGGTTTCCCA 780 
AGCTAA 



Seq ID NO: 103 Protein sequence: 
Protein Accession #: NP_006774.1 



1 11 21 31 41 51 

I I I I 1 1 

MDWGTLHTFI GGVNKHSTS I GKVWITVIFI FRVMILWAA QEVWGDEQED FVCNTLQPGC 60 

KNVCYDHFFP VSHIRLWALQ LIFVSTPALL VAMHVAYYRH ETTRKFRRGE KRNDFKDIED 120 

IKKHKVRIEG SLWWTYTSSI FFRIIFEAAF MYVFYFLYNG YHLPWVLKCG IDPCPNLVDC 180 

FISRPTEKTV FTIFMISASV ICMLLNVAEL CYLLLKVCFR RSKRAQTQKN HPNHALKESK 240 

QNEMNELISD SGQNAITGFP S 



Seq ID NO: 104 DNA sequence 
Nucleic Acid Accession #: NM_020411 
Coding sequence: 86-52 6 



1 11 21 31 41 51 

I 1 1 I I I 

GGACCTGGGA AGGAGCATAG G ACAGGG CAA GGCGGGATAA GGAGGGGCAC CACAGCCCTT 60 

AAGGCACGAG GGAACCTCAC TGCGCATGCT CCTTTGGTGC CCACCTCAGT GCGCATGTTC 120 

ACTGGGCGTC TTCCCATCGG CCCCTTCGCC AGTGTGGGGA ACGCGGCGGA GCTGTGAGCC 180 

GGCGACTCGG GTCCCTGAGG TCTGGATTCT TTCTCCGCTA CTGAGACACG GCGGACACAC 240 

ACAAACACAG AACCACACAG CCAGTCCCAG GAGCCCAGTA ATGGAGAGCC CCAAAAAGAA 300 

GAACCAGCAG CTGAAAGTCG GGATCCTACA CCTGGGCAGC AGACAGAAGA AGATCAGGAT 360 

ACAGCTGAGA TCCCAGTGCG CGACATGGAA GGTGATCTGC AAGAGCTGCA TCAGTCAAAC 420 

ACCGGGGATA AATCTGGATT TGGGTTCCGG CGTCAAGGTG AAGATAATAC CTAAAGAGGA 480 

ACACTGTAAA ATGCCAGAAG CAGGTGAAGA GCAACCACAA GTTTAAATGA AGACAAGCTG 540 

AAACAACGCA AGCTGGTTTT ATATTAGATA TTTGACTTAA ACTATCTCAA TAAAGTTTTG 600 
CAGCTTTCAC CAAAAAAAAA AAAAAA 



Seq ID NO: 105 Protein sequence: 
Protein Accession #: NP_065144.1 

1 11 21 31 41 51 
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I I I I I 1 

MLLWCPPQCA CSLGVFPSAP SPVWGTRRSC EPATRVPEVW ILSPLLRHGG HTQTQNHTAS 
PRSPVMESPK KKNQQLKVGI LHLGSRQKKI RIQLRSQCAT WKVICKSCIS QTPGINLDLG 
SGVKVKIIPK EEHCKMPEAG EEQPQV 



Seq ID NO: 106 DNA sequence 
Nucleic Acid Accession #: J04129 
Coding sequence: 99-587 



1 11 21 31 41 51 

I I I 1 I I 

CATCCCTCTG GCTCCAGAGC TCAGAGCCAC CCACAGCCGC AGCCATGCTG TGCCTCCTGC 
TCACCCTGGG CGTGGCCCTG GTCTGTGGTG TCCCGGCCAT GGACATCCCC CAGACCAAGC 
AGGACCTGGA GCTCCCAAAG TTGGCAGGGA CCTGGCACTC CATGGCCATG GCGACCAACA 
ACATCTCCCT CATGGCGACA CTGAAGGCCC CTCTGAGGGT CCACATCACC TCACTGTTGC 
CCACCCCCGA GGACAACCTG GAGAT CGTTC TGCACAGATG GGAGAACAAC AGCTGTGTTG 
AGAAGAAGGT CCTTGGAGAG AAGACTGGGA ATCCAAAGAA GTTCAAGATC AACTATACGG 
TGGCGAACGA GGCCACGCTG CTCGATACTG ACTACGACAA TTTCCTGTTT CTCTGCCTAC 
AGGACACCAC CACCCCCATC CAGAGCATGA TGTGCCAGTA CCTGG CCAGA GTCCTGGTGG 
AGGACGATGA GATCATGCAG GGATTCATCA GGGCTTTCAG GCCCCTGCCC AGGCACCTAT 
GGTACTTGCT GGACTTGAAA CAGATGGAAG AGCCGTGCCG TTTCTAGCTC ACCTCCGCCT 
CCAGGAAGAC CAGACTCCCA CCCTTCCACA CCTCCAGAGC AGTGGGACTT CCTCCTGCCC 
TTTCAAAGAA TAACCACAGC TCAGAAGACG ATGACGTGGT CAT CTGTGT C GCCATCCCCT 
TCCTGCTGCA CACCTGCACC AT TGCCATGG GGAGGCTGCT CCCTGGGGGC AGAGTCTCTG 
GCAGAGGTTA TTAATAAACC CTTGGAGCAT G 



Seq ID NO: 107 Protein sequence: 
Protein Accession #: AAA60147 

1 11 21 31 41 51 

I I I I I 1 

MDIPQTKQDIj ELPKLAGTWH SMAMATNNI S LMATLKAPLR VHITSLLPTP EDNLEIVLHR 

WENNSCVEKK VLGEKTGNPK KFKINYTVAN EATLLDTDYD NFLFLCLQDT TTPIQSMMCQ 

YLARVLVEDD EIMQGFIRAF RPLPRHLWYL LDLKQMEEPC RF 



Seq ID NO: 108 DNA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence: 48-7 94 

1 11 21 31 41 51 

11)111 

TCCCAGGCAG CAGTTAGCCC GCCGCCCGCC TGTGTGT CCC C AG AG CCATG GAGAGAGCCA 

GTCTGATCCA GAAGGCCAAG CTGGCAGAGC AGGCCGAACG CTATGAGGAC ATGGCAGCCT 

TCATGAAAGG CGCCGTGGAG AAGGGCGAGG AGCTCTCCTG CGAAGAGCGA AACCTGCTCT 

CAGTAGCCTA TAAGAACGTG GTGGGCGGCC AGAGGGCTGC CTGGAGGGTG CTGTCCAGTA 

TTGAGCAGAA AAGCAACGAG GAGGGCTCGG AGGAGAAGGG GCCCGAGGTG CGTGAGTACC 

GGGAGAAGGT GGAGACTGAG CTCCAGGGCG TGTGCGACAC CGTGCTGGGC CTG CTGGACA 

GCCACCTCAT CAAGGAGGCC GGGGACGCCG AGAGCCGGGT CTTCTACCTG AAGATGAAGG 

GTGACTACTA CCGCTACCTG GCCGAGGTGG CCACCGGTGA CGACAAGAAG CGCATCATTG 

ACTCAGCCCG GTCAGCCTAC CAGGAGGCCA TGGACATCAG CAAGAAGGAG ATGCCGCCCA 

CCAACCCCAT CCGCCTGGGC CTGGCCCTGA ACTTTTCCGT CTTCCACTAC GAGAT CGCCA 

ACAGCCCCGA GGAGGCCATC TCTCTGGCCA AGACCACTTT CGACGAGGCC ATGGCTGATC 

TGCACACCCT CAGCGAGGAC TCCTACAAAG ACAGCACCCT CATCATGCAG CTGCTGCGAG 

ACAACCTGAC ACTGTGGACG GCCGACAACG CCGGGGAAGA GGGGGGCGAG GCTCCCCAGG 

AGCCCCAGAG CTGAGTGTTG CCCGCCACCG CCCCGCCCTG CCCCCTCCAG TCCCCCACCC 

TGCCGAGAGG ACTAGTATGG GGTGGGAGGC CCCACCCTTC TCCCCTAGGC GCTGTTCTTG 

CTCCAAAGGG CTCCGTGGAG AGGGACTGGC AGAGCTGAGG CCACCTGGGG CTGGGGATCC 

CACTCTTCTT GCAGCTGTTG AGCGCACCTA ACCACTGGTC ATGCCCCCAC CCCTGCTCTC 

CGCACCCGCT TCCTCCCGAC CCCAGGACCA GGCTACTTCT CCCCTCCTCT TGCCTCCCTC 

CTGCCCCTGC TGCCTCTGAT CGTAGGAATT GAGGAGTGTC CCGCCTTGTG GCTGAGAACT 

GGACAGTGGC AGGGGCTGGA GATGGGTGTG TGTGTGTGTG TGTGTGTGTG TGTGTGTGTG 

CGCGCGCGCC AGTGCAAGAC CGAGATTGAG GGAAAGCATG TCTGCTGGGT GTGACCATGT 
TTCCTCTCAA TAAAGTTCCC CTGTGACACT C 



Seq ID NO: 109 Protein sequence: 
Protein Accession #: NP_006133.1 

1 11 21 31 41 51 

1 I I I.I 1 

MERASLIQKA KLAEQAERYE DMAAFMKGAV EKGEELSCEE RNLLSVAYKN WGGQRAAWR 

VLSSIEQKSN EEGSEEKGPE VREYREKVET ELQGVCDTVL GLLDSHLIKE AGDAESRVFY 

LKMKGDYYRY LAEVATGDDK KRIIDSARSA YQEAMDISKK EMPPTNPIRL GLALNFSVFH 

YEIANSPEEA I SLAKTTFDE AMADLHTLSE DSYKDSTItlM QLLRDNLTLW TADNAGEEGG 
EAPQEPQS 



Seq ID NO: 110 DNA sequence 
Nucleic Acid Accession #: NM_000695 
Coding sequence: 407-1564 

1 11 21 31 41 51 

1 1 1 I I I 

CACGAGTTGG TTTGGGAGCT GCCAGTCTCC TGGGAGGATC GCAGTCAGCA GAGCAGGGCT 
GAGGCCTGGG GGTAGGAGCA GAGCCTGCGC ATCTGGAGGC AGCATGTCCA AGAAAGGGAG 
TGGAGGTGCA GCGAAGGACC CAGGGGCAGA GCCCACGCTG GGGATGGACC CCTTCGAGGA 
CACACTGCGG CGGCTGCGTG AGGCCT TCAA CTGAGGGCGC ACGCGGCCGG CCGAGTTCCG 
GGCTGCGCAG CTCCAGGGCC TGGGCCACTT CCTTCAAGAA AACAAGCAGC TTCTGCGCGA 
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10 



15 



20 



25 



30 



35 



40 



WO 02/086443 

CGTGCTGGCG CAGGACCTGC ATAAGCCAGC TTTCGAGGCA GACATATCTG AGCTCATCCT 3 60 

TTGCCAGAAC GAGGTTGACT ACGCTCT CAA GAACCTTCAG GCCTGGATGA AGGATGAACC 42 0 

ACGGTCCACG AACCTGTTCA TGAAGCTGGA CTCGGTCTTC ATCTGGAAGG AACCCTTTGG 480 

CCTGGTCCTC ATCATCGCAC CCTGGAACTA CCCATTGAAC CTGACCCTGG TGCTCCTGGT 540 

GGGCACCCTC CCCGCAGGGA ATTGCGTGGT GCTGAAGCCG TCAGAAATCA GCCAGGGCAC 600 

AGAGAAGGTC CTGGCTGAGG TGCTGCCCCA GTACCTGGAC CAGAGCTGCT TTGCCGTGGT 660 

GCTGGGCGGA CCCCAGGAGA CAGGGCAGCT GCTAGAGCAC AAGTTGGACT ACATCTTCTT 72 0 

CACAGGGAGC CCTCGTGTGG GCAAGATTGT CATGACTGCT GCCACCAAGC ACCTGACGCC 780 

TGTCACCCTG GAGCTGGGGG GCAAGAACCC CTGCTACGTG GACGACAACT GCGACCCCCA 840 

GACCGTGGCC AACCGCGTGG CCTGGTTCTG CTACTTCAAT GCCGGCCAGA CCTGCGTGGC 900 

CCCTGACTAC GTCCTGTGCA GCCCCGAGAT GCAGGAGAGG CTGCTGCCCG CCCTGCAGAG 960 

CACCATCACC CGTTTCTATG GCGACGACCC CCAGAGCTCC CCAAACCTGG GCCGCATCAT 1020 

CAACCAGAAA CAGTTCCAGC GGCTGCGGGC ATTGCTGGGC TGCGGCCGCG TGGCCATTGG 1080 

GGGCCAGAGC AACGAGAGCG ATCGCTACAT CGCCCCCACG GTGCTGGTGG ACGTGCAGGA 1140 

GACGGAGCCT GTGATGCAGG AGGAGATCTT CGGGCCCATC CTGCCCATCG TGAACGTGCA 12 00 

GAGCGTGGAC GAGGCCATCA AGTT CATCAA CCGGCAGGAG AAGCCCCTGG CCCTGTACGC 12 60 

CTTCTCCAAC AGCAGACAGG TTGTGAACCA GATGCTGGAG CGGACCAGCA GCGGCAGCTT 1320 

TGGAGGCAAT GAGGGCTTCA CCTACATATC TCTGCTGTCC GTGCCATTCG GGGGAGTCGG 1380 

CCACAGTGGG ATGGGCCGGT ACCACGGCAA GTTCACCTTC GACACCTTCT CCCACCACCG 1440 

CACCTGCCTG CTCGCCCCCT CCGGCCTGGA GAAATTAAAG GAGATCCGCT ACCCACCCTA 1500 

TACCGACTGG AACCAGCAGC TGTTACGCTG GGGCATGGGC TCCCAGAGCT GCACCCTCCT 1560 

GTGAGCGTCC CACCCGCCTC CAACGGGTCA CACAGAGAAA CCTGAGTCTA GCCATGAGGG 162 0 

GCTTATGCTC CCAACTCACA TTGTTCCTCC AGACCGCAGG CTCCCCCAGC CTCAGGTTGC 1680 

TGGAGCTGTC ACATGACTGC ATCCTGCCTG CCAGGGCTGC AAAGCAAGGT CTTGCTTCTA 1740 

TCTGGGGGAC GCTGCTCGAG AGAGGCCGAG AGGCCGCAGA ACATGCCAGG TGTCCTCACT 180 0 

CACCCCACCC TCCCCAATTC CAGCCCTTTG CCCTCTCGGT CAGGGTTGGC CAGGCCCAGT 1860 

CACAGGGGCA GTGTCACCCT GGAAAATACA GTGCCCTGCC TTCTTAGGGG CATCAGCCCT 1920 

GAACGGTTGA GAGCGTGGAG CCCTCCAGGC CTTTGCTCTC CCCTCTAGGC ACACGCGCAC 1980 

TTCCACCTCT GCCCCATCCC AACTGCACCA GCACTGCCTC CCCCAGGGAT CCTCTCACAT 2 040 

CCCACACTGG TCTCTGCACC ACCCCTCTGG TTCACACCGC ACCCTGCACT CACCCACAGC 210 0 

AGCTCCATCC ACTGGGAAAA CTGGGGTTTG CATCACTCCA CTGCACAGTG TTAGTGGGAC 2160 

CTGGGGGCAA GTCCCTTGAC TTCTCTGAGC CTCAGTTTCC TTATGTGAAA GTTGCTGGAA 2220 

CCAAAATGGA GTCACTTATG CCAAACTCTA ATAAAATGGA GTCGGGGGGG CACATAGAAG 2280 

CCCTCACACA CACATGCCCG TAACAGGATT TATCACCAAG ACACGCCTGC ATGTAAGACC 2 340 

AGACACAGGG CGTATGGAAA AGCACGTCCT CAAAGACTGT AGTATTCCAG ATGAGCTGCA 2400 

GATGCTTACC TACCACGGCC GTCTCCACCA GAAAACCATC GCCAACTCCT GCGATCAGCT 2460 

TGTGACTTAC AAACCTTGTT TAAAAGCTGC TTACATGGAC TTCTGTCCTT TAAAACGTTC 2520 

CCCTTGGCTG TGGCCCTCTG TGTATGCCTG GGATCCTTCC AAGCACTCAT AGCCCAGATA 2580 
GGAATCCTCT GCTCCTCCCA AATAAATTCA TCTGTTC 
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Seq ID NO: 111 Protein sequences 
Protein Accession #: NP_000686 
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MKDEPRSTNL 
I SQGTEKVLA 
KHLTPVTLEL 
PALQSTITRF 
VDVQETEPVM 
SSGSFGGNEG 
RYPPYTDWNQ 



11 
I 

FMKLDSVFIW 
EVLPQYLDQS 
GGKNPCYVDD 
YGDDPQSSPN 
QEEIFGPILP 
FTYISLLSVP 
QLLRWGMGSQ 



21 

I 

KEPFGLVLII 
CFAWLGGPQ 
NCDPQTVANR 
LGRI INQKQF 
IVNVQSVDEA 
FGGVGHSGMG 
SCTLL 



31 
I 

APWNYPLNLT 
ETGQLLEHKL 
VAWFCYFNAG 
QRLRALLGCG 
IKFINRQEKP 
RYHGKFTFDT 



41 



I 

LVLLVGTLiPA 
DYIFFTGSPR 
QTCVAPDYVL 
RVAIGGQSNE 
LALYAFSNSR 
FSHHRTCLLA 



51 
I 

GNCWLKPSE 
VGKIVMTAAT 
CSPEMQERLL 
SDRYIAPTVL 
QWNQMLERT 
PSGLEKLKEI 



Seq ID NO: 112 DNA sequence 
Nucleic Acid Accession #: NM_004456 
Coding sequence: 58-22 98 
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GAATTCCGGG 
GGCCAGACTG 
GAGTACATGC 
TTT AGTT CCA 
CAGCGAAGGA 
GAGTGTTCGG 
AATGCAGTTG 
GTGGAAGATG 
GATGGT ACT T 
GAATGTGGGT 
AATGATGATG 
GATCTGGAGG 
AAAATTTTGG 
GAAAAATATA 
CCCAACATAG 
CATACGCTTT 
ACACCCAACA 
CCACAGTGTT 
CGGATAAAGA 
AGTAGCAGGC 
AGGGAAGCAG 
GATGAAACTT 
CCAAATATTG 
GTCCTCATTG 
ACAT GTAGAC 
GCTGAGGATG 
CACTGCAGAA 



11 

I 

CGACGCGCGG 
GGAAGAAATC 
GACTGAGACA 
AT CGTCAGAA 
TACAGCCTGT 
TGACCAGTGA 
CTT CAGTACC 
AAACTGTTTT 
TCATTGAAGA 
TTATAAATGA 
ACGATGATGA 
ATCACCGAGA 
AGGCCATTTC 
AAGAACTCAC 
ATGGACCAAA 
TCTGTAGGCG 
CTTATAAGCG 
ACCAGCATTT 
CCCCACCAAA 
CCAGCACCCC 
GGACTGAAAC 
CGAGCTCCTC 
AACCTCCTGA 
GCACTTACTA 
AGGTGTATGA 
TGGATACTCC 
AGATACAGCT 



21 

I 

GAACAACGCG 
TGAGAAGGGA 
GCTCAAGAGG 
AATTTTGGAA 
GCACATCCTG 
CTTGGATTTT 
CATAATGTAT 
ACATAACATT 
ACTAATAAAA 
TGAAATTTTT 
TGATGGAGAC 
TGATAAAGAA 
CTCAATGTTT 
CGAACAGCAG 
TGCTAAATCT 
ATGTTTTAAA 
GAAGAACACA 
GGAGGGAGCA 
ACGTGCAGGA 
CACCATTAAT 
GGGGGGAGAG 
TGAAGCAAAT 
GAATGTGGAG 
TGACAATTTC 
GTTTAGAGTC 
TCCAAGGAAA 
GAAAAAGGAC 



31 

I 

AGTCGGCGCG 
CCAGTTTGTT 
TTCAGACGAG 
AGAACGGAAA 
ACTTCTGTGA 
CCAACACAAG 
TCTTGGTCTC 
CCTTATATGG 
AATTATGATG 
GTGGAGTTGG 
GATCCTGAAG 
AGCCGCCCAC 
CCAGATAAGG 
CTCCCAGGCG 
GTTCAGAGAG 
TATGACTGCT 
GAAACAGCTC 
AAGGAGTTTG 
GGCCGCAGAA 
GTGCTGGAAT 
AACAATGATA 
TCTCGGTGTC 
TGGAGTGGTG 
TGTGCCATTG 
AAAGAAT CTA 
AAGAAGAGGA 
GGCTCCTCTA 



41 
i 

CGGGACGAAG 
GGCGGAAGCG 
CTGATGAAGT 
TCTTAAACCA 
GCTCATTGCG 
TCATCCCATT 
CCCTACAGCA 
GAGATGAAGT 
GGAAAGTACA 
TGAATGCCCT 
AAAGAGAAGA 
CTCGGAAATT 
GCACAGCAGA 
CACTTCCTCC 
AGCAAAGCTT 
TCCTACATCC 
TAGACAACAA 
CTGCTGCTCT 
GAGGACGGCT 
CAAAGGATAC 
AAGAAGAAGA 
AAACACCAAT 
CTGAAGCCTC 
CTAGGTTAAT 
GCAT CATAGC 
AACACCGGTT 
ACCATGTTTA 



51 
I 

AATAATCATG 
TGTAAAATCA 
AAAGAGTATG 
AGAATGGAAA 
CGGGACTAGG 
AAAGACTCTG 
GAATTTTATG 
TTTAGATCAG 
CGGGGATAGA 
TGGTCAATAT 
AAAGCAGAAA 
TCCTTCTGAT 
AGAACTAAAG 
TGAATGTACC 
ACACTCCTTT 
TTTTCATGCA 
ACCTTGTGGA 
CACCGCTGAG 
TCCCAATAAC 
AGACAGTGAT 
AGAGAAGAAA 
AAAGATGAAG 
AATGT T TAG A 
TGGG AC C AAA 
TCCAGCTCCC 
GTGGGCTGCA 
CAACT AT CAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
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CCCTGTGATC ATCCACGGCA GCCTTGTGAC AGTTCGTGCC CTTGTGTGAT AGCACAAAAT 1680 

TTTTGTGAAA AGTTTTGTCA ATGTAGTTCA GAGTGT CAAA ACCGCTTTCC GGGATGCCGC 1740 

TGCAAAGCAC AGTGCAACAC CAAGCAGTGC CCGTGCTACC TGGCTGTCCG AGAGTGTGAC 180 0 

CCTGACCTCT GTCTTACTTG TGGAGCCGCT GACCATTGGG ACAGTAAAAA TGTGTCCTGC 1860 

AAGAACTGCA GTATTCAGCG GGGCTCCAAA AAGCATCTAT TGCTGGCACC ATCTGACGTG 192 0 

GCAGGCTGGG GGATTTTTAT CAAAGATCCT GTGCAGAAAA ATGAATTCAT CTCAGAATAC 1980 

TGTGGAGAGA TTATTTCTCA AGATGAAGCT GACAGAAGAG GGAAAGTGTA TGATAAATAC 2 040 

ATGTGCAGCT TTCTGTTCAA CTTGAACAAT GATTTTGTGG TGGATGCAAC CCGCAAGGGT 2100 

AACAAAATTC GTTTTGCAAA TCATTCGGTA AATCCAAACT GCTATGCAAA AGTTATGATG 2160 

GTTAACGGTG ATCACAGGAT AGGTATTTTT GCCAAGAGAG CCATCCAGAC TGGCGAAGAG 2220 

CTGTTTGTTG ATTACAGATA CAGCCAGGCT GATGCCCTGA AGTATGTCGG CATCGAAAGA 22 80 

GAAATGGAAA TCCCTTGACA TCTGCTACCT CCTCCCCCTC CTCTGAAACA GCTGCCTTAG 2340 

CTTCAGGAAC CTCGAGTACT GTGGGCAATT TAGAAAAAGA ACATGCAGTT TGAAATTCTG 240 0 

AATTTGCAAA GTACTGTAAG AATAATTTAT AGTAATGAGT TTAAAAATCA ACTTTTTATT 2460 

GCCTTCTCAC CAGCTGCAAA GTGTTTTGTA CCAGTGAATT TTTGCAATAA TGCAGTATGG 2520 
TACATTTTTC AACTT TGAAT AAAGAATACT TGAACTTGAA AAAAAAAAAA AAAAAA 
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20 



Seq ID NO: 113 Protein sequence: 
Protein Accession #: NP_004447 
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MGQTGKKSEK 
KQRRIQPVHI 
MVEDETVLHN 
YNDDDDDDDG 
KEKYKELTEQ 
ATPNTYKRKN 
NSSRPSTPTI 
KPNIEPPENV 
PAEDVDT P PR 
NFCEKFCQCS 
CKNCSIQRGS 
YMCSFLFNLN 
ELFVDYRYSQ 



GPVCWRKRVK 
LTSVSSLRGT 
I PYMGDEVLD 
DDPEEREEKQ 
QLPGALPPEC 
TETALDNKPC 
NVLESKDTDS 
EWSGAEASMF 
KKKRKHRLWA 
SECQNRFPGC 
KKHLLLAPSD 
NDFWDATRK 
ADALKYVGIE 



SEYMRLRQLK 
RECSVTSDLD 
QDGTFIEELI 
KDLEDHRDDK 
TPNIDGPNAK 
GPQCYQHLEG 
DREAGTETGG 
RVLIGTYYDN 
AHCRKIQLKK 
RCKAQCNTKQ 
VAGWGIFIKD 
GNKIRFANHS 
REMEIP 



RFRRADEVKS 
FPTQVIPLKT 
KNYDGKVHGD 
ESRPPRKFPS 
SVQREQSLHS 
AKE FAAALTA 
ENNDKEEEEK 
FCAIARLIGT 
DGSSNHVYNY 
CPCYLAVREC 
PVQKNEFISE 
VNPNCYAKVM 



MFSSNRQKIL 
LNAVASVPIM 
RECGFINDEI 
DKILEAISSM 
FHTLFCRRCF 
ERIKTPPKRP 
KDETSSSSEA 
KTCRQVYEFR 
QPCDHPRQPC 
DPDLCLTCGA 
YCGEIISQDE 
MVNGDHRIGI 



ERTEILNQEW 
YSWSPLQQNF 
FVELVNALGQ 
FPDKGTAEEL 
KYDCFLHPFH 
GGRRRGRLPN 
NSRCQTPIKM 
VKESSIIAPA 
DSSCPCVIAQ 
ADHWDSKNVS 
ADRRGKVYDK 
FAKRAIQTGE 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



40 
45 
50 
55 



Seq ID NO: 114 DNA sequence 
Nucleic Acid Accession #: NM_0 0182 7 
Coding sequence: 96-335 



AGTCTCCGGC 
CGCTCTCGTT 
CGGACAAGTA 
CCAAACAAGT 
AACAGAGTCT 
TTAGACGACC 
TTTTCAAATT 
ACAAATCTTT 
AAATGCAACT 
TTTCTCTTAA 
TATGTTGCAT 



11 
I 

GAGTTGTTGC 
TCATTTTCTG 
CTTCGACGAA 
ACCTAAAACT 
AGGCTGGGTT 
TCTTCCAAAA 
TAATGTATAT 
CATCCATACC 
GCAAGTAGGT 
GTGCCTGTTT 
TTAAAAAAAA 



21 
I 

CTGGGCTGGA 
CAGCGCGCCA 
CACTACGAGT 
CATCTGATGT 
CAT TACATGA 
GATCAACAAA 
GTGTATATAA 
TGTGCATGAG 
TACTGTAAGA 
GAGTTTACTG 
AAAAAAA 



31 
I 

CGTGGTTTTG 
CGAGGATGGC 
ACCGGCATGT 
CTGAAGAGGA 
TTCATGAGCC 
AATGAAGTTT 
GGTAGTATTC 
CTGTATTCTT 
TGTTTAAGAT 
AAACAGTTTA 



41 
I 

TCTGCTGCGC 
CCACAAGCAG 
TATGTTACCC 
GTGGAGGAGA 
AGAACCACAT 
ATCTGGGGAT 
AGTGAATACT 
CACAGCAACA 
AAAAGTTCTT 
CTTTTGTTCA 



51 

I 

CCGCTCTTCG 
ATCTACTACT 
AGAGAACTTT 
CTTGGTGTCC 
ATTCTTCTCT 
CGTCAAAT CT 
TGAGAAATGT 
GAGCTCAGTT 
CCAGTCAGTT 
ATAAAGTTTG 



60 
120 
180 
240 
300 
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420 
480 
540 
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Seq ID NO: 115 Protein sequence: 
Protein Accession #: NP_001818 



1 11 21 31 41 51 

60 | j | t I I 

MAHKQIYYSD KYFDEHYEYR HVMLPRELSK QVPKTHLMSE EEWRRLGVQQ SLGWVHYMIH 
EPEPHILLFR RPLPKDQQK 



60 



65 
70 
75 
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Seq ID NO: 116 DNA sequence 

Nucleic Acid Accession #: CAT cluster 



TCAGACCTCA 
GCATCTGGAC 
AGAGGTGTGT 
GCAGCCAACA 
AAGGACTGAT 
CCTTGGAAGG 
TCAAGAATTC 
AGTGTGGGAG 
CCCATCTCTA 
CCTGTAGTTC 
AGGCTGCAAT 
CCTGTCTCAA 
TTTGAGGTGC 
CCTGAAGGAG 
GACAGACCTT 
CTCTTCCCCC 
CCGCCTCCCA 
CAGCCGGATA 
TCCTGCTCCG 



11 

1 

TGAGTCACTT 
CCTTGGTGCT 
TCCAGGGAAA 
GAGTTCAAAA 
CCACATTCCC 
ACCTGGCTCA 
TTTGCTGAGC 
GATCTCTTGA 
AAATAATAAT 
CAGCTACCCA 
GAACTGTGAT 
ATAATAATAA 
CATTTGGGTA 
CAGAGGGATG 
GTCCTTCTTC 
TCCCTGTCCC 
TGTCTGCTGT 
CAGAGTGAAT 
TGTAAAGAGG 



21 

I 

GGACTCTTGA 
ATCGACGAAG 
GCCCCTATCT 
TGCAGGCTTG 
ACCAGGAAGT 
GGCTGGACCA 
ATGGTGCCTC 
GCCCAGGAGT 
AATAATAAAA 
GGAGGCTGAG 
TACCCCACTG 
TAATAATAAT 
GAAAGAAAAG 
CATCGCTGGA 
CTTGTGGAAA 
AGGGAACCAA 
GCCTTTGTAC 
AGTTAACCAC 
CCAGTGTTTG 



31 
I 

GCCACCTCTG 
CTTGGGTGGG 
CTCTGCAGAG 
GAAAGTACAG 
TTAGCAGAAC 
CCTCTTGAGA 
ATGCCTATAA 
TCAAGACTAG 
TAAAAAATTA 
GCAAGAGGAT 
CACTCCAGCC 
CTTATTTTGG 
ACGTTTACAC 
GGTGACCTAC 
GTGTTTCCTC 
AGGGCTTTCT 
TCAGCAATTC 
ACTTAGGTCA 
TGTGTTGCAA 



41 

1 

GGGGTGGAGT 
GCTCTTAGCT 
GTCAAGTGAA 
GGGGCTCTGT 
CCCCGCGTGC 
GGGAGGAGCT 
TACCAACACT 
CCTGGGCAAC 
GCAGGGCATG 
GGCTGGAGCC 
TGGGCAAAAG 
AGAATAAAGA 
CGAGAAATAG 
AGTTGAAGAA 
TGCTGCTACT 
ACCACACCCT 
TTGTTTGCTC 
AATAGGATCT 
GCAGCCTTGG 



51 
I 

CTCTCTCCTG 
GCTATGTGCA 
AGCGACGGCC 
GGAGGATGGG 
CAACTGGACC 
CTGGATTTGA 
TTGGGAGGCC 
ACAGAGAGAA 
GTGGCATGTG 
TGGGATGTTG 
AGCGAGAGAA 
GACCTCTGGA 
TCTGTGTTGC 
GACTCATTAT 
GCTCATGAGA 
TTCTTGCCCC 
CATTATCTTC 
AAATTTTTGT 
AATAGTAACT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



232 



10 



WO 02/086443 

CTTCTCATTT GTTTGGGATC TGGCCACCAA GTTCCAGAAT GATACACGGA TCAGTGCAGA 1200 

AGTTCATCAG GCTCTCGGAC CTTAGGGCTG TTGGAGAAGG CTTCAGCAGC AGAACTGATG 12 60 

GTGAAGGCTC GTGTT CTCCA TCCTCAACTT TCTTTGCTTC GATCATACAC AAGAATACAT 1320 

TTGGAAGGGC AAAAAATGAA CACTGTCGTT CATTGCAGCC GTGTTTTGTG ACACAGATGC 13 80 

AC AGT CTGCT GTGAAGACCT TCTCTCAAGT GGCATTTGGG AGTCCATGCC AGATCATGGT 1440 

GCTTCATGAG AGACTGACAG CTATCAGGGG TTGTGGCACT TAGTGAGGAC TCTCCTCCCC 150 0 

CAGTGTGTGC TGATGACACA TACACACCTG ACAATAGCTT GAGTCTTCTC TGTTCCTTTT 1560 

ACTCTGTAGC CAACATACAC ATGATTTAAA ACCCTTTCTA AATATCTATC ATGGTTCATC 162 0 

CTTGTCCAAA TGCAGAGTCA GAGCTATTTG TACTTCATTA TTATTTCCAA GGCGAATAGT 1680 

TGGCTTTCTT TTTGCAAAAA TAATTAAAGT TTTTGTATGT TGCAAAAAAA AAAAAAAAAA 1740 
AAACAAAAAA 



PCT/US02/12476 



Seq ID NO: 117 DNA sequence 
Nucleic Acid Accession #: BC012178.1 
15 Coding sequence: 204-2285 
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CTTCTCTCCC 
TGGCGGCTGG 
GCCCGCGGCG 
CCGCGGCTCC 
CTGGAGGAGA 
CTGGTGCTCA 
AAATTTTCCC 
TCATCTCTGG 
TATTCACTAT 
TATTTGGAGG 
TGGATAATAC 
ATGGAGATAG 
TAGTAGCAGG 
TTGGCCTTAC 
GCAGTGGAAC 
GAGTAGGCAC 
CAGCTTTGCT 
GCTTTATGAG 
AGGT CAAAGT 
CAGATGAAGA 
CTGAAGAGAA 
GAGAAATGAA 
TAATTGAAAG 
ATGACACAGA 
ATTTTCATAA 
TTTCCAGGCA 
CTTATATTTG 
CTGCAAGTGT 
AGGATCAGGA 
CAATTAAAAC 
CCAGTAAAGA 
TGTGTCACAA 
CAGATGTTAC 
TTGAGGCCCA 
TGATTTTGAC 
GAT CTGTGGT 
CTGGCAATGA 
CTGGTATTTC 
AGTAATAAAC 



11 

I 

GCGGCGCTGG 
CTCCTCTCCG 
CCGACCCTTC 
GGCCCTGGCC 
CCTTAAGGAT 
GTACGGGAAA 
CTTGGAAACA 
AGGACCTAAT 
TGGCAAGCCT 
TACTGTGCAC 
ATGTTCATTA 
TGTAGACAAA 
CAT AG CAAAT 
AGAAAATGGA 
CTTCACCGTG 
GTCAAAAGTT 
AAATCGTGCT 
AAAACGAGAA 
GATAAATGCT 
TAGAACCCCA 
AAGAAAAATC 
CTTGAAACCA 
TGCATCCCTT 
GCTCAT CAGA 
AGATGAAGTG 
TCCATTTCCA 
TAAGGACTTT 
TAAAAAGCCA 
GAAGCTGATG 
TGTAGGTGTG 
TGAACCTGAC 
CGTTAACAGA 
TCCCACTTTC 
TAACATTCTC 
ACCATTACAT 
TATTCGAACC 
GAT CCCTGTA 
TCGAATTATG 
TTCTTGTTCT 



21 
I 

GGCCCGCGCT 
CTGCCGGCTG 
CGGCACCCTC 
CCGATGGCTC 
GGCCACCACC 
GT CAT AGACC 
CCAGCATTTG 
TCTGTGTATG 
GTTCTTGGAA 
AAAAAAAGTG 
TTCAGGGGCC 
GTAGCTGATG 
GAATCTAAAA 
AAAGTAATAC 
CAGAACAGAG 
TTGGTTTTAC 
TTGAACCAAG 
AGCCAGTCTG 
GCTCATTCTT 
CGGAAAAGAA 
ATTGGGGATA 
GAGGAGGTTT 
GTTGCAAGTG 
AAGTTGAGAG 
AGAATTTTGG 
GGTCCTGGCC 
CCTGAAACCA 
CATACCCTAT 
CAAATTACCA 
CAGGGTGACT 
TGGGAATCAC 
GTTGTTTATA 
TTGACAACAG 
AGGGAGTCTG 
TTTGATCGGG 
TTTATTACTA 
GAGGTGGTAT 
TATGACTTAA 
ATTAAAA 



31 
I 

CCGCTGCTGT 
CTCCTCGACC 
CCGCCCCGTC 
TGTGCAACGG 
ACTATGAAGG 
GAAGAGTGAG 
CTATAAAGGA 
CTGAAGATGC 
TTTGCTATGG 
TCAGAGAAGA 
TTCAGAAGGA 
GATTCAAGGT 
AGTTATATGG 
TGAAGAATTT 
AACTTGAGTG 
TCAGTGGTGG 
AACAAGTCAT 
TTGAAGAGGC 
TCTACAATGG 
TTAGCAAAAC 
CTTTTGTTAA 
TCCTTGCCCA 
GCAAAGCTGA 
AGGAGGGAAA 
GCAGAGAACT 
TGGCAATCAG 
ACAATATTTT 
TACAGAGAGT 
GTCTGCATTC 
GTCGTTCCTA 
TTATTTTTCT 
TATTTGGCCC 
GGGTGCTCAG 
GGTATGCTGG 
ACCCACTTCA 
GTGACTT CAT 
TAAAGATGGT 
CATCAAAGCC 



41 

I 

TGCTCCATTC 
AGGCCTCCTT 
TCGTACTGTC 
AGACT CCAAG 
AGCTGTTGTC 
GGAACTGTTC 
ACAAGGATTC 
TCCCTGGTTT 
TATGCAGATG 
TGGAGTTTTC 
AGAAGTTGTT 
TGTGGCACGT 
AGCACAGTTC 
CCTTTATGAT 
TATTCGAGAG 
AGTAGACTCA 
TGCTGTGCAC 
CCTCAAAAAG 
AACAACAACC 
GTTAAATATG 
GATTGCCAAT 
AGGTACTTTA 
ACTCATCAAA 
AGTAATAGAA 
TGGACTTCCA 
AGTAATATGT 
GAAAATAGTA 
CAAAGCCTGC 
ACTGAATGCC 
CAGTTACGTG 
GGCTAGGCTT 
ACCAGTTAAA 
TACTTTACGC 
GAAAATCAGC 
AAAGCAGCCT 
GACTGGTATA 
CACTGAGATT 
CCCAGGAACT 



51 
I 

GGCGCTTTTC 
CTCAACCTCA 
GCCGTCACCG 
CTGGAGAATG 
ATTCTGGATG 
GTGCAGTCTG 
CGTGCTATTA 
GAT CCAGCAA 
ATGAATAAGG 
AACATTAGTG 
TTGCTTACAC 
TCTGGAAACA 
CACCCTGAAG 
ATAGCTGGAT 
ATCAAAGAGA 
ACAGTTTGTA 
ATTGATAATG 
CTTGGAATTC 
CTACCAATAT 
ACCACAAGTC 
GAAGTAATTG 
CGGCCTGATC 
ACCCATCACA 
CCTCTGAAAG 
GAAGAGTTAG 
GCTGAAGAAC 
GCTGATTTTT 
ACAACAGAAG 
TTCTTGCTGC 
TGTGGAATCT 
ATACCTCGCA 
GAACCTCCTA 
CAAGCTGATT 
CAGATGCCGG 
TCATGCCAGA 
CCTGCAACAC 
AAGAAGATTC 
ACTGAGTGGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
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60 



Seq ID NOi 118 Protein sequence: 
Protein Accession #: AAH12178.1 



65 



70 



75 



MALCNGDSKL 
APAIKEQGFR 
KSVREDGVFN 
SKKLYGAQFH 
VLLSGGVDST 
HSFYNGTTTL 
EVFLAQGTLR 
ILGRELGLPE 
TLLQRVKACT 
ESLIFLARLI 
ESGYAGKISQ 
WLKMVTEIK 



11 
I 

ENAGGDLKDG 
AIIISGGPNS 
ISVDNTCSLF 
PEVGLTENGK 
VCTALLNRAL 
PISDEDRTPR 
PDLIESASLV 
ELVSRHPFPG 
TEEDQEKLMQ 
PRMCHNVNRV 
MPVI LTPLHF 
KIPGISRIMY 



21 
I 

HHHYEGAWI 
VYAEDAPWFD 
RGLQKEEWL 
VILKNFIjYDI 
NQEQVIAVHI 
KRISKTLNMT 
ASGKAEJjIKT 
PGLAIRVICA 
ITSLHSIjNAF 
VYIFGPPVKE 
DRDPLQKQPS 
DLTSKPPGTT 



31 
I 

LDAGAQYGKV 
PAIFTIGKPV 
LTHGDSVDKV 
AGCSGTFTVQ 
DNGFMRKRES 
TSPEEKRKII 
HHNDTELIRK 
EEPYICKDFP 
LIjPIKTVGVQ 
PPTDVTPTFIj 
CQRSWIRTF 
EWE 



41 
i 

IDRRVRELFV 
LGI CYGMQMM 
ADGFKWARS 
NRELECIREI 
QSVEEALKKL 
GDTFVKIANE 
LREEGKVIEP 
ETNNILKIVA 
GDCRSYSYVC 
TTGVLSTLRQ 
ITSDFMTGIP 



51 
I 

QSEIFPLETP 
NKVFGGTVHK 
GNIVAGIANE 
KERVGTSKVL 
GIQVKVINAA 
VIGEMNLKPE 
LKDFHKDEVR 
DFSASVKKPH 
GISSKDEPDW 
ADFEAHNILR 
ATPGNEIPVE 



60 
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180 
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Seq ID NO: 119 DNA sequence 

Nucleic Acid Accession #: NM_006500.1 

Coding sequence: 27.. 1967 



11 



21 



31 



41 
I 



51 



ACTTGCGTCT CGCCCTCCGG CCAAGCATGG GGCTTCCCAG GCTGGTCTGC GCCTTCTTGC 60 

TCGCCGCCTG CTGCTGCTGT CCTCGCGTCG CGGGTGTGCC CGGAGAGGCT GAGCAG CCTG 120 

CGCCTGAGCT GGTGGAGGTG GAAGTGGGCA GCACAGCCCT TCTGAAGTGC GGCCTCTCCC 180 

AGTCCCAAGG CAACCTCAGC CATGTCGACT GGTTTTCTGT CCACAAGGAG AAGCGGACGC 240 



233 



WO 02/086443 

TCATCTTCCG TGTGCGCCAG GGCCAGGGCC AGAGCGAACC TGGGGAGTAC GAGCAGCGGC 3 00 

TCAGCCTCCA GGACAGAGGG GCTACTCTGG CCCTGACTCA AGTCACCCCC CAAGACGAGC 360 

GCATCTTCTT GTGCCAGGGC AAGCGCCCTC GGT CCCAGGA GTACCGCATC CAGCTCCGCG 420 

TCTACAAAGC TCCGGAGGAG CCAAACATCC AGGTCAACCC CCTGGGCATC CCTGTGAACA 480 

GTAAGGAGCC TGAGGAGGTC GCTACCTGTG TAGGGAGGAA CGGGTACCCC ATTCCTCAAG 540 

TCATCTGGTA CAAGAATGGC CGGCCTCTGA AGGAGGAGAA GAACCGGGTC CACATTCAGT 600 

CGTCCCAGAC TGTGGAGTCG AGTGGTTTGT ACACCTTGCA GAGTATTCTG AAGGCACAGC 660 

TGGTTAAAGA AGACAAAGAT GCCCAGTTTT ACTGTGAGCT CAACTACCGG CTGCCCAGTG 720 

GGAACCACAT GAAGGAGTCC AGGGAAGTCA CCGTCCCTGT TTTCTACCCG ACAGAAAAAG 780 

TGTGGCTGGA AGTGGAGCCC GTGGGAATGC TGAAGGAAGG GGACCGCGTG GAAAT CAGGT 840 

GTTTGGCTGA TGGCAACCCT CCACCACACT T CAG CATCAG CAAGCAGAAC CCCAGCACCA 900 

GGGAGGCAGA GGAAGAGACA ACCAACGACA ACGGGGTCCT GGTGCTGGAG CCTGCCCGGA 960 

AGGAACACAG TGGGCGCTAT GAATGTCAGG CCTGGAACTT GGACACCATG ATATCGCTGC 102 0 

TGAGTGAACC ACAGGAACTA CTGGTGAACT ATGTGTCTGA CGTCCGAGTG AGTCCCGCAG 1080 

CCCCTGAGAG ACAGGAAGGC AGCAGCCTCA CCCTGACCTG TGAGGCAGAG AGTAGCCAGG 1140 

ACCTCGAGTT CCAGTGGCTG AGAGAAGAGA CAGACCAGGT GCTGGAAAGG GGGCCTGTGC 1200 

TTCAGTTGCA TGACCTGAAA CGGGAGGCAG GAGGCGGCTA TCGCTGCGTG GCGTCTGTGC 1260 

CCAGCATACC CGGCCTGAAC CGCACACAGC TGGTCAAGCT GGCCATTTTT GGCCCCCCTT 132 0 

GGATGGCATT CAAGGAGAGG AAGGTGTGGG TGAAAGAGAA TATGGTGTTG AATCTGTCTT 13 80 

GTGAAGCGTC AGGGCACCCC CGGCCCACCA TCTCCTGGAA CGTCAACGGC ACGGCAAGTG 1440 

AACAAGACCA AGAT CCA CAG CGAGTCCTGA GCACCCTGAA TGTCCTCGTG ACCCCGGAGC 1500 

TGTTGGAGAC AGGTGTTGAA TGCACGGCCT CCAACGACCT GGGCAAAAAC ACCAGCATCC 1560 

TCTTCCTGGA GCTGGTCAAT TTAACCACCC T CACACCAG A CTCCAACACA ACCACTGGCC 162 0 

TCAGCACTTC CACTGCCAGT CCTCATACCA GAGCCAACAG CACCTCCACA GAGAGAAAGC 1680 

TGCCGGAGCC GGAGAGCCGG GGCGTGGTCA TCGTGGCTGT GATTGTGTGC AT CCTGGTCC 1740. 

TGGCGGTGCT GGGCGCTGTC CTCTATTTCC TCTATAAGAA GGGCAAGCTG CCGTGCAGGC 1800 

GCTCAGGGAA GCAGGAGATC ACGCTGCCCC CGTCTCGTAA GACCGAACTT GTAGTTGAAG 1860 

TTAAGTCAGA TAAGCTCCCA GAAGAGATGG GCCTCCTGCA GGGCAGCAGC GGTGACAAGA 1920 

GGGCTCCGGG AGACCAGGGA GAGAAATACA TCGATCTGAG GCATTAGCCC CGAATCACTT 1980 

CAGCTCCCTT CCCTGCCTGG ACCATTCCCA GCTCCCTGCT CACTCTTCTC TCAGCCAAAG 2040 

CCTCCAAAGG GACTAGAGAG AAGCCTCCTG CTCCCCTCAC CTGCACACCC CCTTTCAGAG 2100 

GGCCACTGGG TTAGGACCTG AGGACCTCAC TTGGCCCTGC AAGCCGCTTT TCAGGGACCA 2160 

GTCCACCACC ATCTCCTCCA CGTTGAGTGA AGCTCATCCC AAGCAAGGAG CCCCAGTCTC 2220 

CCGAGCGGGT AGGAGAGTTT CTTGCAGAAC GTGTTTTTTC TTTACACACA TTATGGCTGT 2280 

AAATACCTGG CTCCTGCCAG CAGCTGAGCT GGGTAGCCTC TCTGAGCTGG TTTCCTGCCC 2340 

CAAAGGCTGG CTTCCACCAT CCAGGTGCAC CACTGAAGTG AGGACACACC GGAGCCAGGC 2400 

GCCTGCTCAT GTTGAAGTGC GCTGTTCACA CCCGCTCCGG AGAGCACCCC AGCGGCATCC 2460 

AGAAGCAGCT GCAGTGTTGC TGCCACCACC CTCCTGCTCG CCTCTTCAAA GTCTCCTGTG 252 0 

ACATTTTTTC TTTGGTCAGA AGCCAGGAAC TGGTGTCATT CCTTAAAAGA TACGTGCCGG 2 580 

GGCCAGGTGT GGTGGCTCAC GCCTGTAATC CCAGCACTTT GGGAGGCCGA GGCGGGCGGA 2 640 

TCACAAAGTC AGGACGAGAC CATCCTGGCT AACACGGTGA AACCCTGTCT CTACTAAAAA 2 700 

TACAAAAAAA AAT TAGCTAG GCGTAGTGGT TGGCACCTAT AGTCCCAGCT ACTCGGAAGG 2 760 

CTGAAGCAGG AGAATGGTAT GAAT CCAGGA GGTGGAGCTT GCAGTGAGCC GAGACCGTGC 2 82 0 

CACTGCACTC CAGCCTGGGC AACACAGCGA GACTCCGTCT CGAGGAAAAA AAAAGAAAAG 2880 

ACGCGTACCT GCGGTGAGGA AGCTGGGCGC TGTTTTCGAG TTCAGGTGAA TTAGCCTCAA 2 940 

TCCCCGTGTT CACTTGCTCC CATAGCCCTC TTGATGGATC ACGTAAAACT GAAAGGCAGC 3000 

GGGGAGCAGA CAAAGATGAG GTCTACACTG TCCTTCATGG GGATTAAAGC TATGGTTATA 3 0 60 

T TAG CACCAA ACTTCTACAA ACCAAGCTCA GGGCCCCAAC CCTAGAAGGG CCCAAATGAG 312 0 

AGAATGGTAC TTAGGGATGG AAAACGGGGC CTGGCTAGAG CTTCGGGTGT GTGTGTCTGT 3180 

CTGTGTGTAT G CAT AC AT AT GTGTGTATAT ATGGTTTTGT CAGGTGTGTA AATTTGCAAA 3240 

TTGTTTCCTT TATATATGTA TGTATATATA TATATGAAAA TATATATATA TATGAAAAAT 33 0 0 

AAAGCTTAAT TGT CCCAGAA AAT CATACAT TGCTTTTTTA TTCTACATGG GTACCACAGG 33 60 

AACCTGGGGG CCTGTGAAAC TACAACCAAA AGGCACACAA AACCGTTTCC AGTTGGCAGC 3420 

AGAGATCAGG GGTTACCTCT GCTTCTGAGC AAATGGCTCA AGCTCTACCA GAGCAGACAG 3480 

CTACCCTACT TTTCAGCAGC AAAACGTCCC GTATGACGCA GCACGAAGGG CCTGGCAGGC 3 540 
TGTTAGCAGG AGCTATGTCC CTTCCTATCG TTTCCGTCCA CTT 



Seq ID NO: 120 Protein sequence: 
Protein Accession #: NP_006491.1 



1 11 21 31 41 51 

I I I I I I 

MGLPRLVCAF LLAACCCCPR VAGVPGEAEQ PAPELVEVEV GSTALLKCGL SQSQGNLSHV 60 

DWFSVHKEKR TLIFRVRQGQ GQSEPGEYEQ RLSLQDRGAT LALTQVTPQD ERIFLCQGKR 120 

PRSQEYRIQL RVYKAPEEPN IQVNPLGIPV NSKEPEEVAT CVGRNGYPIP QVIWYKNGRP 180 

LKEEKNRVHI QSSQTVESSG LYTLQSILKA QLVKEDKDAQ FYCELNYRLP SGNHMKESRE 240 

VTVPVFYPTE KVWLEVEPVG MLKEGDRVE I RCLADGNPPP HFSISKQNPS TREAEEETTN 300 

DNGVLVLEPA RKEHSGRYEC QAWNLDTMIS LLSEPQELLV NYVSDVRVSP AAPERQEGSS 3 60 

LTLTCEAESS QDLEFQWLRE ETDQVLERGP VLQLHDLKRE AGGGYRCVAS VPSIPGLNRT 42 0 

QLVKLAIFGP PWMAFKERKV WVKENMVLNL SCEASGHPRP TISWNVNGTA SEQDQDPQRV 480 

LSTLNVLVTP ELLETGVECT ASNDLGKNTS ILFLELVNLT TLTPDSNTTT GLSTSTASPH 540 

TRANSTSTER KLPEPESRGV VIVAVIVCIL VLAVLGAVLY FLYKKGKLPC RRSGKQEITL 600 
PPSRKTELW EVKSDKLPEE MGLLQGSSGD KRAPGDQGEK YIDLRH 

Seq ID NO: 121 DNA sequence 
Nucleic Acid Accession #: NM_018306 
Coding sequence: 60-671 

1 ll 21 31 41 51 

I I I I I I 

ATAGTCTACA CAGAGCTCCC CTTGCTGCCC AGACAAGCTG AAGGACCACA GGAAAAGCCA 60 
TGGAGACTTC AGCATCCTCC TCCCAGCCTC AGGACAACAG TCAAGTCCAC AGAGAAACAG 12 0 
AAGATGTAGA C T ATGG AG AG ACAGATTTCC ACAAGCAAGA CGGGAAGGCT GGACTCTTTT 180 
CCCAAGAACA ATATGAGAGA AACAAGTCTT CTTCCTCCTC CTTCTCTTCC TCCTCATCCT 240 
CCTCATCTTC TTCATCCTCC TCCTCCTCAG GTCCTGGGCA TGGGGAGCCT GACGTTTTGA 300 



234 
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AGGATGAGCT TCAACTCTAT GGAGATGCTC 
GACTCCGAAG GAGAGG CTCT GACCCAGCAA 
GACTGAATAT AAAGAAAGAT GATGAGT TTT 
GGGCCTTGCT GGTGTGTTAT CACTATTACG 
TGCTCACCTT CGCCTCCCTG GAAACCGTTG 
ACAGCGTCCT CCAAGGCTTC ATCCCCCTCT 
AGACTGACTG AGGCCACTTC CAGGTGGGCA 
GCGACCCCTG AGCCCACAAG GGCAGAGCAG 
AGACCAATAA ACAGAACACT TTTCCTTCCA 
AGGGGCATCT CATTTGGGCA GTACTGCTGT 
GGTGGGTGTG GGGCCTGAGG CTTCACAGTA 
ACTGCTCTCA GAGGACAGCA AGGGACCCTG 
ATGGTTTTTC TCAAATCCCA GGCTATCTGC 
GGGAGAGATG GATGGTCCAC TGCTTTGAGG 
GAACCTGATG CAGGTAAGAT GCTGAGGACT 
GGCAGGAAAA TGATCAT CAG AAACTAAATG 
TCCTCGCACT TTGGGAGGCT CAGGCTAAGG 
CTGGGCAACA TAGTGAGACC CCCATCTCTA 
TACATACCTG TACATACCTG CGGTTCCAGC 
TGAGCCCAGG AGTTCAGGGC TGCAGTGAGG 
CGACAGAGCA AGATCGTTTC TCTAAAATT 



Seq ID NO: 122 Protein sequence: 
Protein Accession #: NP_060776 

1 11 



CTGGAGAGGT 
GTGGAGAAGT 
TCCATTTCGT 
CAGACTGGTT 
GCATCTACTT 
TCCAGAAGTT 
GCAGAGGCAG 
CATTCTGAGA 
TGTGGTCTGA 
GCAACCCAGC 
CCTGGACCAG 
AGCTCTGCAA 
ATGCGCTCTC 
CAGGGAGCCA 
AAAACCATTT 
GCAGCCAGGC 
GTCGCTTGAA 
CAATTTTTTT 
TACTCAAGAG 
TACGATCAAG 



GGTACCCTCT 
GGAGGCCTCT 
CCTCCTGTGC 
CATGTCTCTT 
CGGACTAGTG 
TAGGCTGACA 
GCCCCAGTGT 
GACGCACAGG 
ATGTTGGCAC 
TGCAAGGATG 
CAGGAAGATT 
GCTGTGATCT 
AGGTGCTACC 
TCGGGCTGGG 
TTTTTGCACC 
ATGGGGGCTC 
GCTGAGAGTT 
TTAATGACCA 
GCTGAGGCAG 
CCACTGCACT 



GGGGAATCAG 
CAGTTAAGAA 
TTTGCCATCG 
GGGGTCGGCC 
TACCGTATCC 
GGGTTCAGGA 
GACCACCACT 
AGACCAAGCC 
CAGCCCGGGC 
GAAGGCAGAG 
CTGGGAGGTC 
GTCTGGGTTC 
GAGCCATCCT 
GCCCCTTGGT 
CAAAAAAAAA 
ACGACTGTAA 
CAAGACCAAC 
AATGTGGCGG 
GAGGACTGCT 
CCAGCCTGGG 



51 



21 31 41 

I I I I 

METSASSSQP QDNSQVHRET EDVDYGETDF HKQDGKAGLF SQEQYERNKS SSSSFSSSSS 

SSSSSSSSSS GPGHGEPDVL KDELQLYGDA PGEWPSGES GLRRRGSDPA SGEVEASQLR 

RLN I KKDDE F FHFVLLCFAI GAIiLVCYHYY ADWFMSLGVG LLTFASLETV GIYFGLVYRI 
HSVLQGFIPL FQKFRLTGFR KTD 

Seq ID NO: 123 DNA sequence 
Nucleic Acid Accession #: BC022542 
Coding sequence: 243.. 896 



11 



21 



41 



51 



360 
420 
480 
540 
600 
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720 
780 
840 
900 
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1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



PCT/US02/12476 



31 

I I ! I 1 

ACTTGGTCCC AGCCGATAAA TCTGGGGCAG CGCGCGGTAG GAGCTGCGGG CGGCCAGGCC 
CCTTCCTGCG TCCGCACCTG GCCCCGCGCG CCCCTCTCGG GCGTCCGGCT TCCGGCGTCC 
TGGCGGCTCG GGTGGCGGCG GTTCGGGCGG CCGCCTGGCT GCTCCTCGGG GCGGCGACGG 
GGCTCACGCG CGGGCCCGCC ACGGCCTTCA CCGCCGCGCG CTCTGACGCC GGCATAAGGG 
CCATGTGTTC TGAAATTATT TTGAGGCAAG AAGTTTTGAA AGATGGTTTC CACAGAGACC 
TTTTAATCAA AGTGAAGTTT GGGGAAAGCA TTGAGGACTT GCACACGTGC CGTCTCTTAA 
TTAAACAGGA CATTCCTGCA GGACTTTATG TGGATCCGTA TGAGTTGGCT TCATTACGAG 
AGAGAAACAT AACAGAGGCA GTGATGGTTT CAGAAAATTT TGATATAGAG GCCCCTAACT 
ATTTGTCCAA GGAGT CTGAA GTTCTCATTT ATGC CAGACG AGATTCACAG TGCATTGACT 
GTTTTCAAGC CTTTTTGCCT GTGCACTGCC GCTATCATCG GCCGCACAGT GAAGATGGAG 
AAGCCT CGAT TGTGGTCAAT AACCCAGATT TGTTGATGTT TTGTGACCAA GAGTTCCCGA 
TTTTGAAATG CTGGGCTCAC TCAGAAGTGG CAGCCCCTTG TGCTTTGGAT AATGAGGATA 
TATGCCAATG GAACAAGATG AAGTATAAAT CAGTATATAA GAATGTGATT CTACAAGTTC 
CAGTGGGACT GACTGTACAT ACCTCTCTAG TATGTTCTGT GACTCTGCTC ATTACAATCC 
TGTGCTCTAC ATTGATCCTT GTAGCAGTTT TCAAATATGG CCATTTTTCC CTATAAGTTT 
TATGTAGTTA AATGCTTCCT AGAAACCTAA ATAAGATCTA TTAATTTCTG ACGAGAGGTG 
TTCTTCTAGA ATTAATTACT TTTATCTTTT GTCTTCATTT GTGGCCAAAA TTATGTTTAC 
TAGAGGAAAT TTGGGATCAT T CTCAGCTAA TTCCAAAATG TAGTGCTCTA TTGCATGGAT 
CCTTGGTAAT CCTCAAGCAT CAGATGCCAT AAGGGGAAAC TTAATTCTGC TAAATTAATG 
TTTATTTTGT GAGAAGTGAC TTTATCTTCA TTTGGGGTAG AAAAATTATT TCTTTATGTA 
GTAGAGACAA ATTATTCTCA TTTTGCAAGT ACTTTCAATT TAAGCTACAA ATTGAGAAAA 
CCGTTATAAA TAAGAATAAA ATAGGCCAGG CACAGTGGCT CACACCTGTA ATCCCAGCAC 
TTTGGGAGGC CGAGGTGGGC GGATCACCAG AGGTCAAGAG TTTGAGACCA GCTTGGTGAA 
ACCCTGTCTC TACTAAAAAT ACAAAAGTTA GCTGGGGCTG GTGGTGGGCA TCTGTAGTCC 
CAGCTAATTG GAAGGGTGAG GCGGGAGGAT CGCTTGAACC TGGGAGGCGG AGGTTCCAGA 
GAGCCAAGAT CGCACCACTG CACTACAGCC TGGGCGACAG AACGAGACCC TGTCTCCAAA 
GGAAAAACAA AAAAGAAGAA TAAAATAATT TGGATGAAAA TCATGTTTAT TTAAATAGTA 
ATGTCATGAG ACTATTAAAG ATGTGCCAGA GTTTCAATGA AAATCATTAA AGTAGGACAG 
CTAAGAAATT AATATTAATA TAAAAATTAT TGATAAT CTT AAATTATTGA TTATTCCTTA 
ACGCACTCCA TTCTCCTTTT ACATTTTATC ATGTTTCTTT TGAATATATG AATTGGCAAA 
GGACTTGATG AAACTGAGTA CTAAGATTTG GTACAGAGTA TGT CAGGAAG ACAACTCAGA 
TTGCCATTTT AAATAAAGTT GTACATGAAC AAAAAAAAAA AAAAAA 
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Seq ID NO: 124 Protein sequence: 
Protein Accession #: AAH22542 

1 11 21 31 41 51 

I I 1 I ' I I 

MCSEIILRQE VLKDGFHRDL LIKVKFGESI EDLHTCRLLI KQDIPAGLYV DPYELASLRE 
RNITEAVMVS ENFDIEAPNY LSKESEVLIY ARRDSQCIDC FQAFLPVHCR YHRPHSEDGE 
AS I WNNPDL LMFCDQAGSR RMIRFRFDSF DKTIEFPILK CWAHSEVAAP CALENEDICQ 
WNKMKYKSVY KNVILQVPVG LTVHTSLVCS VTLLITILCS KKKKK 

Seq ID NO: 125 DNA sequence 

Nucleic Acid Accession #: NM_004994.1 

Coding sequence: 20.. 2143 
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1 11 21 31 

I I 1 1 

AGACACCTCT GCCCTCACCA TGAGCCTCTG GCAGCCCCTG 
GGGCTGCTGC TTTGCTGCCC CCAGACAGCG CCAGTCCACC 
CCTGAGAACC AATCTCACCG ACAGGGAGCT GGCAGAGGAA 
CACTCGGGTG GCAGAGATGC GTGGAGAGTC GAAATCTCTG 
CCAGAAGCAA CTGTCCCTGC CCGAGACCGG TGAGCTGGAT 
GCGAACCCCA CGGTGCGGGG TCCCAGACCT GGGCAGATTC 
CAAGTGGCAC CACCACAACA TCACCTATTG GATCCAAAAC 
GGCGGTGATT GACGACG CCT TTGCCCGCGC CTTCGCACTG 
CACCTTCACT CGCGTGTACA GCCGGGACGC AGACATCGTC 
GCACGGAGAC GGGTATCCCT TCGACGGGAA GGACGGGCTC 
TGGCCCCGGC ATTCAGGGAG ACGCCCATTT CGACGATGAC 
GGGCGTCGTG GTTCCAACTC GGTTTGGAAA CGCAGATGGC 
CATCTTCGAG gqCCGCTCCT ACTCTGCCTG CACCACCGAC 
CTGGTGCAGT ACCACGGCCA ACTACGACAC CGACGACCGG 
GAGACTCTAC ACCCGGGACG GCAATGCTGA TGGGAAACCC 
CCAAGGCCAA TCCTACTCCG CCTGCACCAC GGACGGTCGC 
CGCCACCACC GCCAACTACG ACCGGGACAA GCTCTTCGGC 
CTCGACGGTG ATGGGGGGCA ACTCGGCGGG GGAGCTGTGC 
GGGTAAGGAG TACTCGACCT GTACCAGCGA GGGCCGCGGA 
TAGCACCTCG AACTTTGACA GCGACAAGAA GTGGGGCTTC 
TTTGTTCCTC GTGGCGGCGC ATGAGTTCGG CCACGCGCTG 
GCCGGAGGCG CTCATGTACC CTATGTACCG CTTCACTGAG 
CGACGTGAAT GGCATCCGGC ACCTCTATGG TCCTCGCCCT 
AACCACCACC ACACCGCAGC CCACGGCTCC CCCGACGGTC 
TGTCCACCCC TCAGAGCGCC CCACAGCTGG CCCCACAGGT 
AGGTCCCCCC ACTGCTGGCC CTTCTACGGC CACTACTGTG 
TGCCTGCAAC GTGAACATCT TCGACGCCAT CGCGGAGATT 
CAAGGATGGG AAGTACTGGC GATTCT CTGA GGGCAGGGGG 
CCTTATCGCC GACAAGTGGC CCGCGCTGCC CCGCAAGCTG 
GCTCTCCAAG AAGCTTTTCT TCTTCTCTGG GCGCCAGGTG 
GGTGCTGGGC CCGAGGCGTC TGGACAAGCT GGGCCTGGGA 
CGGGGCCCTC CGGAGTGGCA GGGGGAAGAT GCTGCTGTTC 
GTTCGACGTG AAGGCGCAGA TGGTGGATCC CCGGAGCGCC 
CCCCGGGGTG CCTTTGGACA CGCACGACGT CTTCCAGTAC 
CCAGGACCGC TTCTACTGGC GCGTGAGTTC CCGGAGTGAG 
GGGCTACGTG ACCTATGACA TCCTGCAGTG CCCTGAGGAC 
GCAGTGCCAT GTAAATCCCC ACTGGGACCA ACCCTGGGGA 
CAAACTGGTA TTCTGTTCTG GAGGAAAGGG AGGAGTGGAG 
TCACCTTTGT TTTTTGTTGG AGTGTTTCTA ATAAACTTGG 

Seq ID NO: 126 Protein sequence: 
Protein Accession #: NP_004985.1 



PCT/US02/12476 



MSLWQPLVLV 
RGESKSLGPA 
ITYWIQNYSE 
FDGKDGLLAH 
YSACTTDGRS 
ACTTDGRSDG 
CTS EGRGDGR 
PMYRFTEGPP 
PTAGPTGPPS 
RFSEGRGSRP 
LDKLGLGADV 
THDVFQYREK 



11 

I 

IiLVLGCCFAA 
LLLLQKQLSL 
DLPRAVIDDA 
AFPFGPGIQG 
DGLPWCSTTA 
YRWCATTANY 
LWCATTSNFD 
LHKDDVNGIR 
AGPTGPPTAG 
QGPFLIADKW 
AQVTGALRSG 
AYFCQDRFYW 



21 
I 

PRQRQSTLVL 
PETGELDSAT 
FARAFALWSA 
DAHFDDDELW 
NYDTDDRFGF 
DRDKLFGFCP 
SDKKWGFCPD 
HLYGPRPEPE 
PSTATTVPLS 
PALPRKLDSV 
RGKMLLFSGR 
RVSSRSEIiNQ 



31 
I 

FPGDLRTNLT 
LKAMRTPRCG 
VTPLTFTRVY 
SLGKGVWPT 
CPSERLYTRD 
TRADSTVMGG 
QGYSLFLVAA 
PRPPTTTTPQ 
PVDDACNVNI 
FEEPLSKKLF 
RLWRFDVKAQ 
VDQVGYVTYD 



Seq ID NO: 127 DNA sequence 
Nucleic Acid Accession #: NM_004181 
Coding sequence: 32-670 



GCAGAAATAG 
GGTCGCCGGC 
GGTGCCAGCG 
CTTCAGGAAA 
CATGAAGCAG 
TAATCAAGAC 
AGAGAAAATG 
AGCCCATGAT 
TTTTATTCTG 
TCCGGTGAAC 
CAGAGAATTC 
GGCAGCCTAA 
AATATATACC 
TGTTCTGCAG 
ACAGCTGTCC 
TATGTCTTGT 
AAGACCTTGG 



11 
I 

CCTAGGGAGA 
CAGTGGCGCT 
CCTGCCTGCG 
AAGCAGATTG 
ACCATTGGGA 
AAACTGGGAT 
TCCCCTGAAG 
GCCGTGGCAC 
TTTAACAACG 
CATGGCGCCA 
ACCGAGCGTG 
TGCTCTGTGG 
CCCCATGCAG 
ACACGCCTTC 
ACTGGGCCAT 
AT CCG AT AT C 
ATGTGGTTAT 



21 
I 

TCAACCCCGA 
TCGTGGACGT 
CGCTGCTGCT 
AAGAGCTGAA 
ATTCCTGTGG 
TTGAGGATGG 
ACAGAGCAAA 
AGGAAGGCCA 
TGGATGGCCA 
GTT CAGAGGA 
AGCAAGGAGA 
GAGGGACTTT 
TCTAAAATGC 
CCCTCAGCCA 
TGTGGTGTGA 
TAACGCTTTA 
GTTGTCCTAA 



31 

I 

GATGCTGAAC 
GCTGGGGCTG 
GCTGTTTCCC 
GGGACAAGAA 
CACAATCGGA 
ATCAGTTCTG 
ATGCTTTGAA 
ATGTCGGGTA 
CCTCTATGAA 
CACCCTGCTG 
AGTCCGCTTC 
GCTGATTTCC 
TTCAGTACTT 
CACCCAGGCA 
GCTTCAGATG 
AATGGCTACT 
AGAATAAATT 



41 


51 




1 

GTCCTGGTGC 


| 

TCCTGGTGCT 


60 


CTTGTGCTCT 


TCCCTGGAGA 


120 


TACCTGTACC 


GCTATGGTTA 


180 


GGGCCTGCGC 


TGCTGCTTCT 


240 


AGCGCCACGC 


TGAAGGCCAT 


300 


CAAACCTTTG 


AGGGCGACCT 


360 


TACTCGGAAG 


ACTTGCCGCG 


420 


TGGAGCGCGG 


TGACGCCGCT 


480 


ATCCAGTTTG 


GTGTCGCGGA 


540 


CTGGCACACG 


CCTTTCCTCC 


600 


GAGTTGTGGT 


CCCTGGGCAA 


660 


GCGGCCTGCC 


ACTTCCCCTT 


720 


GGTCGCTCCG 


ACGGCTTGCC 


780 


TTTGGCTTCT 


GCCCCAGCGA 


840 


TGCCAGTTTC 


CATTCATCTT 


900 


TCCGACGGCT 


ACCGCTGGTG 


960 


TTCTGCCCGA 


CCCGAGCTGA 


1020 


GTCTTCCCCT 


TCACTTTCCT 


1080 


GATGGGCGCC 


TCTGGTGCGC 


1140 


TGCCCGGACC 


AAGGATACAG 


1200 


GGCTTAGATC 


ATTCCTCAGT 


1260 


GGGCCCCCCT 


TGCATAAGGA 


1320 


GAACCTGAGC 


CACGGCCTCC 


1380 


TGCCCCACCG 


GACCCCCCAC 


1440 


CCCCCCTCAG 


CTGGCCCCAC 


1500 


CCTTTGAGTC 


CGGTGGACGA 


1560 


GGGAACCAGC 


TGTATTTGTT 


1620 


AGCCGGCCGC 


AGGGCCCCTT 


1680 


GACTCGGTCT 


TTGAGGAGCC 


1740 


TGGGTGTACA 


CAGGCGCGTC 


1800 


GCCGACGTGG 


CCCAGGTGAC 


1860 


AGCGGGCGGC 


GCCTCTGGAG 


1920 


AGCGAGGTGG 


ACCGGATGTT 


1980 


CGAGAGAAAG 


CCTATTTCTG 


2040 


TTGAACCAGG 


TGGACCAAGT 


2100 


TAGGGCTCCC 


GTCCTGCTTT 


2160 


AGGAGCCAGT 


TTGCCGGATA 


2220 


GTGGGCTGGG 


CCCTCTCTTC 


2280 


ATTCTCTAAC 


CTTT 




41 


51 




| 

DRQLAEEYLY 


| 

RYGYTRVAEM ' 


60 


VPDLGRFQTF 


EGDLKWHHHN 


12 0 


SRDADIVIQF 


GVAEHGDGYP 


180 


RFGNADGAAC 


HFPFIFEGRS 


240 


GNADGKPCQF 


PFIFQGQSYS 


300 


NSAGELCVFP 


FTFLGKEYST 


360 


HE FGHALGLD 


HSSVPEALMY 


420 


PTAPPTVCPT 


GPPTVHPSER 


480 


FDAIAEIGNQ 


LYLFKDGKYW 


540 


FFSGRQVWVY 


TGASVLGPRR 


600 


MVDPRSASEV 


DRMFPGVPLD 


660 


ILQCPED 






41 


51 




| 

AAAGTGCTGT 


| 

CCCGGCTGGG 


60 


GAAGAGGAGT 


CTCTGGGCTC 


120 


CTCACGGCCC 


AGCATGAGAA 


180 


GTTAGTCCTA 


AAGTGTACTT 


240 


CTTATTCACG 


CAGTGGCCAA 


300 


AAACAGTTTC 


TTTCTGAAAC 


360 


AAGAATGAGG 


CCATACAGGC 


420 


GATGACAAGG 


TGAATTTCCA 


480 


CTTGATGGAC 


GAATGCCTTT 


540 


AAGGACGCTG 


CCAAGGTGTG 


600 


TCTGCCGTGG 


CTCTCTGCAA 


660 


CCTCTTCCCT 


TCAACATGAA 


720 


GTGAAACACA 


GCTGTTCTTC 


780 


CTTAAGCACA 


AGCAGAGTGC 


840 


GTGAAGCATT 


CTCCCCAGTG 


900 


TTGGTTTCTG 


TCTGTAAGTT 


960 


TTGCTGATAG 


TAGC 
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41 
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MLNKVLSRLG VAGQWRFVDV LGLEEESLGS VPAPACALLL LFPLTAQHEN FRKKQIEELK 60 

GQEVSPKVYF MKQTIGNSCG TIGLIHAVAN NQDKLGFEDG SVLKQFLSET EKMSPEDRAK 120 

5 CFEKNEAIQA AHDAVAQEGQ CRVDDKVNFH FILFNNVDGH LYELDGRMPF PVNHGASSED 180 
TLLKDAAKVC REFTEREQGE VRFSAVALCK AA 



Seq ID NO: 12 9 DNA sequence 
10 Nucleic Acid Accession #: NM_000213 
Coding sequence: 127-5385 

1 11 21 31 41 51 

- ~ I I I i 1 1 

1j cgcccgcgcg ctgcagcccc atctcctagc ggcagcccag GCGCGGAGGG AGCGAGTCCG 60 

CCCCGAGGTA GGTCCAGGAC GGGCGCACAG CAGCAG CCGA GGCTGGCCGG GAGAGGGAGG 120 

AAGAGGATGG CAGGG CCACG CCCCAGCCCA TGGGCCAGGC TGCTCCTGGC AGCCTTGATC 180 

AGCGTCAGCC TCTCTGGGAC CTTGGCAAAC CGCTGCAAGA AGGCCCCAGT GAAGAGCTGC 240 

ACGGAGTGTG TCCGTGTGGA TAAGGACTGC GCCTACTGCA CAGACGAGAT GTTCAGGGAC 300 

20 CGGCGCTGCA ACACCCAGGC GGAGCTGCTG GCCGCGGGCT GCCAGCGGGA GAG CATCGTG 360 

GTCATGGAGA GCAGCTTCCA AATCACAGAG GAGACCCAGA TTGACACCAC CCTGCGGCGC 420 

AGCCAGATGT CCCCCCAAGG CCTGCGGGTC CGTCTGCGGC CCGGTGAGGA GCGGCATTTT 480 

GAGCTGGAGG TGTTTGAGCC AGTGGAGAGC CCCGTGGACC TGTACATCCT CATGGACTTC 540 

TCCAACTCCA TGTCCGATGA TCTGGACAAC CTCAAGAAGA TGGGGCAGAA CCTGGCTCGG 600 

25 GTCCTGAGCC AGCTCACCAG CGACTACACT ATTGGATTTG GCAAGTTTGT GGACAAAGTC 660 

AGCGTCCCGC AGACGGACAT GAGGCCTGAG AAGCTGAAGG AGCCCTGGCC CAACAGTGAC 720 

CCCCCCTTCT CCTTCAAGAA CGTCATCAGC CTGACAGAAG ATGTGGATGA GTTCCGGAAT 780 

AAACTGCAGG GAGAGCGGAT CTCAGGCAAC CTGGATGCTC CTGAGGGCGG CTTCGATGCC 840 

AT CCTGCAGA CAGCTGTGTG CACGAGGGAC ATTGGCTGGC GCCCGGACAG CACCCACCTG 90 0 

30 CTGGTCTTCT CCACCGAGTC AGCCTTCCAC TATGAGGCTG ATGGCGCCAA CGTGCTGGCT 960 

GGCATCATGA GCCGCAACGA TGAACGGTGC CACCTGGACA CCACGGGCAC CTACACCCAG 1020 

TACAGGACAC AGGACTACCC GTCGGTGCCC ACCCTGGTGC GCCTGCTCGC CAAGCACAAC 1080 

ATCATCCCCA TCTTTGCTGT CACCAACTAC TCCTATAGCT ACTACGAGAA GCTTCACACC 1140 

TATTTCCCTG TCTCCTCACT GGGGGTGCTG CAGGAGGACT CGT CCAACAT CGTGGAGCTG 12 00 

35 CTGGAGGAGG CCTTCAATCG GATCCGCTCC AACCTGGACA TCCGGGCCCT AGACAGCCCC 1260 

CGAGGCCTTC GGACAGAGGT CACCTCCAAG ATGTTCCAGA AGACGAGGAC TGGGTCCTTT 1320 

CACATCCGGC GGGGGGAAGT GGGTATATAC CAGGTGCAGC TGCGGGCCCT TGAGCACGTG 1380 

GATGGGACGC ACGTGTGCCA GCTGCCGGAG GACCAGAAGG GCAACAT CCA TCTGAAACCT 1440 

TCCTTCTCCG ACGGCCTCAA GATGGACGCG GGCATCATCT GTGATGTGTG CACCTG CGAG 1500 

40 CTGCAAAAAG AGGTGCGGTC AGCTCGCTGC AGCTTCAACG GAGACTTCGT GTGCGGACAG 1560 

TGTGTGTGCA GCGAGGGCTG GAGTGGCCAG ACCTGCAACT GCTCCACCGG CTCTCTGAGT 1620 

GACATTCAGC CCTGCCTGCG GGAGGGCGAG GACAAGCCGT GCTCCGGCCG TGGGGAGTGC 1680 

CAGTGCGGGC ACTGTGTGTG CTACGGCGAA GGCCGCTACG AGGGT CAGTT CTGCGAGTAT 1740 

GACAACTTCC AGTGTCCCCG CACTTCCGGG TTCCTCTGCA ATGACCGAGG ACGCTGCTCC 1800 

45 ATGGGCCAGT GTGTGTGTGA GCCTGGTTGG ACAGGCCCAA GCTGTGACTG TCCCCTCAGC 1860 

AATGCCACCT GCATCGACAG CAATGGGGGC AT CTGTAATG GACGTGGCCA CTGTGAGTGT 1920 

GGCCGCTGCC ACTGCCACCA GCAGTCGCTC TACACGGACA CCATCTGCGA GATCAACTAC 1980 

TCGGCGATCC ACCCGGGCCT CTGCGAGGAC CTACGCTCCT GCGTGCAGTG CCAGGCGTGG 2 040 

GGCACCGGCG AGAAGAAGGG GCGCACGTGT GAGGAATGCA ACTTCAAGGT CAAGATGGTG 2100 

50 GACGAGCTTA AGAGAGCCGA GGAGGTGGTG GTGCGCTGCT CCTTCCGGGA CGAGGATGAC 2160 

GACTGCACCT ACAGCTACAC CATGGAAGGT GACGGCGCCC CTGGGCCCAA CAGCACTGTC 222 0 

CTGGTGCACA AGAAGAAGGA CTGCCCTCCG GGCTCCTTCT GGTGGCTCAT CCCCCTGCTC 2280 

CTCCTCCTCC TGCCGCTCCT GGCCCTGCTA CTGCTGCTAT GCTGGAAGTA CTGTGCCTGC 2340 

TGCAAGGCCT GCCTGGCACT TCTCCCGTGC TGCAACCGAG GTCACATGGT GGGCTTTAAG 2400 

55 GAAGACCACT ACATGCTGCG GGAGAACCTG ATGG CCTCTG ACCACT TGGA CACGCCCATG 2460 

CTGCGCAGCG GGAACCTCAA GGGCCGTGAC GTGGTCCGCT GGAAGGTCAC CAACAACATG 252 0 

CAGCGGCCTG GCTTTGCCAC TCATGCCGCC AGCATCAACC CCACAGAGCT GGTGCCCTAC 2580 

GGGCTGTCCT TGCGCCTGGC CCGCCTTTGC ACCGAGAACC TGCTGAAGCC TGACACTCGG 2640 

GAGTGCGCCC AGCTGCGCCA GGAGGTGGAG GAGAACCTGA ACGAGGTCTA CAGG CAGATC 2700 

60 T CCGGTGT AC ACAAGCTCCA GCAGACCAAG TTCCGGCAGC AGCCCAATGC CGGGAAAAAG 2760 

CAAGACCACA CCATTGTGGA CACAGTGCTG ATGGCGCCCC GCTCGGCCAA GCCGGCCCTG 2820 

CTGAAGCTTA CAGAGAAGCA GGTGGAACAG AGGGCCTTCC ACGACCTCAA GGTGGCCCCC 2880 

GGCTACTACA CCCTCACTGC AGACCAGGAC GCCCGGGGCA TGGTGGAGTT CCAGGAGGGC 2 940 

GTGGAGCTGG TGGACGTACG GGTGCCCCTC TTTATCCGGC CTGAGGATGA CGACGAGAAG 3000 

65 CAGCTGCTGG TGGAGGCCAT CGACGTGCCC GCAGGCACTG CCACCCTCGG CCGCCGCCTG 3060 

GTAAACAT CA CC AT CAT C AA GGAGCAAGCC AGAGACGTGG TGTCCTTTGA GCAGCCTGAG 3120 

TTCTCGGTCA GCCGCGGGGA CCAGGTGGCC CGCATCCCTG TCATCCGGCG TGTCCTGGAC 3180 

GGCGGGAAGT CCCAGGTCTC CTACCGCACA CAGGATGGCA CCGCGCAGGG CAACCGGGAC 3240 

TACATCCCCG TGGAGGGTGA GCTGCTGTTC CAGCCTGGGG AGGCCTGGAA AGAGCTGCAG 3300 

70 GTGAAGCTCC TGGAGCTGCA AGAAGTTGAC TCCCTCCTGC GGGGCCGCCA GGTCCGCCGT 3360 

TTCCACGTCC AGCTCAGCAA CCCTAAGTTT GGGGCCCACC TGGGCCAGCC CCACTCCACC 342 0 

ACCATCATCA TCAGGGACCC AGATGAACTG GACCGGAGCT TCACGAGTCA GATGTTGTCA 3 480 

TCACAGCCAC CCCCTCACGG CGACCTGGGC GCCCCGCAGA ACCCCAATGC TAAGGCCGCT 3 540 

GGGTCCAGGA AGATCCATTT CAACTGGCTG CCCCCTTCTG GCAAGCCAAT GGGGTACAGG 3600 

75 GTAAAGTACT GGATTCAGGG TGACTCCGAA TCCGAAGCCC ACCTGCTCGA CAGCAAGGTG 3660 

CCCTCAGTGG AGCTCAGCAA CCTGTACCCG TATTGCGACT ATGAGATGAA GGTGTGCGCC 372 0 

TACGGGGCTC AGGGCGAGGG ACCCTACAGC TCCCTGGTGT CCTGCCGCAC CCACCAGGAA 3 780 

GTGCCCAGCG AGCCAGGGCG TCTGGCCTTC AATGTCGTCT CCTCCACGGT GACCCAGCTG 3840 

AGCTGGGCTG AGCCGGCTGA GACCAACGGT GAGATCACAG CCTACGAGGT CTGCTATGGC 3900 

80 CTGGTCAACG ATGACAACCG ACCTATTGGG CCCATGAAGA AAGTGCTGGT TGACAACCCT 3960 

AAGAACCGGA TGCTGCTTAT TGAGAACCTT CGGGAGTCCC AGCCCTACCG CTACACGGTG 402 0 

AAGGCGCGCA ACGGGGCCGG CTGGGGGCCT GAGCGGGAGG CCATCATCAA CCTGGCCACC 40 80 

CAGCCCAAGA GGCCCATGTC CATCCCCATC ATCCCTGACA TCCCTATCGT GGACGCCCAG 4140 

AGCGGGGAGG ACTACGACAG CTTCCTTATG TACAGCGATG ACGTTCTACG CTCTCCATCG 42 0 0 

85 GGCAGCCAGA GGCCCAGCGT CTCCGATGAC ACTGAGCACC TGGTGAATGG CCGGATGGAC 42 60 

TTTGCCTTCC CGGGCAGCAC CAACTCCCTG CACAGGATGA CCACGACCAG TGCTGCTGCC 4320 

TATGGCACCC ACCTGAGCCC ACACGTGCCC CACCGCGTGC TAAGCACATC CTCCACCCTC 43 80 
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ACACGGGACT ACAACTCACT GACCCGCTCA GAACACTCAC ACTCGACCAC ACTGCCGAGG 4440 

GACTACTCCA CCCTCACCTC CGTCTCCTCC CACGACTCTC GCCTGACTGC TGGTGTGCCC 4500 

GACACGCCCA CCCGCCTGGT GTTCTCTGCC CTGGGGCCCA CATCTCTCAG AGTGAGCTGG 4560 

CAGGAGCCGC GGTGCGAGCG GCCGCTGCAG GGCTACAGTG TGGAGTACCA GCTGCTGAAC 4620 

5 GGCGGTGAGC TGCATCGGCT CAACATCCCC AACCCTGCCC AGACCTCGGT GGTGGTGGAA 4680 

GACCTCCTGC CCAACCACTC CTACGTGTTC CGCGTGCGGG CCCAGAGCCA GGAAGGCTGG 4740 

GGCCGAGAGC GTGAGGGTGT CATCACCATT GAATCCCAGG TGCACCCGCA GAGCCCACTG 4800 

TGTCCCCTGC CAGGCTCCGC CTTCACTTTG AGCACTCCCA GTGCCCCAGG CCCGCTGGTG 4860 

TTCACTGCCC TGAGCCCAGA CTCGCTGCAG CTGAGCTGGG AGCGGCCACG GAGGCCCAAT 492 0 

10 GGGGAT AT CG TCGGCTACCT GGTGACCTGT GAGATGGCCC AAGGAGGAGG GCCAGCCACC 4 98 0 

GCATT CCGGG TGGATGGAGA CAGCCCCGAG AGCCGGCTGA CCGTGCCGGG CCTCAGCGAG 5040 

AACGTGCCCT ACAAGTTCAA GGTGCAGGCC AGGACCACTG AGGGCTTCGG GCCAGAGCGC 5100 

GAGGGCATCA TCACCATAGA GTCCCAGGAT GGAGGACCCT TCCCGCAGCT GGGCAGCCGT 5160 

GCCGGGCTCT TCCAGCACCC GCTGCAAAGC GAGTACAGCA GCATCACCAC CACCCACACC 522 0 

15 AGCGCCACCG AGCCCTTCCT AGTGGATGGG CCGACCCTGG GGGCCCAGCA CCTGGAGGCA 52 80 

GGCGGCTCCC TCACCCGGCA TGTGACCCAG GAGTTTGTGA GCCGGACACT GACCACCAGC 5340 

GGAACCCTTA GCACCCACAT GGACCAACAG TTCTTCCAAA CTTGACCGCA CCCTGCCCCA 5400 

CCCCCGCCAT GTCCCACTAG GCGTCCTCCC GACTCCTCTC CCGGAGCCTC CTCAGCTACT 5460 

1 CCATCCTTGC ACCCCTGGGG GCCCAGCCCA CCCGCATGCA CAGAGCAGGG GCTAGGTGTC 5520 

20 TCCTGGGAGG CATGAAGGGG GCAAGGT CCG TCCTCTGTGG GCCCAAACCT ATTTGTAACC 5580 

AAAGAGCTGG GAGCAGCACA AGGACCCAGC CTTTGTTCTG CACTTAATAA ATGGTTTTGC 5640 
ACTG 
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30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
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MAGPRPSPWA 
CNTQAELLAA 
EVFEPLESPV 
PQTDMRPEKL 
QTAVCTRDIG 
TQDYPSVPTL 
EAFNRIRSNL 
THVCQLPEDQ 
CSEGWSGQTC 
FQCPRTSGFL 
CHCHQQSLYT 
LKRAEEVWR 
LLPLLALLLL 
SGNLKGRDW 
AQLRQEVEEN 
LTEKQVEQRA 
LVEAIDVPAG 
KSQVSYRTQD 
VQLSNPKFGA 
RKIHFNWLPP 
AQGEGPYSSL 
NDDNRPIGPM 
KRPMSIPIIP 
FPGSTNSLHR 
STLTSVSSHD 
ELHRLNIPNP 
LPGSAFTLST 
RVDGDSPESR 
LFQHPLQSEY 
LSTHMDQQFF 



11 

! 

RLLLAAL I S V 
GCQRESIWM 
DLYILMDFSN 
KEPWPNSDPP 
WRPDSTHLLV 
VRLLAKHNII 
DIRALDSPRG 
KGNIHLKPSF 
NCSTGSLSDI 
CNDRGRCSMG 
DTICEINYSA 
CSFRDEDDDC 
LCWKYCACCK 
RWKVTNNMQR 
LNEVYRQISG 
FHDLKVAPGY 
TATLGRRLVN 
GTAQGNRDYI 
HLGQPHSTTI 
SGKPMGYRVK 
VSCRTHQEVP 
KKVLVDNPKN 
DIPIVDAQSG 
MTTTSAAAYG 
SRLTAGVPDT 
AQTSVWEDL 
PSAPGPLVFT 
LTVPGLSENV 
SSITTTHTSA 
QT 



21 
I 

SLSGTLANRC 
ESSFQITEET 
SMSDDLDNLK 
FSFKNVISLT 
FSTESAFHYE 
PIFAVTNYSY 
LRTEVTSKMF 
SDGLKMDAGI 
QPCLREGEDK 
QCVCEPGWTG 
IHPGLCEDLR 
TYSYTMEGDG 
ACL ALL PCCN 
PGFATHAAS I 
VHKLQQTKFR 
YTLTADQDAR 
ITIIKEQARD 
PVEGELLFQP 
IIRDPDELDR 
YWIQGDSESE 
SEPGRLAFNV 
RMLLIENLRE 
EDYDSFLMYS 
THLSPHVPHR 
PTRLVFSALG 
LPNHSYVFRV 
ALSPDSLQLS 
PYKFKVQART 
TEPFLVDGPT 



31 

I 

KKAPVKSCTE 
QIDTTLRRSQ 
KMGQNLARVL 
EDVDE FRNKL 
ADGANVDAGI 
SYYEKLHTYF 
QKTRTGSFHI 
ICDVCTCELQ 
PCSGRGECQC 
PSCDCPLSNA 
SCVQCQAWGT 
APGPNSTVLV 
RGHMVGFKED 
NPTELVPYGL 
QQPNAGKKQD 
GMVEFQEGVE 
WSFEQPEFS 
GEAWKELQVK 
SFTSQMLSSQ 
AHLLDSKVPS 
VSSTVTQLSW 
SQPYRYTVKA 
DDVLRSPSGS 
VLSTSSTLTR 
PTSLRVSWQE 
RAQSQEGWGR 
WERPRRPNGD 
TEGFGPEREG 
LGAQHLEAGG 



41 
I 

CVRVDKDCAY 
MSPQGLRVRL 
SQLTSDYTIG 
QGERISGNLD 
MSRNDERCHL 
PVSSLGVLQE 
RRGEVGIYQV 
KEVRSARCSF 
GHCVCYGEGR 
TCIDSNGGIC 
GEKKGRTCEE 
HKKKDCPPGS 
HYMLRENLMA 
SLRLARLCTE 
HTIVDTVLMA 
LVDVRVPLFI 
VSRGDQVARI 
LLELQEVDSL 
PPPHGDLGAP 
VELTNLYPYC 
AEPAETNGEI 
RNGAGWGPER 
QRPSVSDDTE 
DYNSLTRSEH 
PRCERPLQGY 
EREGVITIES 
IVGYLVTCEM 
IITIESQDGG 
SLTRHVTQEF 



51 
I 

CTDEMFRDRR 
RPGEERHFEL 
FGKFVDKVSV 
APEGGFDAIL 
DTTGTYTQYR 
DSSNIVELLE 
QLRALEHVDG 
NGDFVCGQCV 
YEGQFCEYDN 
NGRGHCECGR 
CNFKVKMVDE 
FWWLIPLLLL 
SDHLDTPMLR 
NLLKPDTREC 
PRSAKPALLK 
RPEDDDEKQL 
PVIRRVLDGG 
LRGRQVRRFH 
QNPNAKAAGS 
DYEMKVCAYG 
TAYEVCYGLV 
EAIINLATQP 
HLVNGRMD FA 
SHSTTLPRDY 
SVEYQLLNGG 
QVHPQSPLCP 
AQGGGPATAF 
PFPQLGSRAG 
VSRTLTTSGT 



Seq ID NO: 131 DNA sequence 
Nucleic Acid Accession #: BC004372 
Coding sequence: 132.. 2231 



CCTCGTGCCG 
GCCGGCCCCT 
CTCCGGACAC 
TGAGCCTGGC 
AGAAAAATGG 
ATAGCACCTT 
GCAGGTATGG 
GTGCAGCAAA 
ATTGCTTCAA 
ATGCCTTTGA 
AGAAAGGAGA 
ACGTGAGCAG 
CCTTTTCTAC 
ACAGAATCCC 
AAGAAAATGA 
ATGAAGATTT 
AGAACCAGGA 
CAACCACAAG 
ACCCAGAAGC 
ATTCTACAAG 
AGGAACAGTG 



11 

1 

CGGACCCCAG 
GCCCCGCGCC 
CATGGACAAG 
GCAGATCGAT 
TCGCTACAGC 
GCCCACAATG 
GTT CATAGAA 
CAACACAGGG 
TGCTTCAGCT 
TGGACCAATT 
ATACAGAACG 
CGGCTCCTCC 
TGTACACCCC 
TGCTACCAGT 
AGATGAAAGA 
TATCTCCAGC 
CTGGACCCAG 
GATGACTGAT 
ACACCCTCCC 
CACAATCCAG 
GTTTGGCAAC 



21 

I 

CCTCTGCCAG 
CAGGGATCCT 
TTTTGGTGGC 
TTGAATATAA 
ATCTCTCGGA 
GCCCAGATGG 
GGGCATGTGG 
GTGTACATCC 
CCACCTGAAG 
ACCATAACTA 
AATCCTGAAG 
AGTGAAAGGA 
ATCCCAGACG 
ACGTCTTCAA 
GACAGACACC 
ACCATTTCAA 
TGGAACCCAA 
GTAGACAGAA 
CTCATTCACC 
GCAACTCCTA 
AGATGGCATG 



31 , 
I 

GTTCGGTCCG 
CCAGCTCCTT 
ACGCAGCCTG 
CCTGCCGCTT 
CGGAGGCCGC 
AGAAAGCTCT 
TGATTCCCCG 
TCACATCCAA 
AAGATTGTAC 
TTGTTAACCG 
ACATCTACCC 
GCAGCACTTC 
AAGACAGTCC 
ATACCATCTC 
TCAGTTTTTC 
CCACACCACG 
GCCATTCAAA 
ATGGCACCAC 
ATGAGCATCA 
GTAGTACAAC 
AGGGATATCG 



41 
I 

CCATCCTCGT 
TCGCCCGCGC 
GGGACTCTGC 
TGCAGGTGTA 
TGACCTCTGC 
GAGCATCGGA 
GATCCACCCC 
CACCTCCCAG 
ATCAGTCACA 
TGATGGCACC 
CAGCAACCCT 
AGGAGGTTAC 
CTGGATCACC 
AGCAGGCTGG 
TGGATCAGGC 
GGCTTTTGAC 
TCCGGAAGTG 
TGCTTATGAA 
TGAGGAAGAA 
GGAAGAAACA 
CCAAACACCC 



51 

I 

CCCGTCCTCC 
CCTCCGTTCG 
CTCGTGCCGC 
TTCCACGTGG 
AAGGCTTTCA 
TTTGAGACCT 
AACTCCATCT 
TATGACACAT 
GACCTGCCCA 
CGCTATGTCC 
ACTGATGATG 
ATCTTTTACA 
GACAGCACAG 
GAGCCAAATG 
ATTGATGATG 
CACACAAAAC 
CTACTTCAGA 
GGAAACTGGA 
GAGACCCCAC 
GCTACCCAGA 
AGAGAAGACT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
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CCCATTCGAC AACAGGGACA GCTGCAGCCT CAGCTCATAC CAGCCATCCA ATGCAAGGAA 1320 

GGACAACACC AAGCCCAGAG GACAGTTCCT GGACTGATTT CTTCAACCCA ATCTCACACC 1380 

CCATGGGACG AGGTCATCAA GCAGGAAGAA GGATGGATAT GGACTCCAGT CATAGTACAA 1440 

CGCTTCAGCC TACTGCAAAT CCAAACACAG GTTTGGTGGA AGATTTGGAC AGGACAGGAC 1500 

CTCTTTCAAT GACAACGCAG CAGAGTAATT CTCAGAGCTT CTCTACATCA CATGAAGGCT 1560 

TGGAAGAAGA TAAAGACCAT CCAACAACTT CTACTCTGAC ATCAAGCAAT AGGAATGATG 1620 

TCACAGGTGG AAGAAGAGAC CCAAATCATT CTGAAGGCTC AACTACTTTA CTGGAAGGTT 1680 

ATACCTCTCA TTACCCACAC ACGAAGGAAA GCAGGACCTT CATCCCAGTG ACCTCAGCTA 1740 

AGACTGGGTC CTTTGGAGTT ACTGCAGTTA CTGTTGGAGA TTCCAACTCT AATGTCAATC 180 0 

GTTCCTTATC AGGAGACCAA GACACATTCC ACCCCAGTGG GGGGTCCCAT ACCACTCATG 1860 

GATCTGAATC AGATGGACAC TCACATGGGA GTCAAGAAGG TGGAGCAAAC ACAACCTCTG 1920 

GTCCTATAAG GACACCCCAA ATTCCAGAAT GGCTGATCAT CTTGGCATCC CTCTTGGCCT 1980 

TGGCTTTGAT TCTTGCAGTT TGCATTGCAG TCAACAGTCG AAGAAGGTGT GGGCAGAAGA 2040 

AAAAGCTAGT GATCAACAGT GGCAATGGAG CTGTGGAGGA CAGAAAGCCA AGTGGACT C A 2100 

ACGGAGAGGC CAGCAAGTCT CAGGAAATGG TGCATTTGGT GAACAAGGAG TCGTCAGAAA 2160 

CTCCAGACCA GTTTATGACA GCTGATGAGA CAAGGAACCT GCAGAATGTG GACATGAAGA 2220 

TTGGGGTGTA ACACCTACAC CATTATCTTG GAAAGAAACA ACCGTTGGAA ACATAACCAT 2280 

TACAGGGAGC TGGGACACTT AACAGATGCA ATGTGCTACT GATTGTTTCA TTGCGAATCT 2340 
TTTTTAGCAT AAAATTTTCT ACTCTTAAAA AAAAAAAAAA AAAAAAA 



PCT/US02/12476 



25 



Seq ID NO; 132 Protein sequence: 
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11 



21 



31 



41 
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40 
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75 
80 
85 



MDKFWWHAAW 
PTMAQMEKAL 
ASAPPEEDCT 
GSSSERSSTS 
DERDRHLSFS 
MTDVDRNGTT 
FGNRWHEGYR 
GHQAGRRMDM 
KDHPTTSTLT 
FGVTAVTVGD 
TPQIPEWbll 
SKSQEMVHLV 



GLCLVPLSLA 
SIGFETCRYG 
SVTDLPNAFD 
GGYIFYTFST 
GSGIDDDEDF 
AYEGNWNPEA 
QTPREDSHST 
DSSHSTTLQP 
S SNRNDVTGG 
SNSNVNRSLS 
LASLLALALI 
NKESSETPDQ 



QIDLNITCRF 
FIEGHWIPR 
GPITITIVNR 
VHPIPDEDSP 
ISSTISTTPR 
HPPLIHHEHH 
TGTAAASAHT 
TANPNTGLVE 
RRDPNHSEGS 
GDQDTFHPSG 
LAVCIAVNSR 
FMTADETRNL 



AGVFHVEKNG 
IHPNSICAAN 
DGTRYVQKGE 
WITDSTDRIP 
AFDHTKQNQD 
EEEETPHSTS 
SHPMQGRTTP 
DLDRTGPLSM 
TTLLEGYTSH 
GSHTTHGSES 
RRCGQKKKLV 
QNVDMKIGV 



RYSISRTEAA 
NTGVYILTSN 
YRTNPEDIYP 
ATSTSSNTIS 
WTQWNPSHSN 
TIQATPSSTT 
SPEDSSWTDF 
TTQQSNSQSF 
YPHTKESRTF 
DGHSHGSQEG 
INSGNGAVED 



DLCKAFNSTLi 
TSQYDTYCFN 
SNPTDDDVSS 
AGWEPNEENE 
PEVLLOTTTR 
EETATQKEQW 
FNPISHPMGR 
STSHEGLEED 
IPVTSAKTGS 
GANTTSGPIR 
RKPSGLNGEA 



Seq ID NO: 133 DNA sequence 
Nucleic Acid Accession #*. NMJD02882 
Coding sequence: 150-755 



CGAGGTTCGG 
GCGGAGGGAA 
AGCCGAGCCG 
ATGATACTTC 
TTTCTCTTCC 
TGCGGGCAAA 
GCACTGGTGA 
GGAGGGACAA 
AGCCCAACGC 
AGTGCCCCAA 
TCAAAACAAA 
CAGG CAAAAA 
AGGAGACCAA 
TCTCTTTCCT 
ATTCTTTCAT 



11 

I 

GTCGTGGGGC 
GGAGCTACGA 
CCGCCGCCGC 
CACTGAGAAT 
TGAG CAAGAA 
ACTGTTCCGA 
CGTCAAGCTC 
GACCCTGAAG 
AGGTAGCGAC 
GCCAGAGCTG 
GTTTGAAGAA 
TGATCATGCC 
GGAGGATGCT 
TTCCTTTTTT 
TTTTACAAGG 



21 

I 

GGAGGGAAGA 
GTAGCCGCCG 
CGCGCCCCCA 
ACAGACGAGT 
ATTAAAACAC 
TTTGCCTCTG 
CTGAAGCACA 
ATCTGTGCCA 
CGTGCCTGGG 
CTGGCCATCC 
TGCAGGAAAG 
GAAAAAGTGG 
GAGGAGAAGC 
TAAAAAATTT 
GACGTTATAT 



31 
I 

GCGGGCGGGC 
AGAGGCCGCG 
TGGCGGCCGC 
CCAACCATGA 
TGGAAGAAGA 
AGAACGATCT 
AGGAGAAAGG 
ACCACTACAT 
TCTGGAACAC 
GCTTCCTGAA 
AGATCGAAGA 
CGGAAAAGCT 
AATAAATCGT 
TACCCTGCCC 
AAAGAACTGA 



41 
I 

GGGAGGCGCC 
GAGCCAGCGA 
CAAGGACACT 
CCCTCAGTTT 
TGAAGAGGAA 
CCCAGAATGG 
GGCCATCCGC 
CACGCCGATG 
CCACGCTGAC 
TGCTGAGAAT 
GAGAGAAAAG 
AGAAGCTCTC 
CTTATTTTAT 
CTCTTTTTCG 
ACTC 



Seq ID NO: 134 Protein sequence: 
Protein Accession #: NP_002873 



11 

I 



21 



31 



41 



51 
I 

GGCGCCAGAC 
CGACCGACCC 
CATGAGGACC 
GAGCCAATAG 
CTTTTTAAAA 
AAGG AG CG AG 
CTCCTCATGC 
ATGGAGCTGA 
TTCGCCGACG 
GCACAGAAAT 
AAAGCAGGAT 
TCGGTGAAGG 
TTTCTTTTCC 
GTTTGTTTTT 



51 



MAAAKDTHED HDTSTENTDE SNHDPQFEPI VSLPEQEIKT LEEDEEELFK MRAKLFRFAS 

ENDLPEWKER GTGDVKIiLKH KEKGAIRLLM RRDKTLKICA NHYITPMMEL KPNAGSDRAW 

WNTHADFAD ECPKPEIiLAI RFLNAENAQK FKTKFEECRK EIEEREKKAG SGKNDHAEKV 

AEKLEALSVK EETKEDAEEK Q 

Seq ID NO: 135 DNA sequence 

Nucleic Acid Accession #: NM__000077.2 

Coding sequence: 277-742 



CCCAACCTGG 
TCCTCCGAGC 
GGATTTGAGG 
GGGCTGGCTG 
GGAGAGCAGG 
GCCTTCGGCT 
GCTGCTGGAG 
GGTCATGATG 



11 
I 

GGCGACTTCA 
ACTCGCTCAC 
GACAGGGTCG 
GTCACCAGAG 
CAGCGGGCGG 
GACTGGCTGG 
GCGGGGGCGC 
ATGGGCAGCG 



21 
I 

GGTGTGCCAC 
GGCGTCCCCT 
GAGGGGGCTC 
GGTGGGGCGG 
CGGGGAGCAG 
CCACGGCCGC 
TGCCCAACGC 
CCCGAGTGGC 



31 

! 

ATTCGCTAAG 
TGCCTGGAAA 
TTCCGCCAGC 
ACCGCGTGCG 
CATGGAGCCG 
GGCCCGGGGT 
ACCGAATAGT 
GGAGCTGCTG 



41 
I 

TGCTCGGAGT 
GATACCGCGG 
ACCGGAGGAA 
CTCGGCGGCT 
GCGGCGGGGA 
CGGGTAGAGG 
TACGGTCGGA 
CTGCTCCACG 



51 
I 

TAATAGCACC 
TCCCTCCAGA 
GAAAGAGGAG 
GCGGAGAGGG 
GCAGCATGGA 
AGGTGCGGGC 
GGCCGATCCA 
GCGCGGAGCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 



239 



WO 02/086443 

CAACTGCGCC GACCCCGCCA CTCTCACCCG 
CCTGGACACG CTGGTGGTGC TGCACCGGGC 
GGGCCGTCTG CCCGTGGACC TGGCTGAGGA 
GCGCGCGGCT GCGGGGGGCA CCAGAGGCAG 
TCCCTCAGAC ATCCCCGATT GAAAGAACCA 
GATCAT CAGT CACCGAAGGT CCTACAGGGC 
TTTCGTAGTT TTCATTTAGA AAATAGAGCT 
ATATGCCTTC CCCCACTACC GTAAATGTCC 
AAATGTAAAA AAGAAAAACA CCGCTTCTGC 
•AGCACTCACG CCCTAAGCGC ACATTCATGT 
GAAGCTGTCG ACTTCATGAC AAGCATTTTG 
CTTCT CTTGA GTCACACTGC TAGCAAATGG 
ATTTTCATTC ATT C ACT C 



ACCCGTGCAC GACGCTGCCC GGGAGGGCTT 
CGGGGCGCGG CTGGACGTGC GCGATGCCTG 
GCTGGGCCAT CGCGATGTCG CACGGTACCT 
TAACCATGCC CGCATAGATG CCGCGGAAGG 
GAGAGGCTCT GAGAAACCTC GGGAAACTTA 
CACAACTGCC CCCGCCACAA CCCACCCCGC 
TTTAAAAATG TCCTGCCTTT TAACGTAGAT 
ATTTATATCA TTTTTTATAT ATTCTTATAA 
CTTTTCACTG TGTTGGAGTT TTCTGGAGTG 
GGGCATTTCT TGCGAGCCTC GCAGCCTCCG 
TGAACTAGGG AAGCTCAGGG GGGTTACTGG 
CAGAACCAAA GCTCAAATAA AAATAAAATA 



Seq ID NO: 136 Protein sequence: 
Protein Accession #: NP_000068.1 

1 11 21 31 41 51 

1 t I I ! I 

MEPAAGSSME PSADWLATAA ARGRVEEVRA LLEAGALPNA PNSYGRRPIQ VMMMGSARVA 
ELLLIiHGAEP NCADPATLTR PVHDAAREGP LDTLWLHRA GARLDVRDAW GRLPVDLAEE 
LGHRDVARYIj RAAAGGTRGS nharidaaeg psdipd 



Seq ID NO: 137 DNA sequence 

Nucleic Acid Accession #: NM_058196.1 

Coding sequence: 104-421 



1 11 

I I 
TGTGTGGGGG tctgcttggc 
GCCCCCACCC TGGCTCTGAC 
CCGAGTGGCG gagctgctgc 
TCTCACCCGA CCCGTGCACG 
GCACCGGGCC GGGGCGCGGC 
GGCTGAGGAG CTGGGCCATC 
CAGAGGCAGT AACCATGCCC 
AAAGAACCAG AGAGGCTCTG 
CTACAGGGCC ACAACTGCCC 
AATAGAGCTT TTAAAAATGT 
TAAATGTCCA TTT AT AT CAT 
CGCTTCTGCC TTTTCACTGT 
CATTCATGTG GGCATTTCTT 
AGCATTTTGT GAACTAGGGA 
AGCAAATGGC AGAACCAAAG 



21 31 

I I 
GGTGAGGGGG CTCTACACAA 
CATTCTGTTC TCTCTGGCAG 
TGCTCCACGG CGCGGAGCCC 
ACGCTGCCCG GGAGGGCTTC 
TGGACGTGCG CGATGCCTGG 
GCGATGTCGC ACGGTACCTG 
G CAT AG AT G C CGCGGAAGGT 
AGAAACCTCG GGAAACTTAG 
CCGCCACAAC CCACCCCGCT 
CCTGCCTTTT AACGTAGATA 
TTTTTATATA TTCTTATAAA 
GTTGGAGTTT TCTGGAGTGA 
GCGAGCCTCG CAGCCTCCGG 
AGCTCAGGGG GGTTACTGGC 
CTCAAATAAA AATAAAATAA 



41 51 

I I 
GCTTCCTTTC CGTCATGCCG 
GTCATGATGA TGGGCAGCGC 
AACTGCG CCG ACCCCGCCAC 
CTGGACACGC TGGTGGTGCT 
GGCCGTCTGC CCGTGGACCT 
CGCGCGGCTG CGGGGGGCAC 
CCCTCAGACA TCCCCGATTG 
ATCATCAGTC ACCGAAGGTC 
TTCGTAGTTT TCATTTAGAA 
TAAGCCTTCC CCCACTACCG 
AATGTAAAAA AGAAAAACAC 
GCACTCACGC CCTAAGCGCA 
AAGCTGTCGA CTTCATGACA 
TTCTCTTGAG TCACACTGCT 
TTTTCATTCA TTCACTC 



Seq ID NO: 138 Protein sequence: 
Protein Accession #: NP_478103.1 

1 11 21 31 41 51 

I I I 1 I I 

MMMGSARVAE LLLLHGAEPN CADPATLTRP VHDAAREGFL DTLWLHRAG ARLDVRDAWG 
RLPVDLAEEL GHRDVARYLR AAAGGTRGSN HARIDAAEGP SDIPD 

Seq ID NO: 139 DNA sequence 

Nucleic Acid Accession #: NM_058197.1 

Coding sequence: 272-684 

1 11 21 31 41 51 

11)11) 
CCCAACCTGG GGCGACTTCA GGTGTGCCAC ATT CGCTAAG TGCTCGGAGT TAATAGCACC 
TCCTCCGAGC ACTCGCTCAC GGCGTCCCCT TGCCTGGAAA GATACCGCGG TCCCTCCAGA 
GGATTTGAGG GACAGGGTCG GAGGGGGCTC TTCCGCCAGC ACCGGAGGAA GAAAGAGGAG 
GGGCTGGCTG GTCACCAGAG GGTGGGGCGG ACCGCGTGCG CTCGGCGGCT GCGGAGAGGG 
GGAGAGCAGG CAGCGGGCGG CGGGGAGCAG CATGGAGCCG GCGGCGGGGA G CAGCATGGA 
GCCGGCGGCG GGGAGCAGCA TGGAGCCTTC GGCTGACTGG CTGGCCACGG CCGCGGCCCG 
GGGTCGGGTA GAGGAGGTGC GGGCGCTGCT GGAGGCGGGG GCGCTGCCCA ACGCACCGAA 
TAGTTACGGT CGGAGG CCGA TCCAGGTGGG TAGAAGGTCT GCAGCGGGAG CAGGGGATGG 
CGGGCGACTC TGGAGGACGA AGTTTGCAGG GGAATTGGAA TCAGGTAGCG CTTCGATTCT 
CCGGAAAAAG GGGAGGCTTC CTGGGGAGTT TTCAGAAGGG GTTTGTAATC ACAGACCTCC 
TCCTGGCGAC GCCCTGGGGG CTTGGGAAAC CAAGGAAGAG GAATGAGGAG CCACGCGCGT 
ACAGATCTCT CGAATGCTGA GAAGATCTGA AGGGGGGAAC ATATTTGTAT TAGATGGAAG 
TCATGATGAT GGGCAGCGCC CGAGTGGCGG AGCTGCTGCT GCTCCACGGC GCGGAGCCCA 
ACTGCGCCGA CCCCGCCACT CTCACCCGAC CCGTGCACGA CGCTGCCCGG GAGGGCTTCC 
TGGACACGCT GGTGGTGCTG CACCGGGCCG GGGCGCGGCT GGACGTGCGC GATGCCTGGG 
GCCGTCTGCC CGTGGACCTG GCTGAGGAGC TGGGCCATCG CGATGTCGCA CGGTACCTGC 
GCGCGGCTGC GGGGGGCACC AGAGGCAGTA ACCATGCCCG CATAGATGCC GCGGAAGGTC 
CCTCAGACAT CCCCGATTGA AAGAACCAGA GAGGCTCTGA GAAACCTCGG GAACTTAGAT 
CAT CAGT CAC CGAAGGT CCT ACAGGGCCAC AACTGCCCCC GCCACAACCC ACCCCGCTTT 
CGTAGTTTTC ATTTAGAAAA TAGAGCTTTT AAAAATGTCC TGCCTTTTAA CGTAGATATA 
TGCCTTCCCC CACTACCGTA AATGTCCATT TAT AT CAT T T TTTATATATT CTTATAAAAA 
TGTAAAAAAG AAAAACACCG CTTCTGCCTT TTCACTGTGT TGGAGTTTTC TGGAGTGAGC 
ACTCACGCCC TAAGCGCACA TTCATGTGGG CATTTCTTGC GAGCCTCGCA GCCTCCGGAA 
GCTGTCGACT TCATGACAAG CATTTTGTGA ACTAGGGAAG CTCAGGGGGG TTACTGGCTT 
CTCTTGAGTC ACACTG CTAG CAAATGGCAG AACCAAAGCT CAAATAAAAA TAAAATAATT 



240 



WO 02/086443 

TTCATTCATT CACTC 



PCT/US02/12476 



10 
15 
20 
25 
30 
35 
40 



Seg ID NO: 140 Protein sequence: 
Protein Accession #: NP_478104.1 

1 11 21 31 41 51 

I I I I I I 

MEPAAGSSME PAAGSSMEPS ADWLATAAAR GRVEEVRALli EAGALPNAPN SYGRRPIQVG 
RRSAAGAGDG GRLWRTKFAG ELESGSASIL RKKGRLPGEF SEGVCNHRPP PGDALGAWET 



Seg ID NO: 141 DNA sequence 

Nucleic Acid Accession #: NM_058195 . 1 

Coding sequence: 163-684 



CCTCCCTACG 
GATCTTGGAG 
GGGCGCGCTC 
GTGGGTCCCA 
GGCGGCGCAG 
TTGGTGACCC 
CACATCCCGC 
GTGCTGATGC 
CATGATGATG 
CTGCGCCGAC 
GGACACGCTG 
CCGTCTGCCC 
CGCGGCTGCG 
CTCAGACATC 
CATCAGTCAC 
CGTAGTTTTC 
TGCCTTCCCC 
TGTAAAAAAG 
ACTCACGCCC 
GCTGTCGACT 
CTCTTGAGTC 
TTCATTCATT 



11 
I 

GGCGCCTCCG 
GTCCGGGTGG 
AGGGAAGGCG 
GTCTGCAGTT 
CGGCTGCCGA 
TCCGGATTCG 
GGCTCACGGG 
TACTGAGGAG 
GGCAGCGCCC 
CCCGCCACTC 
GTGGTGCTGC 
GTGGACCTGG 
GGGGGCACCA 
CCCGATTGAA 
CGAAGGTCCT 
ATTTAGAAAA 
CACTACCGTA 
AAAAACACCG 
TAAGCGCACA 
TCATGACAAG 
ACACTGCTAG 
CACTC 



21 

1 

GCAGCCCTTC 
GAGTGGGGGT 
GGTGCGCGCC 
AAGGGGGCAG 
GCTCGGCCCT 
GCGCGCGTGC 
GGAGTGGGCA 
CCAGCGTCTA 
GAGTGGCGGA 
TCACCCGACC 
ACCGGGCCGG 
CTGAGGAGCT 
GAGGCAGTAA 
AGAACCAGAG 
ACAGGGCCAC 
TAGAGCTTTT 
AATGTCCATT 
CTTCTGCCTT 
TTCATGTGGG 
CATT TTGTGA 
CAAATGGCAG 



31 
I 

CCGCGTGCGC 
GGGGTGGGGG 
TGCGGGGCGG 
GAGTGGCGCT 
GGAGGCGGCG 
GGCCCGCCGC 
GCGCCAGGGG 
GGGCAGCAGC 
GCTGCTGCTG 
CGTGCACGAC 
GGCGCGGCTG 
GGGCCAT CGC 
CCATGCCCGC 
AGGCTCTGAG 
AACTGCCCCC 
AAAAATGTCC 
TATATCATTT 
TTCACTGTGT 
CATTTCTTGC 
ACTAGGGAAG 
AACCAAAGCT 



41 
1 

AGGGCTCAGA 
TGGGGGTGAA 
AGATGGGCAG 
GCTCACCTCT 
AGAACATGGT 
GAGTGAGGGT 
CGCCCGCCGC 
CGCTTCCTAG 
CTCCACGGCG 
GCTGCCCGGG 
GACGTGCGCG 
GATGTCGCAC 
ATAGATGCCG 
AAACCTCGGG 
GCCACAACCC 
TGCCTTTTAA 
TTTATATATT 
TGGAGTTTTC 
GAGCCTCGCA 
CTCAGGGGGG 
CAAATAAAAA 



51 
1 

GCCGTTCCGA 
GGTGGGGGGC 
GGGGCGGTGC 
GGTGCCAAAG 
GCGCAGGTTC 
TTTCGTGGTT 
TGTGGCCCTC 
AAGACCAGGT 
CGGAGCCCAA 
AGGGCTTCCT 
ATGCCTGGGG 
GGTACCTGCG 
CGGAAGGTCC 
AAACTTAGAT 
ACCCCGCTTT 
CGTAGATATA 
CTTATAAAAA 
TGGAGTGAGC 
GCCTCCGGAA 
TTACTGGCTT 
TAAAATAATT 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



Seg ID NO: 142 Protein sequence: 
Protein Accession #: NP_478102.1 



45 



50 



55 



60 



65 



70 



75 



80 



85 



1 11 21 31 41 51 

I I l I I I 

MGRGRCVGPS LQLRGQEWRC S PL VPKGGAA AAELGPGGGE NMVRRFLVTL RIRRACGPPR 
VRVFWHIPR LTGEWAAPGA PAAVALVLML LRSQRLGQQP LPRRPGHDDG QRPSGGAAAA 
PRRGAQLRRP RHSHPTRARR CPGGLPGHAG GAAPGRGAAG RARCLGPSAR GPG 

Seq ID NO: 143 DNA sequence 
Nucleic Acid Accession #: NM_018131 
Coding sequence: 412.. 1107 



60 
120 



GAAATTGCAC 
GATAAAGAGA 
AATGCTTATC 
GCCAGATATA 
AGGAGGGAGC 
TCTGCTGCAA 
CAGACTGTGG 
CAGCTGAAAG 
GTCTATGTAA 
GCTCATTCAC 
AAGCAGAAAT 
CAAACCATAA 
CAAAAAGAAG 
CATCTGGAAG 
ATTGCTAGGG 
CAGTCTCTTT 
GAACAACAGA 
GTGCAGCATC 
GTTGGAATCC 
AGGAGAGACT 
CAATGGAAGC 
CGATCTGCTT 
TTAAAAGATT 
CATTTTGCAT 
CTTTTAGGCT 
GGCTGCAATG 
AAAATACTTG 
GCAAAATATT 
ACTGTGTTCG 
ATTGTTAAGA 



11 
I 

ACTTAAAGAC 
GACAGAGACT 
AACTCACAGA 
GTACTACCGC 
AGGTGTTGAA 
CCTCACGAAT 
CTCCAAACTG 
ATGCTCTGGA 
AAGGACTTTT 
TCCCACAGCA 
GTTACAACGA 
CTCAGCTGAG 
TTCACAATTT 
ATGATAGGCA 
GAAAACTTGA 
ACACATCTCT 
TGCAGGCATG 
AATTGCATGT 
TTGAAACAGC 
GAAAACAGAG 
CTGGTGGAAT 
GTCCATGTGG 
CAATACTGTA 
AAAACTGCCT 
GCTGTGCATT 
ACAGAATGTG 
GTGAGGAAAA 
TTATGTTTCG 
CATTTTTTAT 
GGTGGTGATA 



21 

I 

ATCAGTGGAT 
TTTGGAGAAA 
GAAGGACAAA 
ATTGCTTGAA 
AGCCTTATCT 
TGCTGAACTT 
CTTCAACTCA 
GAAAAATCAG 
AGCAAAGATC 
GACAAAAAAG 
TCTCTTGGCA 
TTTTGAACTG 
AAAT CAGCTG 
TAAAACAGAG 
AGAAGAGAAG 
GCTAAAGCAG 
TACTTTAGAC 
AATTCTTAAG 
TT CATGAGTT 
AAAAAGTTGC 
GTCCCAAGTG 
AATACTGTTC 
TTTTCTGTTA 
ATCTACCTTT 
TCTCTTGGCA 
GTGAGCAGCG 
GATAGCTCAG 
GGGGTTTTGA 
CCAAGCACTT 
GATACTATTT 



31 
I 

GAAATCACAA 
ATTCGAGTCC 
GAAATACAGC 
CAGCTGGAAG 
GAAGAGAAAG 
GAAAGCAAAA 
T CAAT AAAT A 
CAGTGGCTCG 
TTTGAGTTGG 
CCTGAATCAG 
AGTGCAAAAA 
AGTGAATTTC 
TTGTATTCAC 
AAGATACAAA 
AAGAGATCCG 
CAAGAAGAAC 
TTTGAAAATG 
GAGCTCCGAA 
TGCCATCACA 
CGCCTCACCA 
CAATATACAG 
AAAGTAGCAA 
GCTTGTGGGC 
GACACTCCAG 
GTGATACCTC 
TCTACTGAGA 
GTTATTGCTA 
AAAATCAAAG 
AGAAAACCTA 
TTTTTTCATA 



41 

I 

GTGGGAAAGG 
TTGAGGCTGA 
GACTGAGAGA 
AGACAACGAG 
ACGTATTGAA 
CCAATACACT 
ATATTCATGA 
TGTATGATCA 
AAAAGAAAAC 
AAGGTTATCT 
AAGATCTTGA 
GAAGAAAATA 
AAAGAAGGGC 
AACTCAGGGA 
AAGAGCTCTT 
AAACAAGGGT 
AAAAACTCGA 
AAGCAAGAAA 
GAGCCATTAG 
AAAAGTCCCA 
TAT CCAG CCA 
AATAAGTATT 
ATTTTGAATT 
CATGCTAGTG 
CCTGACATGG 
TACTAACATT 
ATGGGTTAAT 
ATAATTAACC 
CAATCCTAAT 
TTGTATAGCG 



51 
I 

AAAGCTGACT 
GAAGGAGAAG 
CCAACTGAAG 
AGAAGGAGAA 
ACAACAGTTG 
CCGTTTATCA 
AATGGAAATA 
GCAGCGGGAA 
GGAAACAGCT 
TCAAGAAGAG 
GGTTGAACGA 
TGAAGAAACC 
AGATGTGCAA 
AGAGAATGAT 
ATCTCAGGTC 
AGCTCTGTTG 
CCGT CAACAT 
AAAT AA C ACA 
TCACTTTCCA 
CTGCTGCACT 
CTGAGCATCG 
TGTTTTGATA 
ATATATTTCA 
AATCATGTAT 
TTCATCATCA 
TTGCACTGTC 
GCACCAGCAA 
AAGGATCTTA 
TTTGATGTCC 
GTTATTAGAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 



241 



10 



WO 02/086443 

AAGTTGGGGA TTTTCTTGAT CTTTATTGCT GCTTACCATT GAAACTTAAC 
TCCCCAACTC TGTTCTGCGC ACGAAACAGT ATCTGTTTGA GGCATAATCT 
CACACAATGT TTTCTCTTAT GTTATCTGGC AGTAACTGTA ACTTGAATTA 
TTCTGCTTAG CTAAAATTGT TAAAATAAAC TTTAATAAAC CCATGTAGCC 
ATTGACAGTA TTTTAGTTAT TTTTGGCATT CTTAAAGCTG GGCAATGTAA 
TTTGTTTGTC TGAACAGGTA TTTTTATACA TGCTTTTTGT AAACCAAAAA 
TCTTCAGGTT TTCTAACATG CTTACCACTG GGCTACTGTA AATGAGAAAA 
ATTTAATGTT TT 



Seq ID NO: 144 Protein sequence: 
Protein Accession #: NP 060601 



CCAGCTGTGT 1860 

TAAGTGGCCA 1920 

CAT TAG CACA 1980 

CTCTCATTTG 2040 

TGATCAGATC 210 0 

CTTTTAAATT 2160 

GAATAAAATT 2220 



PCT/US02/12476 



15 1 11 21 31 41 51 

I 1 I I 1 1 

MEIQLKDALE KNQQVJLVYDQ QREVYVKGLL AKIFELEKKT ETAAHSLPQQ TKKPESEGYE 
QEEKQKCYND LLASAKKDLE VERQTITQLS FELSEFRRKY EETQKEVHNL NQLIiYSQRRA 
DVQHLEDDRH KTEKIQKLRE ENDIARGKLE EEKKRSEELL SQVQSLYTSL LKQQEEQTRV 
20 ALLEQQMQAC TLDFENEKLD RQHVQHQLHV ILKELRKARK NNTVGILETA S 



60 
120 
180 



25 



Seq ID NO: 145 DNA sequence 
Nucleic Acid Accession #: NM_001168 
Coding sequence: 50.. 478 



30 
35 
40 
45 
50 
55 



CCGCCAGATT 
GACGTTGCCC 
CTGGCCCTTC 
CCACTGCCCC 
GGAAGGCTGG 
CGCTTT CCTT 
GGACAGAGAA 
TGAGGAAACT 
CCTCTGGCCG 
GTGCCACCAG 
CAAATTAGAT 
TGCCTGTGCA 
GGGGGCTCAT 
AAGGCAGTGT 
GTGAATGTGT 
GGTGCCTGTT 
ACAGTTTTTT 
GTGATGAGAG 
TATTTTGTTT 
AGCCATT CTA 
AGTGATAGGA 
AGTGAGCCGC 
CTTTTTAAAT 
TCTGTCAGCC 
CCAGGTCCCC 
GATGGATTTG 
GCTGGAAACC 



11 
I 

TGAATCGCGG 
CCTGCCTGGC 
TTGGAGGGCT 
AC TG AG AACG 
GAGCCAGATG 
TCTGTCAAGA 
AGAGCCAAGA 
GCGAAGAAAG 
GAGCTGCCTG 
CCTTCCTGTG 
GTTTCAACTG 
GCGGGTGCTG 
TTTTGCTGTT 
CCCTTTTGCT 
CTGGACCTCA 
G AATCTGAGC ■ 
TGTTGTTGTG 
AATGGAGACA 
GAATTGTTAA 
AGT CATTGGG 
AGCGTCTGGC 
GGGGCACATG 
GACTTGGCTC 
CAACCTTCAC 
GCTTTCTTTG 
ATTCGCCCTC 
TCTGGAGGTC 



21 
I 

GACCCGTTGG 
AGCCCTTTCT 
GCGCCTGCAC 
AG CCAGACTT 
ACGACCCCAT 
AGCAGTTTGA 
ACAAAATTGC 
TGCGCCGTGC 
GTCCCAGAGT 
GGCCCCTTAG 
TGCTCCTGTT 
CTGGTAACAG 
TTGATTCCCG 
AGAGCTGACA 
TGTTGTTGAG 
TGCAGGTTCC 
TTTTTTTGTT 
GAGTCCCTGG 
TTCACAGAAT 
GAAACGGGGT 
AGATACTCCT 
CTGGCCGCTC 
GATGCTGTGG 
ATCTGTCACG 
GAGG CAGCAG 
CTCCCTGTCA 
ATCTCGGCTG 



31 

I 

CAGAGGTGGC 
CAAGGACCAC 
CCCGGAGCGG 
GGCCCAGTGT 
AGAGGAACAT 
AGAATTAACC 
AAAGGAAACC 
CAT CGAGCAG 
GGCTGCACCA 
CAATGTCTTA 
TTGTCTTGAA 
TGGCTGCTTC 
GGCTTACCAG 
GCTTTGTTCG 
GCTGTCACAG 
TTATCTGTCA 

CTCCTCTACT 
AGCACAAACT 
GAACTTCAGG 
TTTGCCACTG 
CTCCCTCAGA 
GGGACTGGCT 
TTCTCCACAC 
CTCCCGCAGG 
TAGAGCTGCA 
TTCCTGAGAA 



41 

I 

GGCGGCGGCA 
CGCATCTCTA 
ATGGCCGAGG 
TTCTTCTGCT 
AAAAAGCATT 
CTTGGTGAAT 
AACAATAAGA 
CTGGCTGCCA 
CTTCCAGGGT 
GGAAAGGAGA 
AGTGGCACCA 
TCTCTCTCTC 
GTGAGAAGTG 
CGTGGGCAGA 
TCCTGAGTGT 
CACCTGTGCC 
GGTAGATGCA 
GTTTAACAAC 
ACAATTAAAA 
TGGATGAGGA 
CTGTGTGATT 
AAAAGGCAGT 
GGGCTGCTGC 
GGGGGAGAGA 
GCTGAAGTCT 
GGGTGGATTG 
ATAAAAAGCC 



51 
i 

TGGGTGCCCC 
CATTCAAGAA 
CTGGCTTCAT 
TCAAGGAGCT 
CGTCCGGTTG 
TTTTGAAACT 
AGAAAGAATT 
TGGATTGAGG 
TTATTCCCTG 
TCAACATTTT 
GAGGTGCTTC 
TCTCTTTTTT 
AGGGAGGAAG 
GCCTTCCACA 
GGACTTGGCA 
TCCTCAGAGG 
TGACTTGTGT 
ATGGCTTTCT 
CTAAGCACAA 
GACAGAATAG 
AGACAGGCCC 
GGCCTAAATC 
AGGCCGTGTG 
CGCAGTCCGC 
GGCGTAAGAT 
TTACAGCTTC 
TGTCATTTC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



60 



65 



Seq ID NO: 146 Protein sequence: 
Protein Accession #: NP 001159 



1 11 21 31 41 51 

1 I I I 1 1 

MGAPTLPPAW QPFLKDHRIS TFKNWPFLEG CACTPERMAE AGFIHCPTEN EPDLAQCFFC 

FKELEGWEPD DDPIEEHKKH SSGCAFLSVK KQFEELTLGE FLKXDRERAK NK I AKETNNK 
KKEFEETAKK VRRAIEQLAA MD 



60 
120 



70 



75 



80 



85 



Seq ID NO: 147 DNA sequence 

Nucleic Acid Accession ft: NM_014176.1 

Coding sequence: 12 7-72 0 



1 
I 

GCGCGCAGCG 
AGTGCATCCC 
GGGATCATGC 
CCCCCAGGCA 
TTAGGTGGAG 
GAGAGGTACC 
ATTGATTCTG 
AGACCATCCC 
AACCCTGATG 
TTCCTCAAGA 
GAGGAAGAGA 
CAGAAAAGGA 
GGGACTTGTC 
ACCTTGAATT 
GTACATATGT 



11 
I 

CTGGTACCCC 
AGGCAGCTCT 
AGAGAGCTTC 
TCACATGTTG 
CCAACACACC 
CATTTGAACC 
CTGGAAGGAT 
TCAACATCGC 
ACCCGCTCAT 
ATGCCAGACA 
TGCTTGATAA 
AGGCCAGTCA 
CTGGTTCATC 
TTTTTTTAAA 
ATTTTGAAAT 



21 
I 

GTTGGTCCGC 
TAGTGTGGAG 
ACGTCTGAAG 
GCAAGATAAA 
TTATGAGAAA 
TCCTCAGATC 
TTGTCTGGAT 
AACTGTGTTG 
GGCTGACATA 
GTGGACAGAG 
TCTACCAGAG 
GCTAGTAGGC 
TTAGTTAATG 
TATATTTGAT 
CTTTTAAACC 



31 
I 

GCGTTGCTGC 
CAGTGAACTG 
AGAGAGCTGC 
GACCAAATGG 
GGTGTTTTTA 
CGATTTCTCA 
GTTCTCAAAT 
ACCTCTATTC 
TCCTCAGAAT 
AAGCATGCAA 
GCTGGTGACT 
ATAGAAAAGA 
TGTTCTTTGC 
GACATAATTT 
TGAAAAATAA 



41 
I 

GTTGTGAGGG 
TGTGTGGTTC 
ACATGTTAGC 
ATGACCTGCG 
AGCTAGAAGT 
CTCCAATTTA 
TGCCACCAAA 
AGCTGCTCAT 
TTAAATATAA 
GACAGAAACA 
CCAGAGTACA 
AATTTCATCC 
CAAGGTGATC 
TTGTGTAGTT 
ATAGTCATTT 



51 • 

I 

GTGTCAGCTC 
CTTCTACTTG 
CACAGAGCCA 
AGCTCAAATA 
TATCATTCCT 
TCATCCAAAC 
AGGTGCTTGG 
GTCAGAACCC 
TAAGCCAGCC 
AAAGGCTGAT 
CAACTCAACA 
TGATGTTTAG 
TAAGTTGCCT 
TATTTATCTT 
AATGTTGAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



242 



WO 02/086443 

AAAAAAAAAA AAAAAAAAAA AAAAAAAA 

Seq ID NO: 148 Protein sequence: 
Protein Accession #: NP 054895.1 



PCT/US02/12476 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



1 11 21 31 41 51 

I 1 1 I I 1 

MQRASRLKRE LHMLATEPPP GITCWQDKDQ MDDLRAQILG GANTPYEKGV FKLEVIIPER 
YPFEPPQIRF LTPIYHPNID SAGRICLDVL KLPPKGAWRP SLNIATVLTS IQLLMSEPNP 
DDPLMADISS EFKYNKPAFL KNARQWTEKH ARQKQKADEE EMLDNLPEAG DSRVHNSTQK 
RKASQLVGIE KKFHPDV 

Seq ID NO: 149 DNA sequence 
Nucleic Acid Accession #: NM_003812 
Coding sequence: 224-2722 



60 
120 
180 



TCCTCTGCGT 
GCCCAGCCCC 
CCATGCGCGC 
GCTCTCGCCG 
CAGCTCGCGG 
ACGCGGCCCC 
GCTTCTCGTC 
GGCTGCTGCG 
GGCAGATGAA 
AATGCAGAAA 
AAGCCCTTAT 
CCATCTGGCC 
CATACTGAAC 
ACCACAGTAC 
AGACT CCAAG 
CTTCGTGTAT 
ACATATAATC 
GGAAAGAGGT 
AG CAGTG AAT 
TAATGATCAC 
AAAGTCCGTG 
CCTGGTGGCT 
GCAGATGCTC 
GCACCTCATC 
TGTCTGTTCT 
ACAAGTATTA 
AAAGCCAAAA 
GTCCCATTCT 
AGGAGGTGGA 
AAATGGATAC 
ATTATGCTGT 
TAACAATACC 
GTGTGATATT 
GCAAGACGGA 
CAGAGACAAC 
CTATGAAAAG 
GTGGATTCAG 
TCGAGCTCCA 
AGGCCGGGTG 
CTATGTAGAA 
ACAAATTCAA 
GGGCCATGGG 
AGATTGCAGT 
GGGTCCTAGT 
TATTGTCCTT 
TACTCAGCAA 
ATTCTGGGTA 
CTTTGGGTGG 
CTGTCTCTTT 
GGGGGCAAAA 
ACGAAGGAAC 



11 
I 

CCCGCCCCGG 
GAGCCCCGCG 
CGAGCCGGCG 
GCCACACGGA 
CAGCCGCCCC 
GCCGGCTCGG 
CTTCTCCTGC 
CCCAGCGCTC 
GACAATACAT 
GAAATCACAC 
CACGTTCTTG 
CAGGCAAGCT 
AATGGTTTGT 
TCTAAGGGTG 
GTGGCTCTGT 
ATGATAGAGC 
CAGAAAACCT 
GACCAGTGGC 
CCATCACGTG 
AAAACGTATA 
GTCAACCTTG 
GTAGAGACCT 
CATGAGTTCT 
TCGCGGGTGA 
CG CACAAG AG 
TCGCAG AG CC 
TGTGACTGCA 
CGAAAATTTT 
GCCTGCCTTT 
GTGGAAGCTG 
AAGAAATGTT 
TCATGTCTTT 
ACTGAATATT 
TATGCATGCA 
CAGTGTCAGT 
CTGAATACAG 
TGCAGCAAAC 
CGTATTGGTC 
ATTGACTGCA 
GATGGAACGC 
GCCCTAAATA 
GTGTGTAGTA 
ATCCGGGATC 
GCCACCAATC 
GGGGGCACAG 
GGCCCCATCT 
TGACATACTC 
TAATGACTAC 
TGGAAATAAT 
GACCATGCTA 
AACACACACA 



21 

I 

GAGTGGCTGC 
CCCCGTGCCC 
TGACCGGCTC 
GCGGCGCCCG 
TGGCGGGCTG 
TGCCTGCCAG 
TGCCTCCGCT 
CGCATTGGAA 
TGCAACAGAA 
TGCCTTCAAG 
ACACAAAGGC 
TCCAGATTGA 
TGTCTTCTGA 
GAGAGCACTG 
CAACCTGCAA 
CACTAGAGCT 
TGGCAGGACA 
CCTTTCTCTC 
GTATATTTGA 
AG AAGCAT CG 
TGGATT CTAT 
GGACTGAGAA 
CAAAATACCG 
CATTTCACTA 
GAGTTGGTGT 
TGGCTCAAAA 
CAGAATCCTG 
CAAAGTGCAG 
TCAACAGGCC 
GGGAGGAGTG 
CCCTCTCCAA 
TTCAGCCACG 
GTACTGGAGA 
ATCAAAATCA 
ACATCTGGGG 
AAGGCACTGA 
ATGATGTGTT 
AACTTCAGGG 
GTGGTGCCCA 
CATGTGGCCC 
TGAGCAGCTG 
ATGAAGCCAC 
CAGTTAGGAA 
TCATAATAGG 
GCTGGGGATT 
GAATCAGCTG 
GCAGCAGTGT 
GGAGCTAAAG 
GTCAAAGAAC 
TAAAAAGAAC 
CAAAAATTAA 



31 
i 

GAGGCTAGGC 
CGAGCCCGGA 
CGCCCGCGGC 
GGAGCTATGA 
CAGCCTTGCC 
CGCCCCGGCC 
CGCCGCCTCG 
TGAAACTGCA 
TAGCAGCAGT 
ACTCATATAT 
AAGACACCAG 
AGCCTTCGGC 
TTATGTGGAG 
TTACTACCAT 
TGGACTTCAT 
GGTTCATGAT 
GTATT CTAAG 
TGAATTACAG 
AGAAATGAAA 
CTCTTCTCAT 
TTACAAGGAG 
GGATCAGATT 
GCAGCGCATT 
TAAGAGAAGC 
GAATGAGTAT 
CCTTGGAATC 
GGGTGGCTGC 
CATTTTGGAG 
AACAAAGCTA 
TGATTGTGGT 
CGGGGCTCAC 
AGGGTATGAA 
CTCTGGTCAG 
GGGCCGCTGC 
AACAAAGGCT 
GAAGGGAAAC 
CTGTGGATTC 
TGAGATCATT 
TGTAGTTTTA 
GTCTATGATG 
TCCACTCGAT 
CTGCATTTGT 
CCTTCACCCC 
CTCCATCGCT 
TAAAAATGTC 
CGCTGGATGG 
TACTGGAACT 
TTGGGGTGAC 
ACCTTTCACC 
TGTT CCAGAA 
ATGCAATAAA 



Seq ID NO: 150 Protein sequence: 
Protein Accession #: NP_003 803 



MKPPGSSSRQ 
RPRAWGAAAP 
INQDSESPYH 
HYENGKPQYS 
KSTGRPHIIQ 
LELMIVNDHK 
ITTNPVQMLH 
LPMAVAQVLS 
RDFLQRGGGA 
SDGPCCNNTS 
NGECKTRDNQ 



11 

1 

PPLAGCSLAG 
SAPHWNETAE 
VLDTKARHQQ 
KGGEHCYYHG 
KTLAGQYSKQ 
TYKKHRSSHA 
EFSKYRQRIK 
QSLAQNLGIQ 
CLFNRPTKLF 
CLFQPRGYEC 
CQYIWGTKAA 



21 
1 

ASCGPQRGPA 
KNLGVLADED 
KHNKAVHLAQ 
SIRGVKDSKV 
MKNLTMERGD 
HTNNFAKSW 
QHADAVHLIS 
WEPSSRKPKC 
EPTECGNGYV 
RD AVNE CD I T 
GSDKFCYEKL 



31 
I 

GSVPASAPAR 
NTLQQNSSSN 
ASFQIEAFGS 
ALSTCNGLHG 
QWPFLSELQW 
NLVDSIYKEQ 
RVTFHYKRSS 
DCTESWGGCI 
EAGEECDCGF 
EYCTGDSGQC 
NTEGTEKGNC 



41 
I 

GAGCCGGGAA 
GCCCCCTGCC 
CGCCCCGCAG 
GCCATGAAGC 
GGCGCTTCCT 
CGCACGCCGC 
TCCCGGCCCC 
GAAAAAAATT 
AATATCAGTT 
TACATCAACC 
CAAAAACATA 
TCCAAATTCA 
ATTCACTACG 
GGAAGCATCA 
GGCATGTTTG 
GAGAAAAGCA 
CAAATGAAGA 
TGGTTGAAAA 
TATTTGGAAC 
GCACATACCA 
CAGCTCAACA 
GACATCACCA 
AAGCAGCATG 
AGTCTGAGTT 
GGTCTTCCAA 
CAATGGGAAC 
ATCATGGAGG 
TATAGAGACT 
TTTGAGCCCA 
TTTCATGTGG 
TGCAGCGACG 
TGCCGGGATG 
TGCCCACCAA 
TACAATGGCG 
GCAGGGTCTG 
TGCGGGAAGG 
TTACTCTGTA 
CCAACTTCCT 
GATGATGATA 
TGTTTAGATC 
TCCAAGGGTA 
GATTTCACCT 
CCCAAGGATG 
GGTGCCATCC 
AAGAAGAGAA 
ACACCGCCTT 
ATTAAGTTTG 
AAGGATGGGG 
ACCTGTCAGT 
TCTTTTTTTT 
GGAATCATTA 



41 
I 

TPPCRLLLVL 
ISYSNAMQKE 
KFILDLILNN 
MFEDDTFVYM 
IiKRRKRAVNP 
LNTRWLVAV 
LSYFGGVCSR 
MEETGVSHSR 
HVECYGLCCK 
PPNLHKQDGY 
GKDGDRWIQC 



51 
I 

AGGGGGCGCC 
CGCGGCGGCA 
CTAGCCCGGC 
CGCCCGGCAG 
GCGGCCCCCA 
CCTGCCGCCT 
GCGCCTGGGG 
TGGGAGTCCT 
ACAGCAATGC 
AAGACTCGGA 
ATAAGGCTGT 
TTCTTGACCT 
AAAATGGGAA 
GAGGCGTCAA 
AAGATGATAC 
CAGGTCGACC 
ATCTCACTAT 
GAAGGAAGAG 
TTATGATTGT 
ACAACTTTGC 
CCAGGGTTGT 
CCAACCCTGT 
CTGATGCTGT 
ACTTTGGAGG 
TGGCAGTGGC 
CTTCTAGCAG 
AAACAGGGGT 
TTT TACAGAG 
CGGAATGTGG 
AATGCTATGG 
GGCCCTGCTG 
CTGTGAACGA 
ATCTTCATAA 
AGTGCAAGAC 
ACAAGTTCTG 
ATGGAGACCG 
CCAATCTTAC 
TCTACCATCA 
CGGATGTGGG 
GGAAGTGCCT 
AAGTCTGTTC 
GGGCAGGGAC 
AAGGACCCAA 
TGGTAGCAGC 
GGTTCGATCC 
GCACTGTTGG 
TAAACAAAAC 
TAAAAGAAAA 
AAACGGGGGA 
TCCCTAATGG 
AAAA 



51 
1 

LLLPPLAASS 
ITLPSRLIYY 
GLLSSDYVEI 
IEPLELVHDE 
SRGIFEEMKY 
ETWTEKDQID 
TRGVGVNEYG 
KFSKCSILEY 
KCSLSNGAHC 
ACNQNQGRCY 
SKHDVFCGFL 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



243 



WO 02/086443 

LCTNLTRAPR IGQLQGEIIP TSFYHQGRVI DCSGAHWLiD DDTDVGYVED GTPCGPSMMC 
LDRKCLQIQA LNMSSCPLDS KGKVCSGHGV CSNEATCICD FTWAGTDCSI RDPVRNLHPP 
KDEGPKGPSA TNLI IGSIAG AILVAAIVLG GTGWGFKNVK KRRFDPTQQG PI 

Seq ID NO: 151 DNA sequence 
Nucleic Acid Accession ft: NM_023915 
Coding sequence: 250-1326 

1 11 21 31 41 51 

I I I I 1 I 

GGCACGAGGG TTTCGTTTTC ATGCTTTACC AGAAAATCCA CTTCCCTGCC GACCTTAGTT 
TCAAAGCTTA TTCTTAATTA GAGACAAGAA ACCTGTTTCA ACTTGAAGAC ACCGTATGAG 
GTGAATGGAC AGCCAGCCAC CACAATGAAA GAAATCAAAC CAGGAATAAC CTATGCTGAA 
CCCACGCCTC AATCGTCCCC AAGTGTTTCC TGACACGCAT CTTTGCTTAC AGTGCATCAC 
AACTGAAGAA TGGGGTTCAA CTTGACGCTT GCAAAATTAC CAAATAACGA GCTGCACGGC 
CAAGAGAGTC ACAATTCAGG CAACAGGAGC GACGGGCCAG GAAAGAACAC CACCCTTCAC 
AATGAATTTG ACACAATTGT CTTGCCGGTG CTTTATCTCA TTATATTTGT GGCAAGCATC 
TTGCTGAATG GTTTAGCAGT GTGGATCTTC TTCCACATTA GGAATAAAAC CAGCTTCATA 
TTCTATCTCA AAAACATAGT GGTTGCAGAC CTCATAATGA CGCTGACATT TCCATTTCGA 
ATAGTCCATG ATGCAGGATT TGGACCTTGG TACTTCAAGT TTATTCTCTG CAGATACACT 
TCAGTTTTGT TTTATGCAAA CATGTATACT TCCAT CGTGT TCCTTGGGCT GATAAGCATT 
GATCGCTATC TGAAGGTGGT CAAGCCATTT GGGGACTCTC GGATGTACAG CATAACCTTC 
ACGAAGGTTT TATCTGTTTG TGTTTGGGTG ATCATGG CTG TTTTGTCTTT GCCAAACATC 
AT CCTGACAA ATGGTCAGCC AACAGAGGAC AATATCCATG ACTGCTCAAA ACTTAAAAGT 
CCTTTGGGGG TCAAATGGCA TACGGCAGTC ACCTATGTGA ACAGCTGCTT GTTTGTGGCC 
GTGCTGGTGA TTCTGATCGG ATGTTACATA GCCATATCCA GGTACATCCA CAAATCCAGC 
AGGCAATT CA TAAGTCAGTC AAGCCGAAAG CGAAAACATA ACCAGAGCAT CAGGGTTGTT 
GTGGCTGTGT TTTTTACCTG CTTTCTACCA TATCACTTGT GCAGAATTCC TTTTACTTTT 
AGTCACTTAG ACAGGCTTTT AGATGAATCT GCACAAAAAA TCCTATATTA CTGCAAAGAA 
ATTACACTTT TCTTGTCTGC GTGTAATGTT TGCCTGGATC CAATAATTTA CTTTTTCATG 
TGTAGGTCAT TTTCAAGAAG GCTGTTCAAA AAATCAAATA TCAGAACCAG GAGTGAAAGC 
ATCAGATCAC TGCAAAGTGT GAGAAGATCG GAAGTTCGCA TATATTATGA TTACACTGAT 
GTGTAGGCCT TTTATTGTTT GTTGGAATCG ATATGTACAA AGTGTAAATA AATGTTTCTT 
TTCATTATCC TTAAAAAAAA AA 



Seq ID NO: 152 Protein sequence: 
Protein Accession #: NP_0 76404 

1 11 21 31 41 51 

I I I I I I 

MGFNLTLAKL PNNELHGQES HNSGNRSDGP GKNTTLHNEF DTIVLPVLYL IIFVASIDLN 
GLAVWIFFHI RNKTSFIFYL KNIWADLIM TLTFPFRIVH DAGFGPWYFK FILCRYTSVL 
FYANMYTSIV FLGLISIDRY LKWKPFGDS RMYSITFTKV LSVCVWVIMA VLSLPNIILT 
NGQPTEDNIH DCSKLKSPLG VKWHTAVTYV NSCLFVAVLV IIiIGCYIAlS RYIHKSSRQF 
ISQSSRKRKH NQSIRVWAV FFTCFLPYHL CRIPFTFSHL DRLLDESAQK ILYYCKEITL 
FLSACNVCLD PIIYFFMCRS FSRRL.FKKSN IRTRSESIRS LQSVRRSEVR IYYDYTDV 

Seq ID NO: 153 DNA sequence 
Nucleic Acid Accession #: D80008.1 
Coding sequence: 149-739 

1 11 21 31 41 51 

1111)1 

GTTCGGCGCC AAAGCGCGGA GCGGAGGCCG AGGCGAGAGC CTGGCGCTGT AGGACTAGAA 
CGAAAGGAGT GAGGCGCCGA GAGCCCAGAT ACCATTTTGG CGTGAGAGCT GGTGGTTGGC 
AAGGCCGCGG GAGTGGGAAG CGTCCGCCAT GTTCTGCGAA AAAGCCATGG AACTGATCCG 
CGAGCTGCAT CGCGCGCCCG AAGGGCAACT GCCTGCCTTC AACGAGGATG GACTCAGACA 
AGTTCTGGAG GAGATGAAAG CTTTGTATGA ACAAAACCAG TCTGATGTGA ATGAAGCAAA 
GTCAGGTGGA CGAAGTGATT TGATACCAAC TATCAAATTT CGACACTGTT CTCTGTTAAG 
AAATCGACGC TGCACTGTAG CATACCTGTA TGACCGCTTG CTTCGGATCA GAGCACTCAG 
ATGGGAATAT GGTAGCGTCT TGCCAAATGC ATTACGATTT CACATGGCTG CTGAAGAAAT 
GGAGTGGTTT AATAATTATA AAAGATCTCT TGCTACTTAT ATGAGGTCAC TGGGAGGAGA 
TGAAGGTTTG GACATTACAC AGGATATGAA ACCACCAAAA AGCCTATATA TTGAAGTCCG 
GTGTCTAAAA GACTATGGAG AATTTGAAGT TGATGATGGC ACTTCAGTCC TATTAAAAAA 
AAATAGCCAG CACTTTTTAC CTCGATGGAA ATGTGAGCAG CTGATCAGAC AAGGAGTCCT 
GGAGCACATC CTGTCATGAC CATGCGCCGA GGCACTTCCA GGCTTCACTC AACTCATGGA 
CTCCTCTGTA CTCACTCTCT CCACCACTCC CTTCACCTCC CTCTTTGATT TTAGAAGCTA 
TAGACATTGT TTAAGATAAC TAAGAATACT TGGCTAAGAA GTATAATTTG CTAACTATTA 
AGGACTTTCT TTTTTTAATG TTGTACACTA TTCTTCCTAC TCTTTTTTGG TTTTGGTTTT 
GTTTTGTAGA GACTGTCTCA CTATGTTGCC CAAGCTGGTC TCAAACTCCT GGCCTCAAGC 
AGTCCTCCCA CCTTAGCTTC T CAAAGTGTT GAGATCACAG GCGTGAGCCA CTGCACCCGG 
CCCCTACTCC TTTTTCTAAT AAGCTGTATC TGTAATCACA GCATTCCTAC AGTTGTTACA 
GTGTGTTTTT TAAATGAAAG TAAACATGGT TACATTTGAA TCTCTTAAAT AAGCAGTCAC 
TTGGCTGGAC AGGAAGAAGG TAG AT CCTGT GTGTCTTGTT TTCTGGTCAT GTGTATTGTA 
CAAGCTAGAG AGCTGAATTT C T G AG AT ACA CATTTTCAAA TCACATGCAA GTGAAGATGA 
TGGTCTGTAG AAATTTTCAG TATATATAAT GTTTAATGAC ATACTAATTT ATCATCTGGC 
TATTTGGGAA GGAAGGACAC ACATGGATTT TGCACATTTC CACCATGGTG GCTGGTGTGG 
CTTGTGGCTA TGGGGTGATC ACCAGTATCA CCACTTTGGA AGGGGACAGT GAAATTGGGG 
CTAGAGAAGG AACTTTGTAC AGTTTTCCCT GAGATTCAGA TTGACTGAAA AGTCACATGA 
AGAGTTGATT GTCTTTTAAT GGTATGTTTT AAACAGCTGA CATTTTAAAT TTTGATGAAA 
TCCAGTTTAT TCGTTTGTTC TTTTATGCTT TGGGTGTTGC ATCCGAGAAA TCTTTTCCCA 
TCCCAAGATC ACAATTTTTT TTCCTTTTTA CTTCTAGAAG TGTTATAATT TTAAGCTTTA 
TACTTTGGTC TATGACCCGT TTTTTTTTTT GTTTTGTTTT GTTTTTTCGT TTGTTTCTTT 
GTTTTGAGAT GGAGTCTTGT TCTGTCACCC AGGCTGGGGT GCAGTGGCGT GATCTTGGCT 
CACTGCAATC TCTATCCCCT GGGTTCAAGT GATTCTCTTG TCTCAGCCTC CCAAGTAGCT 
GGGATTACAG GCACAGGCCG CCACGCCTGG CTAATTTTTG TATTTTTAGT AGAGACAGAG 



244 



WO 02/086443 

TTTTACCATG TTGGCCAGGC TGGTTTCAAA CTCCTGACCT CAAGTGACCC ACCTTGGCCT 2040 

CCCAAAGTTT TGGGATTACA AGTGTGGGCC ACCGCGGCCA GCCTATGATC CATTTTGAAT 2100 

GAATTTTTTA TATGGTGCAA GGTGTCAATC CACCTTCACT TTTTCTTGGG AATATAGATA 2160 

TCCAGCTGTT TCACTACCAT TTTTTGAAAG GACTGCCCTT TGCTCTATCA CCTTTGCATT 2220 

TTTGTTAAAA AGTAGTTGTC AATGTATATG TGGGTTTATT TCAGGACTCT GTTTTGTTCC 22 80 

ATTGACCTGT TTTTCTCTCC TGAATGCCAA TACCATATTT GTATGTAGTG TATGTAATTT 2340 

TCTAATAATT CTTGAAACAG ATAGTATTAA TGTGT CAT AT TTTTGCTGTT GTTTGTATTT 2400 

TTTGTAGAGA TGGGGTTTCA CCGTGTTGGC CAGGCTGTGT TGAACTCCTG AGCTAAAGCA 2460 

ATACACTTGC CTCGTCCTCC CCATGTGCTG GGATTACAGG CGTGAGCCTT GGTGCTGGCC 2520 

CAGTGTACCA CATTTCTTTT TGAGATTTGT TTTGGCTATG TTAAGTCCTT TGCTTTTGAT 2580 

GTGAAATTTG GGAACAGGCA GGGTGTGGTG GCTTATGCCT GTAATCCTAG AACTTTGGGA 2640 

GGCCTAGATG GGTGGATCAC TTGAG CTCAG GAGTTCCAGA CCAGCCCGGG CCTATGGCAA 2700 

AACTCCGTCT CTACAAAAAA TAGAAAAAAT TAGCCAGGTG TGGTGGTGCA TGCCTGTAGT 2760 

CACAGTTACA CGGCAGGCTG AGGTGGGAGG AT CACTTGAA CCCCAGAGGT CAAGACTGCA 2820 

GTGAGCTGAG ATCACACCAC TGT ACT CC AG CCTGGGTGAC AAAGTGAGAC TCTATCTCAA 2880 

AAAGAAATTA GGATCAATTT GTCAATTTCT ACAACAACAA CAACAAAAAC CCCTGTTGGG 2940 

CACCTTGATT GAGATTGCAT TGAATTTATA TAAAACTGTT GGGAGAATTG ACATCTTAAT 3000 

AATATTGAGT CTTCTGGCCT ATAAACAAGG TCTGTCTTCC TAGGTATTAA TGTTTTGTCT 3060 

TCTATTTCTC TTAATAATCT TTTGTAGTTT TCAGTGTACA GGTCTACCAT GTCAGCATTT 312 0 

CATAGTTTTG ATGCTAAATG GTATTTTAAA ATTTCAAATT CTAACCACTT GTTG CTAGTA 3180 

AATAGAAATA CAATTGATGT TGAACTTGTA TCCTTCAGCC TTGCTAAACT GTGAGTTCTC 3240 

ATGGTGTTTT TGTAAATTAC ATCAACAGTC ATGTGTTCTA TGAATAAAGA GTTTTACTCC 3300 
TTC 

Seq ID NO: 154 Protein sequence: 
Protein Accession #: BAA11503.1 

1 11 21 31 

I I I I 

MFCEKAMELI RELHRAPEGQ LPAFNEDGLR QVLEEMKALY 
TIKFRHCSLIi RNRRCTVAYL YDRLLRIRAL RWEYGSVLPN 
LATYMRSLGG DEGLDITQDM KPPKSLYIEV RCLKDYGEFE 
KCEQLIRQGV LEHILS 

Seq ID NO: 155 DNA sequence 
Nucleic Acid Accession #: Eos sequence 
Coding sequence: 149-709 

1 11 21 31 41 51 

! 1 i i I 1 

GTTCGGCGCC AAAGCGCGGA GCGGAGGCCG AGGCGAGAGC CTGGCGCTGT AGGACTAGAA 60 

CGAAAGGAGT GAGGCGCCGA GAGCCCAGAT ACCATTTTGG CGTGAGAGCT GGTGGTTGGC 12 0 

AAGGCCGCGG GAGT GGGAAG CGTCCGCCAT GTTCTGCGAA AAAGCCATGG AACTGATCCG 180 

CGAGCTGCAT CGCGCGCCCG AAGGGCAACT GCCTGCCTTC AACGAGGATG GACTCAGACA 240 

AGTT CTGGAG GAGATGAAAG CTTTGTATGA ACAAAACCAG TCTGATGTGA ATGAAGCAAA 300 

GTCAGGTGGA CGAAGTGATT TGATACCAAC TATCAAATTT CGACACTGTT CTCTGTTAAG 360 

AAATCGACGC TGCACTGTAG CATACCTGTA TGACCGCTTG CTTCGGATCA GAGCACT CAG 42 0 

ATGGGAATAT GGTAGCGTCT TGCCAAATGC ATTACGATTT CACATGGCTG CTGAAGAAAT 480 

GGAGTGGTTT AATAATTATA AAAGATCTCT TGCTACTTAT ATGAGGTCAC TGGGAGGAGA 540 

TGAAGGTTTG GACATTACAC AGGATATGAA ACCACCAAAA AGCCTATATA TTGAAGCTGG 600 

ATGCAGTGGC GCGATCTCGG CTCAACCTGC AACCTCCACC TCCCAGGTTC ACCTCAACTG 660 

CAACCTCCAC CTCCCAGGTC CGGTGTCTAA AAGACTATGG AGAATTTGAA GTTGATGATG 72 0 

GCACTTCAGT CCTATTAAAA AAAAATAGCC AGCACTTTTT ACCTCGATGG AAATGTGAGC 780 

AGCTGATCAG ACAAGGAGTC CTGGAGCACA TCCTGTCATG ACCATGCGCC GAGGCACTTC 840 

CAGGCTTCAC TCAACTCATG GACTCCTCTG TACTCACTCT CTCCACCACT CCCTTCACCT 900 

CCCTCTTTGA TTTTAGAAGC TATAGACATT GTTTAAGATA ACTAAGAATA CTTGGCTAAG 960 

AAGTATAATT TGCTAACTAT TAAGGACTTT CTTTTTTTAA TGTTGTACAC TATTCTTCCT 102 0 

ACTCTTTTTT GGTTTTGGTT TTGTTTTGTA GAGACTGTCT CACTATGTTG CCCAAGCTGG 10 80 

TCTCAAACTC CTGGCCTCAA GCAGTCCTCC CACCTTAGCT TCTCAAAGTG T TG AG AT C AC 1140 

AGGCGTGAGC CACTGCACCC GGCCCCTACT CCTTTTTCTA ATAAGCTGTA TCTGTAATCA 12 0 0 

CAGCATTCCT ACAGTTGTTA CAGTGTGTTT TTTAAATGAA AGTAAACATG GTTACATTTG 12 60 

AATCTCTTAA AT AAGCAGT C ACTTGG CTGG ACAGGAAGAA GGTAGATCCT GTGTGTCTTG 132 0 

TTTTCTGGTC ATGTGTATTG TACAAGCTAG AGAGCTGAAT TTCTGAGATA CACATTTTCA 13 80 

AAT CACATGC AAGTGAAGAT GATGGTCTGT AGAAATTTTC AGTATATATA ATGTTTAATG 1440 

ACATACTAAT T T ATCATCTG GCTATTTGGG AAGGAAGGAC ACACATGGAT TTTGCACATT 150 0 

TCCACCATGG TGGCTGGTGT GGCTTGTGGC TATGGGGTGA TCACCAGTAT CACCACT TTG 1560 

GAAGGGGACA GTGAAATTGG GGCTAGAGAA GGAACTTTGT ACAGTTTTCC CTGAGATT CA 162 0 

GATTGACTGA AAAGTCACAT GAAGAGTTGA TTGTCTTTTA ATGGTATGTT TTAAACAGCT 168 0 

GACATTTTAA ATTTTGATGA AATCCAGTTT ATTCGTTTGT TCTTTTATGC TTTGGGTGTT 1740 

GCATCCGAGA AATCTTTTCC CATCCCAAGA TCACAATTTT TTTTCCTTTT TACTTCTAGA 180 0 

AGTGTTATAA TTTTAAGCTT TATACTTTGG TCTATGACCC GTTTTTTTTT TTGTTTTGTT 1860 

TTGTTTTTTC GTTTGTTT CT TTGTTTTGAG ATGGAGTCTT GTTCTGTCAC CCAGGCTGGG 192 0 

GTGCAGTGGC GTGAT CTTGG CTCACTGCAA TCTCTATCCC CTGGGTTCAA GTGATTCTCT 1980 

TGTCTCAGCC TCCCAAGTAG CTGGGATTAC AGGCACAGGC CGCCACGCCT GGCTAATTTT 2 040 

TGTATTTTTA GTAGAGACAG AGTTTTACCA TGTTGGCCAG GCTGGTTTCA AACTCCTGAC 2100 

CTCAAGTGAC CCACCTTGGC CTCCCAAAGT TTTGGGATTA CAAGTGTGGG CCACCGCGGC 2160 

CAGCCTATGA TCCATTTTGA ATGAATTTTT TATATGGTGC AAGGTGTCAA TCCACCTTCA 222 0 

CTTTTTCTTG GGAATATAGA TATCCAGCTG TTTCACTACC ATTTTTTGAA AGGACTGCCC 2280 

TTTGCTCTAT CACCTTTGCA TTTTTGTTAA AAAGTAGTTG TCAATGTATA TGTGGGTTTA 2340 

TTTCAGGACT CTGTTTTGTT CCATTGACCT GTTTTTCTCT CCTGAATGCC AATACCATAT 2400 

TTGTATGTAG TGTATGTAAT TTTCTAATAA TTCTTGAAAC AGATAGTATT AATGTGTCAT 2460 

ATTTTTGCTG TTGTTTGTAT TTTTTGTAGA GATGGGGTTT CACCGTGTTG GCCAGGCTGT 2520 

GTTGAACTCC TGAGCTAAAG CAATACACTT GCCTCGTCCT CCCCATGTGC TGGGATTACA 2580 

GGCGTGAGCC TTGGTGCTGG CCCAGTGTAC CACATTTCTT TTTGAGATTT GTTTTGGCTA 2 640 

TGTTAAGTCC TTTGCTTTTG ATGTGAAATT TGGGAACAGG CAGGGTGTGG TGGCTTATGC 2 70 0 

CTGTAATCCT AGAACTTTGG GAGGCCTAGA TGGGTGGATC ACTTGAGCTC AGGAGTT CCA 2760 

GACCAGCCCG GGCCTATGGC AAAACTCCGT CTCTACAAAA AATAGAAAAA ATTAGCCAGG 2 820 



41 51 
I I 

EQNQSDVNEA KSGGRSDLIP 60 
ALRFHMAAEE MEWFNNYKRS 120 
VDDGTSVLLK KNSQHFLPRW 180 
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WO 02/086443 

TGTGGTGGTG CATGCCTGTA GTCACAGTTA CACGGCAGGC TGAGGTGGGA GGATCACTTG 2880 

AACCCCAGAG GTCAAGACTG CAGTGAGCTG AGATCACACC ACTGTACTCC AG CCTGGGTG 2 940 

ACAAAGTGAG ACTCTATCTC AAAAAGAAAT TAGGATCAAT TTGTCAATTT CTACAACAAC 3 000 

AACAACAAAA ACCCCTGTTG GGCACCTTGA TTGAGATTGC ATTGAATTTA TATAAAACTG 3060 

TTGGGAGAAT TGACATCTTA ATAATATTGA GTCTTCTGGC CTATAAACAA GGTCTGTCTT 3120 

CCTAGGTATT AATGTTTTGT CTTCTATTTC TCTTAATAAT CTTTTGTAGT TTTCAGTGTA 318 0 

CAGGTCTACC ATGTCAGCAT TTCATAGTTT TGATGCTAAA TGGTATTTTA AAATTTCAAA 3240 

TTCTAACCAC TTGTTGCTAG TAAATAGAAA TACAATTGAT GTTGAACTTG TATCCTTCAG 3300 

CCTTGCTAAA CTGTGAGTTC TCATGGTGTT TTTGTAAATT ACATCAACAG TCATGTGTTC 3360 
TATGAATAAA GAGTTTTACT CCTTC 

Seq ID NO: 156 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I i I I I I 

MFCEKAMELI RELHRAPEGQ LPAFNEDGLR QVLEEMKALY EQNQSDVNEA KSGGRSDLIP 60 

TIKFRHCSLL RNRRCTVAYL YDRLLRIRAL RWEYGSVLPN ALRFHMAAEE MEWFNNYKRS 12 0 

LATYMRSLGG DEGLDITQDM KPPKSLYIEA GCSGAISAQP ATSTSQVHLN CNLHLPGPVS 180 
KRLWRI 

Seq ID NO: 157 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 148-621 

1 11 21 31 41 51 

I I I 1 I I 

TTCGGCGCCA AAGCGCGGAG CGGAGGCCGA GGCGAGAGCC TGGCGCTGTA GGACTAGAAC * 60 

GAAAGGAGTG AGGCGCCGAG AGCCCAGATA CCATTTTGGC GTGAGAGCTG GTGGTTGGCA 120 

AGGCCGCGGG AGTGGGAAGC GTCCGCCATG TTCTGCGAAA AAGCCATGGA ACTGATCCGC 180 

GAGCTGCATC GCGCGCCCGA AGGGCAACTG CCTGCCTTCA ACGAGGATGG ACTCAGACAA 240 

GTTCTGGAGG AGATGAAAGC TTTGTATGAA CAAAACCAGT CTGATGTGAA TGAAGCAAAG 300 

TCAGGTGGAC GAAGTGATTT GATACCAACT ATCAAATTTC GACACTGTTC TCTGTTAAGA 3 60 

AATCGACGCT GCACTGTAGC ATACCTGTAT GACCGCTTGC TTCGGATCAG AGCACTCAGA 420 

TGGGAATATG GTAGCGTCTT GCCAAATGCA TTACGATTTC ACATGGCTGC TGAAGAAGTC 480 

CGGTGT CTAA AAGACTATGG AGAATTTGAA GTTGATGATG GCACTTCAGT CCTATTAAAA 540 

AAAAATAGCC AGCACTTTTT ACCTCGATGG AAATGTGAGC AGCTGAT CAG AC AAGGAGT C 600 

CTGGAGCACA TCCTGTCATG ACCATGCGCC GAGGCACTTC CAGGCTTCAC TCAACTCATG 660 

GACTCCTCTG TACTCACTCT CTCCACCACT CCCTTCACCT CCCTCTTTGA TTTTAGAAGC 72 0 

TATAGACATT GTTTAAGATA ACTAAGAATA CTTGGCTAAG AAGTATAATT TGCTAACTAT 780 

TAAGGACTTT CTTTTTTTAA TGTTGTACAC TATTCTTCCT ACTCTTTTTT GGTTTTGGTT 840 

TTGTTTTGTA GAGACTGTCT CACTATGTTG CCCAAGCTGG TCTCAAACTC CTGGCCTCAA 900 

GCAGTCCTCC CACCTTAGCT TCTCAAAGTG TTGAGATCAC AGGCGTGAGC CACTGCACCC 960 

GGCCCCTACT CCTTTTTCTA ATAAGCTGTA TCTGTAATCA CAGCATTCCT ACAGTTGTTA 102 0 

CAGTGTGTTT TTTAAATGAA AGTAAACATG GTTACATTTG AATCTCTTAA ATAAGCAGTC 10 80 

ACTTGGCTGG ACAGGAAGAA GGTAGATCCT GTGTGTCTTG TTTTCTGGTC ATGTGTATTG 1140 

TACAAGCTAG AGAGCTGAAT TTCTGAGATA CACATTTTCA AAT CACATGC AAGTGAAGAT 12 00 

GATGGTCTGT AGAAATTTTC AGTATATATA ATGTTTAATG ACATACTAAT TTATCATCTG 12 60 

GCTATTTGGG AAGGAAGGAC ACACATGGAT TTTGCACATT TCCACCATGG TGGCTGGTGT 13 2 0 

GGCTTGTGGC TATGGGGTGA TCACCAGTAT CACCACTTTG GAAGGGGACA GTGAAATTGG 13 80 

GGCTAGAGAA GGAACTTTGT ACAGTTTTCC CTGAGATTCA GATTGACTGA AAAGTCACAT 1440 

GAAGAGTTGA TTGTCTTTTA ATGGTATGTT TTAAACAGCT GACATTTTAA ATTTTGATGA 150 0 

AAT CCAGTTT ATTCGTTTGT TCTTTTATGC TTTGGGTGTT GCATCCGAGA AATCTTTTCC 1560 

CATCCCAAGA TCACAATTTT TTTTCCTTTT TACTTCTAGA AGTGTTATAA TTTTAAGCTT 162 0 

TATACTTTGG TCTATGACCC GTTTTTTTTT TTGTTTTGTT TTGTTTTTTC GTTTGTTTCT 16 80 

TTGTTTTGAG ATGGAGTCTT GTTCTGTCAC CCAGGCTGGG GTGCAGTGGC GTGATCTTGG 1740 

CTCACTGCAA TCTCTATCCC CTGGGTTCAA GTGATTCTCT TGTCTCAGCC TCCCAAGTAG 180 0 

CTGGGATTAC AGGCACAGGC CGCCACGCCT GGCTAATTTT TGTATTTTTA GTAGAGACAG 18 60 

AGTTTTACCA TGTTGGCCAG GCTGGTTTCA AACTCCTGAC CTCAAGTGAC CCACCTTGGC 192 0 

CTCCCAAAGT TTTGGGATTA CAAGTGTGGG CCACCGCGGC CAGCCTATGA TCCATTTTGA 1980 

ATGAATTTTT TATATGGTGC AAGGTGTCAA TCCACCTTCA CTTTTTCTTG GGAATATAGA 2040 

TATCCAGCTG TTTCACTACC ATTTTTTGAA AGGACTGCCC TTTGCTCTAT CACCTTTGCA 2100 

TTTTTGTTAA AAAGTAGTTG TCAATGTATA TGTGGGTTTA TTTCAGGACT CTGTTTTGTT 2160 

CCATTGACCT GTTTTTCTCT CCTGAATGCC AATACCATAT TTGTATGTAG TGTATGTAAT 222 0 

TTTCTAATAA TTCTTGAAAC AGATAGTATT AATGTGT CAT ATTTTTGCTG TTGTTTGTAT 2280 

TTTTTGTAGA GATGGGGTTT CACCGTGTTG GCCAGGCTGT GTTGAACTCC TGAGCTAAAG 2340 

CAATACACTT GCCTCGTCCT CCCCATGTGC TGGGATTACA GGCGTGAGCC TTGGTGCTGG 240 0 

CCCAGTGTAC CACATTTCTT TTTGAGATTT GTTTTGGCTA TGTTAAGTCC TTTGCTTTTG 2460 

ATGTGAAATT TGGGAACAGG CAGGGTGTGG TGGCTTATGC CTGTAATCCT AGAACTTTGG 2520 

GAGGCCTAGA TGGGTGGATC ACTTGAGCTC AGGAGTTCCA GACCAGCCCG GGCCTATGGC 2580 

AAAACTCCGT CTCTACAAAA AATAGAAAAA ATTAGCCAGG TGTGGTGGTG CATGCCTGTA 2640 

GTCACAGTTA CACGGCAGGC TGAGGTGGGA GGATCACTTG AACCCCAGAG GTCAAGACTG 270 0 

CAGTGAGCTG AGATCACACC ACTGTACTCC AGCCTGGGTG ACAAAGTGAG ACTCTATCTC 2760 

AAAAAGAAAT TAGGATCAAT TTGTCAATTT CTACAACAAC AACAACAAAA ACCCCTGTTG 282 0 

GGCACCTTGA TTGAGATTGC ATTGAATTTA TATAAAACTG TTGGGAGAAT TGACATCTTA 28 8 0 

ATAATATTGA GTCTTCTGGC CTATAAACAA GGTCTGTCTT CCTAGGTATT AATGTTTTGT 2 940 

CTTCTATTTC TCTTAATAAT CTTTTGTAGT TTTCAGTGTA CAGGTCTACC ATGTCAGCAT 3 000 

TTCATAGTTT TGATGCTAAA TGGTATTTTA AAATTTCAAA TTCTAACCAC TTGTTGCTAG 30 60 

TAAATAGAAA TACAATTGAT GTTGAACTTG TATCCTTCAG CCTTGCTAAA CTGTGAGTTC 312 0 

TCATGGTGTT TTTGTAAATT ACATCAACAG TCATGTGTTC TATGAATAAA GAGTTTTACT 3180 
CCTTC 

Seq ID NO: 158 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I i I 
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MFCEKAMELI RELHRAPEGQ LPAFNEDGLR QVLEEMKALY EQNQSDVNEA KSGGRSDLIP 

TIKFRHCSLL RNRRCTVAYL YDRLLRIRAL RWEYGSVLPN ALRFHMAAEE VRCLKDYGEF 

EVDDGTSVLL KKNSQHFLPR WKCEQLIRQG VLEHILS 

Seq ID NO: 159 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 149-229 



PCT/US02/12476 



GTTCGGCGCC 
CGAAAGGAGT 
AAGGCCGCGG 
CGAGCTGCAT 
GGCACACACC 
GGAAGTTGAA 
TGAATCCCTG 
AAGCTTTGTA 
GTAGCATACC 



11 
I 

AAAG CGCGGA 
GAGGCGCCGA 
GAGTGGGAAG 
CGCGCGCCCG 
TGTAGTCCCA 
ACTGCAGTGA 
TCTCAAAAAG 
TGAACAAAAC 
TGTATGACCG 



21 

i 

GCGGAGGCCG 
GAGCCCAGAT 
CGTCCGCCAT 
AAGGGCAACT 
GCAACTTAGG 
ACTGTGGTCA 
GAAAAGGAGG 
CAGTCTGATG 
CTTGCTTCGG 



31 
I 

AGG CG AG AG C 
ACCATTTTGG 
GTTCTGCGAA 
GCCTGCCTTC 
AGGCTGAAGT 
CGCTATTACA 
ATGGACT CAG 
TGTTCTCTGT 
ATCAGAGCAC 



41 
I 

CTGGCGCTGT 
CGTGAGAGCT 
AAAGCCATGG 
AACAATTAGC 
GAGAGGATTG 
CTCCAGCCTG 
ACAAGTTCTG 
TAAGAAATCG 
TCAGATGG 



51 
I 

AGGACTAGAA 
GGTGGTTGGC 
AACTGATCCG 
TGGGTGTGGT 
CATGGCTCCA 
GGTGACAGAC 
GAGGAGATGA 
ACGCTGCACT 



Seq ID NO: 160 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

l l I l l l 

ATGTTCTGCG AAAAAGCCAT GGAACTGATC CGCGAGCTGC ATCGCGCGCC CGAAGGGCAA 
CTGCCTGCCT TCAACAATTA G 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 



60 



30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



Seq ID NO: 161 DNA sequence 
Nucleic Acid Accession #: U10694 
Coding sequence: 1333-2280 



GGAT CCGGCC 
ACAGGGAGGC 
AGGAACCTAA 
TTTGCCCTGC 
CCTTTATCCT 
CACGTCAGCA 
CCCACTCACC 
CCGCTGGGTG 
AGGTCAACAG 
CTCACCCCAG 
AACCCTGGGC 
ATGTGAGCTC 
CAGGAGAAAG 
AGAACTCAAG 
CAGTCTGCAG 
CTTGGTCTGA 
TGAAGGTGAA 
GGACCCCATA 
AGGCTAGCTG 
ACCAGGAGGA 
GCCTTTGTTA 
TCCCCAGGCC 
ACCAGAGTCA 
GAAGCCCAAG 
GAGACTACCT 
CCTCCCCAGA 
AGCCAATTCG 
CCAGCTCAGC 
CATTTCCTGC 
AGCGTCATCA 
ATGCAGGTGA 
CTTGTCACTG 
AAGGCCGCCC 
GAAGAGGTTA 
TTCTACGGGG 
TACCGGCAGG 
CACGCTGAAA 
CCCATCTGCT 
GCACCAGCCG 
GCCCCATGTG 
CAGTGGCAGT 
GTGATTTGGA 
AGTTTAATGA 
ATGTTATTTA 
TGAATTGGGA 
CAGCAAAATA 
ACCTCTTTCT 
TCTTTGAGCA 
TTTTTTCTTC 



11 
I 

GGAT CTC AGG 
CCTGTGTTCG 
GGGAGGAT CG 
CCCTACTGTC 
GGGATCACTG 
GAGGGAGGGT 
AAACACAGAG 
GATGGACTCC 
AGGGAGGGTT 
GACACATGGA 
AGGTGTGGGC 
CTGATCTGAG 
GTCAGGGCCC 
AGTGTCCAGC 
CCTAAGGGCC 
GACAGTGTCC 
GTGTTCACCC 
GCACCTGGCC 
CACGCTGAGT 
CAGGAGCCCC 
GAACCTCCAA 
TGTGGGTCTC 
TCATGTCTCT 
GAGAGGACTT 
CCTCCTCTGA 
GTCCTCAGGG 
ATGAGGGCTC 
TGGAGTTCAT 
TCCACAAATA 
AAAATTACAA 
TCTTTGGCAC 
CTCTTGGCCT 
T CCTGAT CAT 
TCTGGGAAGC 
AGCCCAGGAA 
TGCCCGGCAG 
CCAGCTATGA 
ACCCATCCCT 
CAGCCGGGGC 
ACATGAGGCC 
GGGTGGAAGT 
GATTTATCCT 
ACTTCACCAT 
GGAGTAAGAT 
CAAGATAACA 
GAGCTCATAA 
CTCTCCTGTA 
TGTAAGAGAA 
AGACACGCAC 



21 
I 

GAGGTGAGGA 
ACAGACACAG 
AGGGTACCTC 
ACCCCAGAGA 
GTGTCGGGGA 
CCCAGGCCCT 
GACCTAGCCC 
CCTCACTTCC 
CCAGACCCTG 
CCCCATTGAA 
AGATGTTGGT 
AGACTCTCAG 
TGAGTGAGCG 
CCGCCCTCTT 
CCTCGATTCC 
TCAGGTCGCA 
TGAATGTGCA 
CCATTCCCCC 
AGCCCTCTCA 
AAGAGGCCCC 
GGTTCGGTTC 
CATCGCCCAG 
CGAGCAGAGG 
GGGCCTGATG 
CAGCAAGGAG 
AGGCGCTTCC 
CAGCAGTCAA 
GTTCCAAGAA 
TCGAGTCAAG 
GCGCTACTTT 
TGATGTGAAG 
CTCGTGCGAT 
TGTCCTGGGT 
GTTGAGTGTG 
GCTGCTCACC 
TGAT CCTGCG 
GAAGGTCATA 
TTATGAAGAG 
CAAAGTTTGT 
CATTCTTCGC 
GAGCACACTG 
TGCTCCCTTT 
CGAAGTTAAT 
TCTTGCTTTT 
TAG CAG AGG A 
AGAAATAGTG 
AAATTAAAAC 
ATAAAAATTG 
TGAACATCTG 



31 
I 

CTTTGTTCTC 
TGGTCCCAGG 
CAGGCCAGAG 
GCCCGGGCAG 
GGGCTGGCCT 
GCCAGGAGTC 
CACCCTGCCC 
TCTTCAGGTG 
CAGGCATCAA 
TTTAGACATC 
TGGGGCATGT 
GCAAGTAGAG 
CAGAGGGGAC 
GACAGCACTG 
TCTTCCAGGA 
GAGCAGAGGA 
CCAAGGGCCC 
TACTGTCACT 
CTTCCTCCCT 
AGAGCAGCAC 
TCAGCTGAAG 
CTCCTGCCCA 
AGTCCGCACT 
GGTGCACAGG 
GAGGAGGTGT 
TCCTCCATTT 
GAAGAGGAAG 
GCACTGAAAT 
GAGCCGGTCA 
CCTGTGATCT 
GAGGTGGACC 
AGCATGCTGG 
GTGATCCTAA 
ATGGGGGTGT 
CAAGATTGGG 
CACTACGAGT 
AATTATTTGG 
GTTTTGGGAG 
GGGGTCAGGG 
TCTGTGTTTG 
TATGTCATCT 
TGGAATTGTT 
GAATGACAGT 
GAGTCACATG 
ATTAATAATT 
AAATGAAAAT 
ATATACATGT 
AAAGAATAAT 
TTATTCGGAA 



41 
I 

AGAGGGTGTG 
ATTGGAGAGC 
AAACTCTCAG 
GGCTGTCTGC 
TGGTCTGAGG 
CAGGTGCAGA 
CTTGTGTCAG 
TCTCCTGGAG 
GATGAGGACC 
TCTTACTGTA 
CCTTCTGTTC 
GAGTAGAGTC 
CATCCACCCC 
AGGGACCGGG 
GCTCCAGGAA 
GACCCAGGCA 
CACCTGCCCC 
CATAGAGCCT 
CAGGTTCTCG 
TGACGAAGAC 
TCTCTCACAC 
CGCTCCTGAC 
GCAAGCCTGA 
AACCCACAGG 
CTGCTGCTGG 
CCGTCTACTA 
AGCCAAGCTC 
TGAAGGTGGC 
CAAAGG CAGA 
TCGGCAAAGC 
CCGCCGGCCA 
GTGATGGTCA 
CCAAAGACAA 
ATGTTGGGAA 
TGCAGGAAAA 
TCCTGTGGGG 
TCATGCTCAA 
AGGAGCAAGA 
CCCCATCCAG 
AAGAGAGCAA 
CTGGGTTCCT 
CAAATGTTCT 
AGT CACACAT 
GGGAAATCCC 
TTTTTGAAAC 
GTAGTTAATT 
ATACCTGGAT 
TTTTCCTGTT 
CACCCTGGGT 



51 
I 

TGTGGACAAA 
AGT CCAGGTG 
AT CAAGAGAG 
TGAGGTCCCT 
GGGCTGCACT 
CTGAGGGGAC 
CTGAGGGAAG 
ATAGGGCCTC 
AGGCAGTATC 
CTT CCGAGGA 
CAT AT CAGGG 
CAGTCCCTGC 
AAAAGTGTGT 
GCTCTGCCTG 
GCAGGCAGGC 
GTGT CAGCAG 
AG CACACATG 
TGATCTCTGC 
GGACAGGCTA 
GTGTAAGTCA 
ACTCCCTCTC 
TGCTGCCCTG 
TGAAGACCTT 
CGAGGAGGAG 
GTCAT CAAGT 
CACTTTATGG 
CTCGGTCGAC 
TGAGTTGGTT 
AATGCTGGAG 
CTCCGAGTTC 
CTCCTACATC 
TAGCATGCCC 
CTGCGCCCCT 
GGAGCACATG 
CTACCTGGAG 
TTCCAAGGCC 
TGCAAGAGAG 
GGGAGTCTGA 
CAGCTGCCCT 
TCAGTGTTCT 
TGTCTATTGG 
TTTAATGGTC 
ATTGCTGTTT 
TGTTATTTTG 
TTGAACTTAG 
CTTGCCTTAT 
TTGCTTGGCT 
CACTGGCTCA 
T 



60 
120 
180 
240 
300 
360 
420 
430 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 



Seq ID NO: 162 Protein sequence: 
Protein Accession #: AAA68877.1 
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10 



1 11 21 31 41 51 

I I I I I I 

MSLEQRSPHC KPDEDLEAQG EDLGLMGAQE PTGEEEETTS SSDSKEEEVS AAGSSSPPQS 60 

PQGGASSSIS VYYTLWSQFD EGSSSQEEEE PSSSVDPAQI* EFMFQEAIjKIj KVAELVHFLIi 120 

HKYRVKEPVT KAEMIjE S V I K NYKRYFPVIF GKASEFMQVI FGTDVKEVDP AGHSYILVTA 180 

LGLiS CD SMLG DGHSMPKAAL LIIVLGVILT KDNCAPEEVI WEALSVMGVY VGKEHMFYGE 240 

PRKLLTQDWV QENYLEYRQV PGSDPAHYEF LWGSKAHAET SYEKVINYLV MLNAREP I CY 300 
PSLYEEVLGE EQEGV 

Seq ID NO: 163 DNA sequence 
Nucleic Acid Accession #: AF292100 
Coding sequence: 30-809 

15 1 11 21 31 41 51 

I I I I I I 

GGGGGGGGAG AGGCCTGGAG GACACCAACA TGAACAAGTT GAAATCATCG CAGAAGGATA 60 

AAGTTCGTCA GTTTATGATC TTCACACAAT CTAGTGAAAA AACAGCAGTA AGTTGTCTTT 12 0 

CTCAAAATGA CTGGAAGTTA GATGTTGCAA CAGATAATTT TTTCCAAAAT CCTGAACTTT 180 

20 ATATACGAGA GAGTGTAAAA GGATCATTGG ACAGGAAGAA GTTAGAACAG CTGTACAATA 240 

GATACAAAGA CCCTCAAGAT GAGAATAAAA T TGGAATAG A TGGCATACAG CAGTTCTGTG 300 

ATGACCTGGC ACTCGATCCA GCCAGCATTA GTGTGTTGAT TATTGCGTGG AAGTT CAG AG 3 60 

CAGCAACACA GTGCGAGTTC TCCAAACAGG AGTTCATGGA TGGCATGACA GAATTAGGAT 420 

GTGACAGCAT AGAACAACTA AAGGCCCAGA TACCCAAGAT GGAACAAGAA TTGAAAGAAC 480 

25 CAGGACGATT TAAGGATTTT TACCAGTTTA CTTTTAATTT TGCAAAGAAT CCAGGACAAA 54 0 

AAGGATTAGA TCTAGAAATG GCC AT TGCCT ACTGGAACTT AGTGCTTAAT GGAAGATTTA 60 0 

AATTCTTAGA CTTATGGAAT AAATTTTTGT TGGAACATCA TAAACGATCA ATACCAAAAG 660 

ACACTTGGAA TCTT CTTTT A GACTTCAGTA CGATGATTGC AGATGACATG TCTAATTATG 720 

ATGAAGAAGG AGCATGGCCT GTTCTTATTG ATGACTTTGT GGAATTTGCA CGCCCTCAAA 78 0 

30 TTGCTGGGAC AAAAAGTACA ACAGTGTAGC ACTAAAGGAA CCTTTTAGAA TGTACATAGT 840 

CTGTACAATA AATACAACAG AAAATTGCAC AGTCAATTTC TGCTGGCTGG ACTGAACTGA 90 0 

AGATCAATCC TCACAATTCA GACTGAGGGT TGAGACAAAA CTTTAAGGAT ACATCTTGGA 960 

CCATATCGTA TTTCATTCTT CTAATGGTGG TTTGGGCTTG TCTTCTAGTC TGGGCCGCTC 102 0 

TAAACATTTA TAATT CCAAC ATTGTGGATT T CATCTT ATA TCTGTGGACC AT CCT AGTTT 1080 

35 ATTCTCCCAT AAGTCTTAGA AGCTTTATGG TGATTATTTT GAGGTTTTCA TTCTCGCATA 114 0 

AAGCACAATG CTGTCTTCAT CAGAAAACAG TTGGCATAAG AATTAAACAT ATGAACATCA 1200 

CAAAACAATT TATAAAAACT TCTTAAATAT ACGCTTTGGG CTAGTTGCAA AGACTATGCT 12 60 

AATAGCACTT CCAGTGAGAG TGATATATTT AAGTGTACTG GATCTGGAAT GGTGTTTTGG 132 0 

TTTGGGGGGA ATTTTTTTTT TTTCCTGGCA AATCACATAT GTTGTTGATG TGAGTATCTG 13 80 

40 ATGAAAAAAC AATGTCAGAA TAACCGACAT GAAAATTTTT TAGGATAACT TGGTGCCTAC 144 0 

CTGAAAAATG TATTGTGTTT TAGACTCTTG ATTTCAAAAG GTTCCACAGA ACTAGTCTGC 15 0 0 

GCTTACCTTA CCCATGTTTA TATATAGCTG TCCTACAGGG AGCTTTTATT TAGAAAATGT 1560 

CTGCATAATG TTAGATTCTT CTCCTGTCTA CATTATGCAC TACATAATTG GACTT CATT A 1620 

TGCTTTTGAA ATGCTTATCT GCCTGTCACA TAAGTTAAAC TATTTAATTT GTTTTGAATG 1680 

45 TTTTGGATTG CTACACAATA CAATATTCTA AATTTAGGCA TGAGGGTTTT TTTGTTTTAT 1740 

TTTTACTTTT TTTTTGTCAT TGCACTATGG AACACAAATG AAATTCTCTT AATTTATAAG 180 0 

AAGATAGTAG GAGTTAAATT TTGAAAATGG TTGTGATGAG CCACGAAATT CAATCTTTAT 1860 

AATATAGGTA CTGCTCTTTC AGACAAACAG TCCATTTTTA ATGACTTCTT ATTTTGTTGA 192 0 

AATTACTTTA ACTGCTAATC ACTGTGGTTG CCAAATATTT ACTTCAGAAG CAAAGATTTT 1980 

50 C AAACAAG C A TACACGATGC AAAATACCAG TCTGGCTTCT AGTCTATTTA CTGTTTTGTT 2040 

TCACTCAGAT TAGCTCAGTT TTCTCATCAA AGCAGAATGC TATCTTGCGT GTGTGTGTGT 2100 

GTGTGTGTGT GTGTGTGTGT GTATGTGTGT ATATATATAT ATATATATAT ATATATATTT 2160 

TTTTTTTTTT TTTTTTTTAA ATTACAAAAG CCATGAGCTG CTTTTATGCT GAAAATGGTC 222 0 

ATTTCCCTGT TCACTTACTG ACATGTGAAG AAGGGTTTCT TGCTTTCTTA AACATTTCCG 2280 

55 TAAGGCAGGC TAGAAATGTA ATACTT CAAA TGTTTGATGA TTATGGTCTT TTGATAGGAA 2340 

TAGATTCTGC TTGGGATATA TAT CCAGGCA CTCTCTAAGG TCTAGGGTTG ATATTAACAA 24 0 0 

AGGAATGTAC TTAGAATAGC AGTACATTTT ATGCAAATAT GGAAATTATT TTAAGAAACA 2460 

ATGACATATC AAAACTGCTT TTTACATGAT TTTGAAATAG ACTAGAAAGC TTTCCCTATA 2520 

GACATATTAA T ATT CCAAT C ATAACTTTAA TTCAAGAATG CAGTTTTACC AAAAGAAAAA 258 0 

60 TTTGAAAATT TCTATTCAGG CTACTGGAAT TGGTTATTAA AAGAAAAAGG AAAAAGAAGA 2640 

ATCTTGCTGC TTTCAGTATT TCCTGATTTT TTTGTAAATA TAAAGAGGAA CTTCAATTAT 2700 

GAAAAATTTT TAAAAGATAT ATATATCTAT ATATCTATAT ATATGTACTG TTTTGTTTCC 2 760 

TGTCTTGAAG ATTTTGAGTT ATGGTTATTG GTTTCAGATT GATTAATTCA CATATGCTGT 2 820 

GTTTTCTTTA AAAGT CAT AT GGGTTCGTGG CCTAATGCCT TGGATTTTAC ATATTTTTCT 2 880 

65 TTTTAAATGC AAAACCTTTT CAACAAAATA GTGTTTGTCA TCAGGTTGGT ACTAAACATT 2 940 

TATAATTACT GTGTAATTAT AAACAAAAAT ACATAAAGCT TTGAATATAA TTATGTAGCA 3000 

TAAAAGTTAA GGTTGTTCAC TATGATGGCA TCTTAGAATT AAACAAAACT TTTACTAGGG 3060 

CTGAAAAGAG AAGACTGATT TAATGTGGTG TGATTATTCT GAAGATAAAT GT CTGGCT AC 312 0 

AGGGAATATT TTGTACTAAA AAATGATTAC ACATATGGCT GTGTGTGTTT GAGTCTGTGT 3180 

70 CTGTGAGAGA GCCAGAGAGA GTGAGAGAGA TTGACAGAGA AAGGGAGAGA CACACACACG 3240 

CCCCTTGAAT TGCTTTAACT CCTAAGTGTT TCAGTCCTCA TTCCGGTAAA CTCCCCATGC 3300 

TGATTCTTTG TTTTAAACTG AACCATAGGT ACAGTTTCCT TTTTGCCAAA TGT CAAAACA 3360 

GGTACAAATT TTAAAATGTA ATGCTTTTTA AATAGAAAAA TGTATAAAAT TAGAAGTGCC 342 0 

CACATATAAA AAATACTTGA GATGAAGATT ATCTTTAGTG AATATCATCT GCATATCTCT 3480 

75 GTAAGTTCAA TTGTGTTTCT TACAGTCCCT GTCATATTAC CAACAGAGGC AATAAAAGCT 3540 
GCAGTGAAAT TG 



80 



Seq ID NO: 164 Protein sequence: 
Protein Accession #: AAG00606 



1 11 21 31 41 51 

l l I I I I 

MNKLKSSQKD KVRQFMIFTQ SSEKTAVSCL SQNDWKLDVA TDNFFQNPEL YIRESVKGSL 60 

DRKKLEQLYN RYKDPQDENK IGIDGIQQFC DDLALDPAS I SVLIIAWKFR AATQCEFSKQ 12 0 

85 EFMDGMTELG CDSIEQLKAQ IPKMEQELKE PGRFKDFYQF TFNFAKNPGQ KGLDLEMAIA 180 

YWNLVLNGRF KFLDLWNKFL LEHHKRSIPK DTWNLLLDFS TMIADDMSNY DEEGAWPVLI 240 
DDFVEFARPQ IAGTKSTTV 
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Seq ID NOt 165 DNA sequence 
Nucleic Acid Accession #: AF256215 
Coding sequence: 220-2028 

1 11 21 31 41 51 

I I i I 1 1 

CTCCAGTCCG CATGCTCAGT AGCTGCTGCC GGCCGGGCTG CGGGGCGGCG TCCGCTGCGC 60 
GCCTACGGGC TGCGGTGGCG GCCGCCGCGG CACCCGGCAG GGCCCGCCAG TCCCCGCTTC 120 
CCTGCTCCAG AGCCGCCGCC TGGGCCGGGG CAGGGCGGGC CCGGGGCTCC TCCATGCTGC 180 
CAGCCGCCGG GCTGCGGAGC CGACCAAGTG GCTCCTGCGA TGGCGGCGGA AGAGGAGGCT 240 
GCGGCGGGAG GTAAAGTGTT GAGAGAGGAG AACCAGTGCA TTGCTCCTGT GGTTTCCAGC 3 00 
CGCGTGAGTC CAGGGACAAG ACCAACAGCT ATGGGGTCTT TCAGCTCACA CAT G AC AG AG 360 
TTTCCACGAA AACGCAAAGG AAGTGATTCA GACCCATCCC AAGTGGAAGA TGGTGAACAC 42 0 
CAAGTTAAAA TGAAGGCCTT CAGAGAAGCT CATAGCCAAA CTGAAAAGCG GAGGAGAGAT 4 80 
AAAATGAATA ACCTGATTGA AGAACTGTCT GCAATGATCC CTCAGTGCAA CCCCATGGCG 540 
CGTAAACTGG ACAAACTTAC AGTTTTAAGA ATGGCTGTTC AACACTTGAG ATCTTTAAAA 600 
GGCTTGACAA ATTCTTATGT GGGAAGTAAT TATAGACCAT CATTTCTTCA GGATAATGAG 660 
CTCAGACATT TAATCCTTAA GACTGCAGAA GGCTTCTTAT TTGTGGTTGG ATGTGAAAGA 720 
GGAAAAATTC TCTTCGTTTC TAAGTCAGTC TCCAAAATAC TTAATTATGA TCAGGCTAGT 780 
TTGACTGGAC AAAGCTTATT TGACTTCTTA CATCCAAAAG ATGTTGCCAA AGTAAAGGAA 840 
CAACTTTCTT CTTTTGATAT TTCACCAAGA GAAAAG CTAA TAGATGCCAA AACTGGTTTG 90 0 
CAAGTTCACA GTAAT CTCCA CGCTGGAAGG ACACGTGTGT ATTCTGGCTC AAGACGATCT 960 

TTTTTCTGTC GGATAAAGAG TTGTAAAATC TCTGTCAAAG AAGAGCATGG ATGCTTACCC 102 0 

AACTCAAAGA AGAAAGAGCA CAGAAAATTC TATACTATCC ATTGCACTGG TTACTTGAGA 10 80 

AGCTGGCCTC CAAATATTGT TGGAATGGAA GAAGAAAGGA ACAGTAAGAA AGACAACAGT 1140 

AATTTTACCT GCCTTGTGGC CATTGGAAGA TTACAGCCAT ATATTGTTCC ACAGAACAGT 1200 

GGAGAGATTA ATGTGAAACC AACTGAATTT ATAACCCGGT TTGCAGTGAA TGGAAAATTT 1260 

GTCTATGTAG ATCAAAGGGC AACAG CGATT TTAGGATATC TGCCTCAGGA ACTTTTGGGA 132 0 

ACTTCTTGTT ATGAATATTT T CATC AAGAT GACCACAATA ATTTGACTGA CAAGCACAAA 1380 

GCAGTTCTAC AGAGTAAGGA GAAAATACTT ACAGATTCCT ACAAATTCAG AGCAAAAGAT 1440 

GGCTCTTTTG TAACTTTAAA AAGCCAATGG TTTAGTTTCA CAAATCCTTG GACAAAAGAA 1500 

CTGGAATATA TTGTATCTGT CAACACTTTA GTTTTGGGAC ATAGTGAGCC TGGAGAAGCA 1560 

TCATTTTTAC CTTGTAGCTC TCAATCATCA GAAGAATCCT CTAGACAGTC CTGTATGAGT 162 0 

GTACCTGGAA TGTCTACTGG AACAGTACTT GGTGCTGGTA GTATTGGAAC AGATATTGCA 1680 

AATGAAATTC TGGATTTACA GAGGTTACAG TCTTCTTCAT ACCTTGATGA TTCGAGTCCA 1740 

ACAGGTTTAA TGAAAGATAC TCATACTGTA AACTGCAGGA GTATGTCAAA TAAGGAGTTG 180 0 

TTTCCACCAA GTCCTTCTGA AATGGGGGAG CTAGAGGCTA CCAGGCAAAA CCAGAGTACT 1860 

GTTGCTGTCC ACAGCCATGA GCCACTCCTC AGTGATGGTG CACAGTTGGA TTTCGATGCC 1920 

CTATGTGACA ATGATGACAC AGCCATGGCT GCATTTATGA ATTACTTAGA AGCAGAGGGG 1980 

GGCCTGGGAG ACCCTGGGGA CTTCAGTGAC ATCCAGTGGA CCCTCTAGCC TTTGATTTTT 2040 

AACTCCAAAA ATGAGAAACA TTTTAAAGCA TTATTTACGA AAAAACTGTC TCAACTATTC 210 0 

TTAAGTACTG TATTGATATT GTTTGTATCT TTTATTAATG TTCTACCACT TTTTATAGAT 2160 

TTGCATCTTC CTGTCACAGG GATGTGGGGA AATACGTTTT CCTCCCAAGA GAACCAAGTT 222 0 

TATTATAGAC TCCTTTATTC AGTGAAATGG CTTATAATCC ACTAGTTGCC ATATTTTTGC 22 80 

TAAAATATTT CTAACCAAGA ATACTACTTA CATATTGTTT TGGCTTTGTT TTATTTTTGA 2340 

TGCAGTTTTT TTTAGTTGAG GTAATGTAAT ATATTGATGT TTTCCTTTGT GTCTAAGATT 240 0 

GATTTATAAT AGTAGGTTTG TATAATTTGG AACATTTTCC ATGCCTTGCG AATTTCCTTA 2460 

ATTGAGGATA GGGCTTACAC ACTTTAAGAA AACAGTGAGT ACTTGAACAT TTAAAGGGAC 252 0 

AGTGCAATTT ATAGTCATAA TCACATTGAA TACTGTATTT GATCTTTGGA GACTTAGGCA 2580 

AG CACAGAGC TGGGATATTT ATGCT CAGTT GAGCACTTTA AGATGAATTT TAAGTGAGAT 2 64 0 

GATTTCTTGC TTAAAACTCA GAAAGTCAAA AGAGTTTCAG CTTTCCTTAC AGAAAAGGAA 2700 

GGATCTTGGG CCCTAGATCT TGGGGATTAA CCTCTGCATA TAAGATTTAC TCTTAATAGG 2760 

CCAGACGTGG TGCTCACGCC TGTAATCCCA GTACTTTGGG AGGCTGAGAC GGGCAGATCA 282 0 

CTTGAGGTCA GGAGTTCAAG ACCAGCCTGG CCAATATGGT GAAACCCCGT TTCTACTAAA 2 8 80 

AATACAAAAA AAAT TACCCA GGCACTCACT CTTGAGGTAA CTAACCAACT CCCACGATAA 2 940 

TGACAGTCCA TTCATGAGCG CAAAGGCCTC ATGACCTAAT GGCACACACC TGTAATCCCA 3 000 

ACTGCTTGGG AGGCTGAGGC GAGAGGATTG CTTGAACCTG GGAGGCAGAG GTTGCAGTGA 3060 

GCCGAGATCG CACCACTGCA CTCCAGTCTG GGCAACAGAG TGAGACTTCA TCTCAAAAAA 312 0 

AGTAAAAAAA AAGATTTAAT ATAATCACTG AAGAT CT CT A TTATAGATAG ATTAGGTTTT 3180 

TGACATTGGA AACATACTTA GGGATAGATT TGTCCTAAAG GAAAAAAGTA GGCCCGGGCA 3240 

GATTAAATGT CTTGTGTAAA GTCACACATT AAATTCAGTC ACACATTAAA TTCATAGAGT 330 0 

TTTAAATGTT TAATGTATAT AAACCAGTTT CTTTATACAC ATTTGGGAAA ACATTGGTCT 3360 

CACAGATTAA ATGATTAACT AACTGACCCA GGAACTAGTT GTAGCTTTCT AAGTAATTAG 3420 

GCAATTACAG TTATTGCCTG TAACCAAAGG TAATAAAACA AAATGACAAG TACATGTTTA 3 480 

AAAT TATGAG GCAATGAGAA ATAATTTAAA AACCAATTTT CTAGTTATAA TTTAAAATTT 3540 

GGAGAGCATT TTTAACAGTA ATTAATCCAG AGGTGGCTCA AATTGAGTAT AAGAATTAAG 360 0 

ATTATTTAAA ATACTGCATG TCTACCTTCT CGGGGATCAT ACTTTATAAC ACTTTCTGCT 3660 

TCAGTAGCTC TTCATAGCTT GCCAAGTATG CTCCCATATT TTCTCTCTCG TGCCTCGCAA 3 72 0 

ATGAAAGTCA GATAGGCTGG GAACTCATGG GGCAGCCCTC AGACTTCAAT GTGGGCTTCA 3780 

AATCCAGTTT CCTGTTCTAT ATGGTGCTAC ATCTTTCCAG AAAATTTCCC TCAGAGCCCC 3 840 

TCGCCAAAAC AAAGCATTAT TTTGACCCTG CATGCTATTT CTTTAGCTGT AGGTGATAGA 3 90 0 

TTAGAACTTC TGTCAGACAT GTTAATGACA AACATACCAA CAGACAATAA CCAAAGCAAA 3960 

TGTTTCCTTC AAGTGTGAAA TGTGCAGGGG CTCGTGGGCA AGGATGTATT GGCACACTGT 402 0 

CCTCTTGAAC TGATAGTGTC CCAGCAATGT TGGAGGTTGG CACCATTCCT GGT CCGACAC 4080 

TTGAGGACCT GAGAGACATC AGGTTTAGAA TGAGCCAAAG AAATCCTACA AGATGGGGAG 4140 

AATTGGTGTG CAGCAGCCTA AGTGTTATAG TTAAGTCTAA AGAAGTATGA AAGATCCCCT 4200 

GTGTTCTCTA AATTG AG CAG AGGGGCCTGC CTACCAATAT CACTTTTTAG GGGACTGAAC 42 60 

CATTGCAGGT TAGACTTGGC TTCCAAAGAG TCTGCCTAAG CCAGGGGTGG CAGGGTAGGC 432 0 

CAT CAT AGCT GGATGGCCTC AAAAGCAGAT GGGGGCAGAC TTGCCCTCGT GATGCCAGGA 43 80 

TTTGAGAGGC AGAGT TTCTA GAGGGAGACC AGTGCTGCCT CTCACAGTGG CAGTTTTTTC 4440 

TCTTTGCAAG AGGAGGGGCT GTTCAATTCC ATAGACCAGT GGGCAGATAG CCAGTTGAAT 450 0 

ACTCTGTGCA TGGTTTGATC CTTTATTAGT TCGCTCTAAT ATTTTTCTGT AGATCCTTTT 45 60 

GTCCTGGACT CAAAATCTAA TCCATGCATT GTATGATACC GTAGCTCTCC TAAGGTTTGT 462 0 

GTTTCCTTCA AAATGTTTTA GTTTTCTTCA ACTAAATTTG ATTTTTGCTG TTAGAAGTGA 4680 

CATATTTTTA TGGTATACAC TATGTTCCTT TTTTCTACTG CGAGTCAATT TTTTGAATTT 4740 

TCGTGAGAAA GAATATATCT ACAAATTGCA CGAAAGTATC ATAAAAACAG TACTCTAGAG 4800 
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CAGCGCTGTC CAATAGAAAT ATAAT CTGAG CCACATGTAT AATTTTATTT TCTTCTAGCC 4860 

ACATTAAAGA AGTAAAAAGA TACAAGTAGA ACTAATTTTA ATGTTTTAAT TCAGTATATC 492 0 

CAAAATATCA TTTGAACATG TAATTAATAT AAAATTATTA ATGTGATATT TTACATTCTT 4980 

TTGGTAATAC TAGTCTTCAA AATCTGGTAT GTAT CTTACA TTGATAGCAC ATCTCACTTT 5040 

GTACTAGCCA CATTGCAAGT GCTCAGTAGC CACATGTGGC TAGTGGCTAC TGCACTGGAC 5100 

AGCACAGTTC TAGGTTCCAC GCTAACACGG AAGTCCTGTG GATTAGAATC CCAGAATCAG 5160 

AGCTGGAAGT AAACATAGAG ATCAAACCTC CTTTTAAAAA TGAGGACGCT GAGGGACAGA 522 0 

GTTTAAATGG CTTGCATGAG GTCATACAGC TAAATTCAGC CTCAACAGGG TCTTCTGATT 52 80 

CCAGGCACTC TTCCCACTCC ACTACATTAC TGTAGTGGTA ATTCTTAGGG TTAAAAAAAG 5340 

TGTAGAGTAG GCCGGGCGCA GTGGCT CATG CCTGTAATCC CAGCACTTTG GGAGGCCGAA 5400 

GTGGGCGGAT CACGAGGTCA GGAGATCGAG ACCATCCTGG CCAACATGGT GAAACCCCGT 5460 

CTCTACTGAA AATACAAAGC AAAATTAGCC AGGTGTGGTG GCGGGCGCCT GTGGTCCCAG 5520 

CTGCTCTGGA GGCTGAGGCA GAATGGCGTG AACCCAGGAG GCAGAGATGG CAGTGAGCCA 5580 

AGATCGCGCC ACTGCACCCC AGCCTGGGCG ACAGAGCGAG ACTCCATCTC AAAAAAAAAA 5640 

AAAAAAAAAA AAGAAAAGAA AAGAAAAGTC TAGAGAACAT TATATTAAGT GGTTATTATT 5700 

GAAGTAGACC AAAGTTTATA CCATAAGGAT ATTTTTCCTT AAATACCATG TTTGAAGAAC 5760 

AATTATTTAT TGATCCTTGA AT CTGT AAGA TCAAATAACA AGTCTCTATC CATGTTACCA 582 0 

AATTTAACCT TTTGAAAATA ATAAACTTTA AAATATCAGA TGTGTTATTA CAGGATGATA 58 8 0 

CTTGGAATCA AGTGAAATGA GTTATATGGT CAT CAC T AAA TTTAGAAATC TATTGTGAAA 5 940 

CAAAGACAAA CAGGAAAGTA CAGAATAGAG ACTTTTAGTA AATAAATGGA ATTTAAAAGA 60 0 0 

AAGTGTTTAT TTACAGTGTC ACGACAGAAA AGGATGTCTT TGTTGTCATA GTCTTTGAGG 60 60 

GATCTCCGTA AAATCTGGGG CACAGGTACA AGAAATAGCC AATATTTAGT TCCCAGACCA 612 0 

TGTTTAGTAG TGTCCAGTTT CAGATCATGC TGCCAAGAGG TATCTCCCCC TCAGGTGGGT 6180 

CATCACTGAG CCCTGGAATT GGAGACTCAT ACTTGCCCAG CACAATGTTA CGGGCAGACA 6240 

GGCCGACATC TATGATTAGC TAGAAGCCAT AAAGAAAAGC TGCTAAGTGG CCACTAGGTG 63 0 0 

CCACTTTTCT GTTTTTGTAA TGCTTTCATT AGCAGAT CTT TTTTTTCCAA GCTCCATGGG 63 60 

G CCTATGAG A GGCATTTATG ATTTTTGTGC CTACAATAAG TCAGCCTGTC TGGTGTGAGT 642 0 

TGTTTTATGA GAAATGCTTT CCAAGGGAGG TCTAGGAAGA TCCTGACACA TAAGAACTTT 6480 

GGCTTAGAGA GCTTTCCAGG TGTAGTGCCA ATAAAAACTG ACCTGGAAAG AAAACCTGCC 6540 

CAGCACGGAA CATGCTTTCT GAACTCACTT GAGAGTGTAT GGTGTATGTC ACTTCTCATA 6600 

TATTCTTGAG TTTAGATTTG TCTTTTATAC AATTTTTAGC TCTTTTCCAG TTCACTTGTG 6660 

CTCGTCTGTA TATTGGTATT TTTAAATTTT TGTGGTAAAT AATGAAAAGA GTGAAATTAT 672 0 

ATTTTATAAT TACTCATTTG TAGTTTTTTT .TTTTAATTTA ATAAACTTCC TCCAAAAAGT 67 80 
GCTCCCTTAA AA 

Seq ID NO: 166 Protein sequence: 
Protein Accession #: AAG34652 

1 11 21 31 41 51 

1 1 1 I I I 

MAAEEEAAAG GKVLREENQC IAPWSSRVS PGTRPTAMGS FSSHMTEFPR KRKGSDSDPS 60 

QVEDGEHQVK MKAFREAHSQ TEKRRRDKMN NLIEELSAMI PQCNPMARKL DKLTVLRMAV 12 0 

QHIiRSLKGLT NSYVGSNYRP SFLQDNELRH LILKTAEGFL FWGCERGKI LFVSKSVSKI 180 

LNYDQASLTG QSLFDFLHPK DVAKVKEQLS SFDISPREKL IDAKTGLQVH SNLHAGRTRV 240 

YSGSRRSFFC RIKSCKISVK EEHGCLPNSK KKEHRKFYTI HCTGYLRSWP PNIVGMEEER 3 00 

NSKKDNSNFT CLVAIGRLQP YIVPQNSGEI NVKPTEFITR FAVNGKFVYV DQRATAILGY 3 60 

LPQELLGTSC YEYFHQDDHN NLTDKHKAVL QSKEKILTDS YKFRAKDGSF VTLKSQWFSF 42 0 

TNPWTKELEY IVSVNTLVLG HSEPGEASFL PCSSQSSEES SRQSCMSVPG MSTGTVLGAG 480 

SIGTDIANEI LDLQRLQSSS YLDDSSPTGL MKDTHTVNCR SMSNKELFPP SPSEMGELEA 540 

TRQNQSTVAV HSHEPLLSDG AQLDFDAL CD NDDTAMAAFM NYLEAEGGLG DPGDFSDIQW 600 
TL 

Seq ID NO: 167 DNA sequence 
Nucleic Acid Accession #: NM_014400 
Coding sequence: 86-1126 

1 11 21 31 41 51 

I I i I I I 

GGTTACTCAT CCTGGGCTCA GGTAAGAGGG CCCGAGCTCG GAGGCGG CAC ACCCAGGGGG 60 

GACGCCAAGG GAGCAGGACG GAGCCATGGA CCCCGCCAGG AAAGCAGGTG CCCAGGCCAT 12 0 

GATCTGGACT GCAGGCTGGC TGCTGCTGCT GCTGCTTCGC GGAGGAGCGC AGGCCCTGGA 180 

GTGCTACAGC TGCGTGCAGA AAGCAGATGA CGGATGCTCC CCGAACAAGA TGAAGACAGT 240 

GAAGTGCGCG CCGGGCGTGG ACGTCTGCAC CGAGGCCGTG GGGGCGGTGG AGACCATCCA 3 00 

CGGACAATTC TCGCTGGCAG TGCSGGGTTG CGGTTCGGGA CTCCCCGGCA AGAATGACCG 3 60 

CGGCCTGGAT CTTCACGGGC TTCTGGCGTT CATCCAGCTG CAGCAATGCG CTCAGGATCG 420 

CTGCAACGCC AAGCTCAACC TCACCTCGCG GGCGCTCGAC CCGGCAGGTA ATGAGAGTGC 480 

ATACCCGCCC AACGGCGTGG AGTGCTACAG CTGTGTGGGC CTGAGCCGGG AGGCGTGCCA 540 

GGGTACATCG CCGCCGGTCG TGAG CTGCTA CAACGCCAGC GATCATGTCT ACAAGGG CTG 60 0 

CTTCGACGGC AACGT CACCT TGACGGCAGC TAATGTGACT GTGTCCTTGC CTGTCCGGGG 660 

CTGTGTCCAG GATGAATTCT GCACTCGGGA TGGAGTAACA GGCCCAGGGT TCACGCTCAG 72 0 

TGGCTCCTGT TGCCAGGGGT CCCGCTGTAA CTCTGACCTC CGCAACAAGA CCTACTTCTC 780 

CCCTCGAATC CCACCCCTTG TCCGGCTGCC CCCTCCAGAG CCCACGACTG TGGCCTCAAC 840 

CACATCTGTC ACCACTTCTA CCTCGGCCCC AGTGAGACCC ACATCCACCA CCAAACCCAT 900 

GCCAGCGCCA ACCAGTCAGA CTCCGAGACA GGGAGTAGAA CACGAGGCCT CCCGGGATGA 960 

GGAGCCCAGG TTGACTGGAG GCGCCGCTGG CCACCAGGAC CGCAGCAATT CAGGGCAGTA 102 0 

TCCTGCAAAA GGGGGGCCCC AGCAGCCCCA TAATAAAGGC TGTGTGGCTC CCACAGCTGG 1080 

ATTGGCAGCC CTTCTGTTGG CCGTGGCTGC TGGTGTCCTA CTGTGAGCTT CTCCACCTGG 1140 

AAATTTCCCT CTCACCTACT TCTCTGGCCC TGGGTACCCC TCTTCTCATC ACTTCCTGTT 1200 

CCCACCACTG GACTGGGCTG GCCCAGCCCC TGTTTTTCCA ACATTCCCCA GTATCCCCAG 12 60 

CTTCTGCTGC GCTGGTTTGC GGCTTTGGGA AATAAAATAC CGTTGTATAT ATTCTGGCAG 132 0 

GGGTGTTCTA GCTTTTTGAG GACAGCTCCT GTATCCTTCT CATCCTTGTC TCTCCGCTTG 13 80 

TCCTCTTGTG ATGTTAGGAC AGAGTGAGAG AAGTCAGCTG TCACGGGGAA GGTGAGAGAG 1440 

AGGATGCTAA GCTTCCTACT CACTTTCTCC TAGCCAGCCT GGACTTTGGA GCGTGGGGTG 1500 

GGTGGGACAA TGGCTCCCCA CTCTAAGCAC TGCCTCCCCT ACTCCCCGCA TCTTTGGGGA 1560 

ATCGGTTCCC CATATGTCTT CCTTACTAGA CTGTGAGCTC CTCGAGGGCA GGGACCGTGC 162 0 

CTTATGTCTG TGTGTGATCA GTTTCTGGCA CATAAATGCC TCAATAAAGA TTTAATTACT 1680 
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TTGTATAGTG AAAAAAAA 



PCT/US02/12476 



5 



Seq ID NO: 168 Protein sequence: 
Protein Accession #: NP 055215 



1 ll 21 31 41 51 

I I ! I 1.1 

MDPARKAGAQ AMI WT AGWLL LLLLRGGAQA LECYSCVQKA DDGCSPNKMK TVKCAPGVDV 60 

CTEAVGAVET IHGQFSLAVX GCGSGLPGKN DRGLDLHGLL AFIQLQQCAQ DRCNAKLNLT 12 0 

10 SRALDPAGNE SAYPPNGVEC YSCVGDSREA CQGTSPPWS CYNASDHVYK GCFDGNVTLT 180 

AANVTVSLPV RGCVQDEFCT RDGVTGPGFT LSGSCCQGSR CNSDLRNKTY FSPRIPPLVR 240 

LPPPEPTTVA STTSVTTSTS APVRPTSTTK PMPAPTSQTP RQGVEHEASR DEEPRLTGGA 300 
AGHQDRSNSG QYPAKGGPQQ PHNKGCVAPT AGLAALLLAV AAGVLL 

15 Seq ID NO: 169 DNA sequence 

Nucleic Acid Accession #: NM_006875 
Coding sequence: 186-1190 

1 11 21 . 31 41 51 

20 | | | | | | 

GAATT CGGCA CGAGCGCGCG GCGAATCTCA ACGCTGCGCC GTCTGCGGGC GCTTCCGGGC 60 

CACCAGTTTC TCTGCTTTCC ACCCTGGCGC CCCCCAGCCC TGGCTCCCCA GCTGCGCTGC 12 0 

CCCGGGCGTC CACGCCCTGC GGGCTTAGCG GGTTCAGTGG GCTCAATCTG CGCAGCGCCA 180 

CCTCCATGTT GACCAAGCCT CTACAGGGGC CTCCCGCGCC CCCCGGGACC CCCACGCCGC 240 

25 CGCCAGGAGG CAAGGATCGG GAAGCGTTCG AGGCCGAGTA TCGACTCGGC CCCCTCCTGG 3 00 

GTAAGGGGGG CTTTGGCACC GTCTTCGCAG GACACCGCCT CACAGATCGA CTCCAGGTGG 360 

CCATCAAAGT GATTCCCCGG AATCGTGTGC TGGGCTGGTC CCCCTTGTCA GACTCAGTCA 42 0 

CATGCCCACT CGAAGTCGCA CTGCTATGGA AAGTGGGTGC AGGTGGTGGG CACCCTGGCG 480 

TGATCCGCCT GCTTGACTGG TTTGAGACAC AGGAAGGCTT CATGCTGGTC CTCGAGCGGC 540 

30 CTTTGCCCGC CCAGGATCTC TTTGACTATA TCACAGAGAA GGGCCCACTG GGTGAAGGCC 600 

CAAGCCGCTG CTTCTTTGGC CAAGTAGTGG CAGCCATCCA GCACTGCCAT TCCCGTGGAG 660 

TTGTCCATCG TGACATCAAG GATGAGAACA TCCTGATAGA CCTACGCCGT GGCTGTGCCA 72 0 

AACTCATTGA TTTTGGTTCT GGTGCCCTGC TTCATGATGA ACCCTACACT GACTTTGATG 7 80 

GGACAAGGGT GTACAGCCCC C C AG AGTGG A TCTCTCGACA CCAGTACCAT GCACTCCCGG 840 

35 CCACTGTCTG GTCACTGGGC ATCCTCCTCT ATGACATGGT GTGTGGGGAC ATTCCCTTTG 900 

AGAGGGACCA GGAGATTCTG GAAGCTGAGC TCCACTTCCC AGCCCATGTC TCCCCAGACT 960 

GCTGTGCCCT AATCCGCCGG TGCCTGGCCC CCAAACCTTC TTCCCGACCC T CACTGGAAG 102 0 

AGATCCTGCT GGACCCCTGG ATGCAAACAC CAGCCGAGGA TGTTACCCCT CAACCGCTCC 10 80 

AAAGGAGGCC CTGCCCCTTT GGCCTGGTCC TTGCTACCCT AAGCCTGGCC TGGCCTGGCC 1140 

40 TGGCCCCCAA TGGTCAGAAG AGCCATCCCA TGGCCATGTC ACAGGGATAG ATGGACATTT 120 0 

GTTGACTTGG TTTTACAGGT CATTACCAGT CATTAAAGTC CAGTATTACT AAGGTAAGGG 1260 

ATTGAGGATC AGGGGTTAGA AGACATAAAC CAAGTTTGCC CAGTTCCCTT CCCAATCCTA 132 0 

CAAAGGAGCC TTCCTCCCAG AACCTGTGGT CCCTGATTTT GGAGGGGGAA CTTCTTGCTT 1380 

CTCATTTTGC TAAGGAAGTT TATTTTGGTG AAGTTGTTCC CATTTTGAGC CCCGGGACTC 1440 

45 TTATTTTGAT GATGTGTCAC CCCACATTGG CACCTCCTAC TACCACCACA CAAACTTAGT 1500 

TCATATGCTT TTACTTGGGC AAGGGTGCTT TCCTTCCAAT ACCCCAGTAG CTTTTATTTT 1560 

AGTAAAGGGA CCCTTTCCCC TAGCCTAGGG TCCCATATTG GGTCAAGCTG CTTACCTGCC 1620 

TCAGCCCAGG ATTTTTTATT TTGGGGGAGG TAATGCCCTG TTGTTACCCC AAGGCTTCTT 1680 

TTTTTTTTTT TTTTTTTTTG GGTGAGGGGA CCCTACTTTG TTATCCCAAG TGCTCTTATT 1740 

50 CTGGTGAGAA GAACCTTAAT TCCATAATTT GGG AAGGAAT GGAAGATGGA CACCACCGGA 1800 

CACCACCAGA CAATAGGATG GGATGGATGG TTTTTTGGGG GATGGGCTAG GGGAAATAAG 1860 

GCTTGCTGTT TGTTTTCCTG GGGCGCTCCC TCCAATTTTG CAGATTTTTG CAACCTCCTC 1920 

CTGAGCCGGG ATTGTCCAAT TACTAAAATG TAAATAATCA CGTATTGTGG GGAGGGGAGT 1980 

TCCAAGTGTG CCCTCCTTTT TTTTCCTGCC TGGATTATTT AAAAAGCCAT GTGTGGAAAC 2040 

55 CCACTATTTA ATAAAAGTAA TAGAATCAGA AAAAAAAAAA AAAAAAAA 



Seq ID NO: 170 Protein sequence: 
Protein Accession #: NP 006866 



1 


11 


21 


31 


41 

1 


51 

1 




1 

MLTKPLQGPP 


1 

APPGTPTPPP 


1 

GGKDREAFEA 


1 

EYRLGPLLGK 


GGFGTVFAGH 


RLTDRLQVAl 


60 


KV I PRNRVLG 


WSPLSDSVTC 


PLEVALLWKV 


GAGGGHPGVI 


RLLDWFETQE 


GFMLVLERPL 


120 


PAQDLFDYIT 


EKGPLGEGPS 


RCFFGQWAA 


IQHCHSRGW 


HRDIKDENIL 


IDLRRGCAKL 


180 


IDFGSGALLH 


DEPYTDFDGT 


RVYSPPEWIS 


RHQYHALPAT 


VWSLGILLYD 


MVCGD I PFER 


240 


DQEILEAELH 


FPAHVSPDCC 


AlilRRCIiAPK 


PSSRPSLEEI 


LLDPWMQTPA 


EDVTPQPLQR 


300 


RPCPFGLVLA 


TIiSLAWPGLA 


PNGQKSHPMA 


MSQG 









60 



65 



70 Seq ID NO: 171 DNA sequence 

Nucleic Acid Accession NM_003646 
Coding sequence: 89.. 2875 

„ 1 11 21 31 41 51 

75 i | | l I l 

GCGGCGCGGA GCGGGCGTGC TGAGCCCCGG CCGCCGGCCC GGCATGGGCG TCTCCCGCGG 60 

GCCCTCCGCC GGCCGGGGCT AGGGCCGGAT GGAGCCGCGG GACGGTAGCC CCGAGGCCCG 12 0 

GAGCAGCGAC TCCGAGTCGG CTTCCGCCTC GTCCAGCGGC TCCGAGCGCG ACGCCGGTCC 180 

CGAGCCGGAC AAGGCGCCGC GGCGACTCAA CAAGCGGCGC TTCCCGGGGC TGCGGCTCTT 240 

80 CGGGCACAGG AAAGCCATCA CCAAGT CGGG CCTCCAGCAC CTGGCCCCCC CTCCGCCCAC 300 

CCCTGGGGCC CCGTGCAGCG AGTCAGAGCG GCAGATCCGG AGTACAGTGG ACTGGAGCGA 360 

GTCAG CGACA TATGGGGAGC ACATCTGGTT CGAGACCAAC GTGTCCGGGG ACTTCTGCTA 42 0 

CGTTGGGGAG CAGTACTGTG TAGCCAGGAT GCTGAAGTCA GTGTCTCGAA GAAAGTGCGC 480 

AGCCTGCAAG ATTGTGGTGC ACACGCCCTG CATCGAGCAG CTGGAGAAGA TAAATTTCCG 540 

85 CTGTAAGCCG TCCTTCCGTG AATCAGGCTC CAGGAATGTC CGCGAGCCAA CCTTTGTACG 60 0 

GCACCACTGG GTACACAGAC GACGCCAGGA CGGCAAGTGT CGG CACTGTG GGAAGGGATT 660 

CCAGCAGAAG TTCACCTTCC ACAGCAAGGA GATTGTGGCC ATCAGCTGCT CGTGGTGCAA 720 
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GCAGGCATAC CACAGCAAGG TGTCCTGCTT CATGCTGCAG CAGATCGAGG AGCCGTGCTC 780 

GCTGGGGGTC CACGCAGCCG TGGTCATCCC GCCCACCTGG ATCCTCCGCG CCCGGAGGCC 840 

CCAGAATACT CTGAAAGCAA GCAAGAAGAA GAAGAGGGCA TCCTTCAAGA GGAAGTCCAG 900 

CAAGAAAGGG CCTGAGGAGG GCCGCTGGAG ACCCTTCATC ATCAGGCCCA CCCCCTCCCC 960 

5 GCTCATGAAG CCCCTGCTGG TGTT TGTGAA CCCCAAGAGT GGGGGCAACC AGGGTGCAAA 102 0 

GAT CATCCAG TCTTTCCTCT GGTAT CTCAA TCCCCGACAA GTCTTCGACC TGAGCCAGGG 1080 

AGGGCCCAAG GAGGCGCTGG AGATGTACCG CAAAGTGCAC AACCTGCGGA TCCTGGCGTG 1140 

CGGGGGCGAC GGCACGGTGG GCTGGATCCT CTCCACCCTG GACCAGCTAC GCCTGAAGCC 12 00 

GCCACCCCCT GTTGCCATCC TGCCCCTGGG TACTGGCAAC GACTTGGCCC GAACCCTCAA 1260 

10 CTGGGGTGGG GGCTACACAG ATGAGCCTGT GTCCAAGATC CTCTCCCACG TGGAGGAGGG 132 0 

GAACGTGGTA CAGCTGGACC GCTGGGACCT CCACGCTGAG CCCAACCCCG AGGCAGGGCC 1380 

TGAGGACCGA GATGAAGGCG CCACCGACCG GTTGCCCCTG GATGTCTTCA ACAACTACTT 144 0 

CAGCCTGGGC TTTGACGCCC ACGTCACCCT GGAGTTCCAC GAGTCTCGAG AGGCCAACCC 1500 

AGAGAAATTC AACAGCCGCT TTCGGAATAA GATGTTCTAC GCCGGGACAG CTTTCTCTGA 1560 

15 CTTCCTGATG GGCAGCTCCA AGGACCTGGC CAAGCACATC CGAGTGGTGT GTGATGGAAT 1620 

GGACTTGACT CCCAAGATCC AGGACCTGAA ACCCCAGTGT GTTGTTTTCC TGAACATCCC 1680 

CAGGTACTGT GCGGGCACCA TGCCCTGGGG CCACCCTGGG GAGCACCACG ACTTTGAGCC 1740 

CCAGCGGCAT GACGACGGCT ACCTCGAGGT CATTGGCTTC ACCATGACGT CGTTGGCCGC 180 0 

GCTGCAGGTG GGCGGACACG GCGAGCGGCT GACGCAGTGT CGCGAGGTGG TGCTCACCAC 1860 

20 ATCCAAGGCC ATCCCGGTGC AGGTGGATGG CGAGCCCTGC AAGCTTGCAG CCTCACGCAT 1920 

CCGCATCGCC CTGCGCAACC AGGCCACCAT GGTGCAGAAG GCCAAGCGGC GGAGCGCCGC 1980 

CCCCCTGCAC AGCGACCAGC AGCCGGTGCC AGAGCAGTTG CGCATCCAGG TGAGTCGCGT 2040 

CAGCATGCAC GACTATGAGG CCCTGCACTA CGACAAGGAG CAGCTCAAGG AGGCCTCTGT 210 0 

GCCGCTGGGC ACTGTGGTGG TCCCAGGAGA CAGTGACCTA GAGCTCTGCC GTGCCCACAT 2160 

25 TGAGAGACTC CAGCAGGAGC CCGATGGTGC TGGAGCCAAG TCCCCGACAT GCCAGAAACT 222 0 

GTCCCCCAAG TGGTGCTTCC TGGACGCCAC CACTGCCAGC CGCTTCTACA GGATCGACCG 22 8 0 

AGCCCAGGAG CACCTCAACT ATGTGACTGA GAT CGCACAG GATGAGATTT ATATCCTGGA 2340 

CCCTGAGCTG CTGGGGGCAT CGGCCCGGCC TGACCTCCCA ACCCCCACTT CCCCTCTCCC 2400 

CACCTCACCC TGCTCACCCA CGCCCCGGTC ACTGCAAGGG GATGCTGCAC CCCCTCAAGG 2460 

30 TGAAGAGCTG ATTGAGGCTG CCAAGAGGAA CGACTTCTGT AAGCTCCAGG AGCTGCACCG 2 52 0 

AGCTGGGGGC GACCTCATGC ACCGAGACGA GCAGAGTCGC ACGCTCCTGC ACCACGCAGT 25 80 

CAGCACTGGC AGCAAGGATG TGGTCCGCTA CCTGCTGGAC CACGCCCCCC CAGAGATCCT 2 640 

TGATGCGGTG GAGGAAAACG GGGAGACCTG TTTGCACCAA GCAGCGGCCC TGGGCCAGCG 2 70 0 

CACCATCTGC CACTACATCG TGGAGGCCGG GGCCTCGCTC ATGAAGACAG ACCAGCAGGG 2760 

35 CGACACTCCC CGGCAGCGGG CTGAGAAGGC TCAGGACACC GAGCTGGCCG CCTACCTGGA 2820 

GAACCGGCAG CACTACCAGA TGATCCAGCG GGAGGACCAG GAGACGGCTG TGTAGCGGGC 2880 
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Seq ID NO: 172 Protein sequence: 
Protein Accession #: NP 003637 



MEPRDGSPEA 
GLQHLAPPPP 
MIjKSVSRRKC 
DGKCRHCGKG 
PPTWILRARR 
NPKSGGNQGA 
LSTLDQLRLK 
LHAEPNPEAG 
KMFYAGTAFS 
GHPGEHHDFE 
GEPCKLAASR 
YDKEQLKEAS 
TTASRFYRID 
SLQGDAAPPQ 
YLLDHAPPE I 
AQDTELAAYL 



11 
1 

RSSDSESASA 
TPGAPCSESE 
AACKIWHTP 
FQQKFTFHSK 
PQNTLKASKK 
KIIQSFLWYL 
PPPPVAILPL 
PEDRDEGATD 
DFLMGSSKDL, 
PQRHDDGYLE 
IRIALRNQAT 
VPLGTWVPG 
RAQEHLNYVT 
GEEDI EAAKR 
LDAVEENGET 
ENRQHYQMIQ 



21 

i 

SSSGSERDAG 
RQIRSTVDWS 
CIEQLEKINF 
EIVAISCSWC 
KKRASFKRKS 
NPRQVFDLSQ 
GTGNDLARTL 
RLPLDVFNNY 
AKHIRWCDG 
VIGFTMTSLA 
MVQKAKRRSA 
DSDLELCRAH 
EIAQDEIYIL 
NDFCKLQELH 
CLHQAAALGQ 
REDQETAV 



31 

1 

PEPDKAPRRL 
ESATYGEHIW 
RCKPSFRESG 
KQAYHSKVSC 
SKKGPEEGRW 
GGPKEALEMY 
NWGGGYTDEP 
FSLGFDAHVT 
MDLTPKIQDL 
ALQVGGHGER 
APLHSDQQPV 
IERLQQEPDG 
DPELLGASAR 
RAGGDLMHRD 
RTICHYIVEA 



41 

I 

NKRRFPGLRL 
FETNVSGDFC 
SRNVREPTFV 
FMLQQIEEPC 
RPFIIRPTPS 
RKVHNLRILA 
VSKILSHVEE 
LEFHESREAN 
KPQCWFLNI 
LTQCREWLT 
PEQLRIQVSR 
AGAKSPTCQK 
PDLPTPTSPL 
EQSRTLLHHA 
GASLMKTDQQ 



51 
I 

FGHRKAI TKS 
YVGEQYCVAR 
RHHWVHRRRQ 
SLGVHAAWI 
PliMKPLLVFV 
CGGDGTVGWI 
GNWQLDRWD 
PEKFNSRFRN 
PRYCAGTMPW 
TSKAIPVQVD 
VSMHDYEALH 
LSPKWCFIiDA 
PTSPCSPTPR 
VSTGSKDWR 
GDTPRQRAEK 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



Seq ID NO: 173 DNA sequence 
Nucleic Acid Accession ft: AF232772 
Coding sequence: 1-1662 



65 



70 



75 



80 



85 



ATGCCGGTGC 
GTGCTGGGTG 
CACTACCTGT 
CTTTTTGCCT 
TCCCCGCGGC 
TTGCGCAAGT 
GTGGTGGATG 
GGCGGCACCG 
GGTGAGACGG 
AGCACCTTCT 
TTCAAGGCCC 
GATCCAGCCT 
GTCGGGGGAG 
GTGCGGTACT 
CAGTGTATTA 
GACTGGTACC 
ACCAACCGAG 
ACAGAGACCC 
TACTTCCGGG 
TACGAGTCAG 
TTCTACCGGG 
ATTATCAAGG 



11 

I 

AGCTGACGAC 
GCATCCTGGC 
CCTTCGGCCT 
TCCTGGAGCA 
GGGGCTCGGT 
GCCTGCGCTC 
GCAACCGCCA 
AGCAGGCCGG 
AGGCCAGCCT 
CGTGCATCAT 
TCGGCGATTC 
GCACCATCGA 
ATGTCCAGAT 
GGATGGCCTT 
GTGGGCCCTT 
ATCAGAAGTT 
TCCTGAGCCT 
CCACTAAGTA 
AGTGGCTCTA 
TGGTCACGGG 
GCCGCATCTG 
CCACCTACGC 



21 
I 

AGCCCTGCGT 
AGCCTATGTG 
GTACGGCGCC 
CCGGCGCATG 
GGCACTGTGC 
GGCCCAGCGC 
GGAGGACGCC 
CTTCTTTGTG 
GCAGGAGGGC 
GCAGAAGTGG 
GGTGGACTAC 
GATGCTTCGA 
CCTCAACAAG 
CAACGTGGAG 
GGGCATGTAC 
CCTAGGCAGC 
TGGCTACCGA 
CCTCCGGTGG 
CAACTCTCTG 
TTTCTTCCCC 
GAACATTCTC 
CTGCTTCCTT 



31 
I 

GTGGTGGGCA 
ACGGGCTACC 
ATCCTGGGCC 
CGACGTGCCG 
ATTGCCGCAT 
ATCTCCTTCC 
TACATGCTGG 
TGGCGCAGCA 
ATGGACCGTG 
GGAGGCAAGC 
AT CCAGGTGT 
GTCCTGGAGG 
TACGACTCAT 
CGGGCCTGCC 
CGCAACAGCC 
AAGTGCAGCT 
ACTAAGTATA 
CTCAACCAGC 
TGGTTCCATA 
TTCTTCCTCA 
CTCTTCCTGC 
CGGGGCAATG 



41 
I 

CCAGCCTGTT 
AGTTCATCCA 
TGCACCTGCT 
GCCAGGCCCT 
ACCAGGAGGA 
CTGACCTCAA 
ACATCTTCCA 
ACTTCCATGA 
TGCGGGATGT 
GCGAGGTCAT 
GCGACTCTGA 
AGGATCCCCA 
GGATTTCCTT 
AGTCCTACTT 
TCCTCCAGCA 
TCGGGGATGA 
CCGCGCGCTC 
AAACCCGCTG 
AGCACCACCT 
TTGCCACGGT 
TGACGGTGCA 
CAGAGATGAT 



51 

TGCCCTGGCA 
CACGGAAAAG 
CATT CAGAGC 
GAAGCTGCCC 
CCCTGACTAC 
GGTGGTCATG 
CGAGGTGCTG 
GGCAGGCGAG 
GGTGCGGGCC 
GTACACGGCC 
CACTGTGCTG 
AGTAGGGGGA 
CCTGAGCAGC 
TGGCTGTGTG 
GTTCCTGGAG 
CCGGCACCTC 
CAAGTGCCTC 
GAGCAAGTCT 
CTGGATGACC 
TATACAGCTT 
GCTGGTGGGC 
CTTCATGTCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



252 



WO 02/086443 

CTCTACTCCC TCCTCTATAT GTCCAGCCTT CTGCCGGCCA AGATCTTTGC CATTGCTACC 1380 

ATCAACAAAT CTGGCTGGGG CACCTCTGGC CGAAAAACCA TTGTGGTGAA CTTCATTGGC 1440 

CTCATTCCTG TGTCCATCTG GGTGGCAGTT CTCCTGGAGG GGCTGGCCTA CACAGCTTAT 1500 

TGCCAGGACC TGTTCAGTGA GACAGAGCTA GCCTTCCTTG .TCTCTGGGGC TATACTGTAT 1560 

GGCTGCTACT GGGTGGCCCT CCTCATGCTA TATCTGGCCA TCATCGCCCG GCGATGTGGG 162 0 

AAGAAGCCGG AG C AGT AC AG CTTGGCTTTT GCTGAGGTGT GACATGGCCC CCAAGCAGAG 1680 

CGGGTAAAGT GCAATGGGTA AGGGAGGGAA GGGGAATGGA AGAGAAAAGA CAGGGTGGGA 174 0 

GGGAGGAGGG AGTGCTGTGT TTTAGTCTCT T AATGGT CCA AAGGACAAAT CTAAAATGCA 1800 

AAGAACGGTG ATGTAGTATG GCCTGACAGC TCTGTTTAGA GGAGGCAACA CTGATCCCCC 1860 

AGATGCAGGG CTGCAGGGGA TTCTGTGTTT TCAGACTGCC TGTCTGCTTG CAT CTGCACA 192 0 

TAGGCAGTAG CCTCCTCCTG GGCTCCAGAG GGCACTCAGA AGTTGTGCTA AACCAAGTTA 1980 

AGTCCCATTC AGTGGCAACT TGTGATAGGT ACCTGAGTGA CGGCAACCTG CGGAAGGAGG 2 040 

TTCTCCCAGC CCATCTGAAC ACAACCAGAG GTGG CAGGAG AATTTCTACT GAGCGAGGTG 2100 

GGCCGGTTAG TGTATGTCAC CCCCACCCCA CCCATAAGTA GTCAT CAATG CAATAAGATT 2160 

GCGCGTGAGA TACAAGGCCC AGAAGCCTGA TCTTTGGGCA TCAGAAAACA GGGT CCAGGA 222 0 

ATGGTGCTTT ATGTGAGATA CCCCACTCCA CATCAACATT CCAGGGATGA GCCAAACCAG 2280 

CAGGGAGTTA GCACTGAACT GCTTTTAAAA GTGCACATTA AAAAGGAAAG TTTGCCAGGA 2340 

GGAACAAAGA GATTGTGGTG GTGCTAAAGG AGGCCATAAG CTACACAGAG GCCTTGGGTG 2400 

TTCCACCTGG AAACTGCTCA GACGTCTAGA TGGGTTCTTA GCTTGTCTGT GAT CTCTGCT 2460 

GGGGAGATAA AAAGATTAAG CCCCAACATG TTCAGAAAAG AAGTGAAGTC TTGGGTATTT 252 0 

TAACCTGTAT ACTCTTGAAT TCCTCTCAAA TTCAGCTCTG ATCTGAGGCT AAGACACACT 2 580 

CCCCACTTCA CTTTCTTCAA AGCCACATTT TTTGAGGTAT CACTGCAGTC ACCTCTTCTA 2 640 

CCCTCATCAT CATAGGTAAG GTTTTCAAGG TGGCAATTGG GGCGGAGCCC CGGCTTCTTA 2 700 

TAGAAGCTTC AGCAGGAGGC AAGCGTGTTC TCAGCACATA TGGGAACTAT GAGGAGCCTC 2760 

TGATCAAATT GGCTACAATC TTGGAGCTGC TTGGACGGAT TCCTTGGCAG CCGGGTTAGC 2 82 0 

ATGTGTGACT TTCAGGCTAC TGTTCTTGAC AATCATCTCC AATGGAAAGC TTTTCAGTGT 2 8 80 

TCCCAAAGTG AACTCTCAAA TCCAAAATGG TTATCTTTGA GACCATCCAT TCTCCTCAGT 2 940 

GGCTTCTCCA GGGAATTCTT ACAGCCAAGT TGTGACAGTC ACTGCATTTG CCTGCTTCTT 3 000 

TCCAGAAACC AAACTAGGAG ATGAAACTGG TTCCTACATC CTAAGGTTCT TGCTTTCTCT 3 0 60 

CTCATGCCTC CTGAGGCTGT TTTTGGCTGT TTTCCCTCTG CTGCTTTTGG GGAATGAGGG 312 0 

GAAGCCATTT TCCAAGTGAC TTGCAATCCA GGCTGTTCTC AGCGTTTTGA GTTTAAAACC 3180 

TGGGATCCTG ACTAAGCCTT TGACTTAAGG GTTGCTTGCT TGCCCTCCAA ATGTCCTTTC 3240 

TCAAAGGGGC CAACTAACCC GTGCAGAACC AGCACTAAGG TGGACAGCAG ACAAGAGGGC 33 0 0 

AAGCCTCTAA TGTACCAAGT GCTTCCTACA AAGACGCAAG GTGTGCTCCG AACCACAGAT 3 3 60 

GGGCAAACCC TGGTGCTTTC CTTCATCTCC CACGAACTCA AGGGTTTTCC AAGTGTAGCT 3 420 

AACAGTTGCC ACATCACACA GACCTCCAGT TTCTGGTAAG ACTGCTGGTT GACATCAGAC 3 480 

CCAACCCATT GAAGGCTGGA AGGCAGCAGG CATTTGCTAA GGCAGCTGAT CCAGGCAATC 3540 

GTTCTGCTGG CCAAGAAGTT AAACTATTTT GAGCATTAGA ATGGAGGAAA TCCGGTCAGC 3 600 

CAAGTGCAGA GTTCAGACTT CGCTAAGGGC TTGTTTTTCT TCAGCATTTA CTTGAAGATT 3660 

AATGTAGGAT GACAGGCTCT CCTGGCTGTC CTACCATCAG CTCTGCCTTG CACTGTGGTC 3720 

GTCAACTTTC CTCAAATCAA AAACAGGCAG GTACAGGTAG TGGGCTCACA ACGTTTGACC 3 78Q 

TCGACTGGTT TTTCTAAGTT ATTTTGTACA TTTTTCAGCA GCAAAACCAA ACTGGGTCTT 3 840 

CAGCTTTATC CCCGTTTCTT GCAAGGGAAG AGCCTTTATA CAATTGGACG CATTTTGGTT 3 900 

TTTCCTCATT GAGAATTCAA ATCCTCTTTT GTATTGTTTC TACAATAATT TGTAAACATA 3 960 

TTTATTTTTA CCTGCTTTTT TTTTTTTTTT TAATTTTCAG GTCAAGTTTT TTATACTGCA 4020 
CTTATTTGTC AAAATAAAGA TTCTCACAT 

Seq ID NO: 174 Protein sequence: 
Protein Accession #: AAF36984 

1 11 21 31 41 51 

I I I I 1 I 

MPVQLTTALR WGTSLFALA VLGG I LAAYV TGYQFIHTEK HYLSFGLYGA ILGLHLLIQS 60 

LFAFLEHRRM RRAGQALKLP SPRRGSVALC IAAYQEDPDY LRKCLRSAQR I SFPDLKWM 120 

WDGNRQEDA YMLDIFHEVL GGTEQAGFFV WRSNFHEAGE GETEASLQEG MDRVRDWRA 180 

STFSCIMQKW GGKREVMYTA FKALGDSVDY IQVCDSDTVL DPACTIEMLR VLEEDPQVGG 240 

VGGDVQIliNK YDSWISFLSS VRYWMAFNVE RACQSYFGCV QCISGPLGMY RNSLLQQFLE 300 

DWYHQKFLGS KCSFGDDRHL TNRVLSLGYR TKYTARSKCL TETPTKYLRW LNQQTRWSKS 360 

YFREWLYNSL WFHKHHLWMT YESWTGFFP FFLIATVIQL FYRGRIWNIL LFLLTVQLVG 420 

I IKATYACFD RGNAEMIFMS LYSLLYMSSL IiPAKIFAIAT INKSGWGTSG RKTIWNFIG 480 

LIPVSIWVAV LLEGLAYTAY CQDLFSETEL AFLVSGAILY GCYWVALLML YLAI I ARRCG 540 
KKPEQYSLAF AEV 

Seq ID NO: 175 DNA sequence 
Nucleic Acid Accession #: NMJ300691 
Coding sequence: 43.. 1404 

1 11 21 31 41 51 

) I I I I I 

CCAGGAGCCC CAGTTACCGG GAGAGGCTGT GTCAAAGGCG CCATGAGCAA GAT CAGCGAG 60 

GCCGTGAAGC GCGCCCGCGC CGCCTTCAGC TCGGGCAGGA CCCGTCCGCT GCAGTTCCGA 120 

TTCCAGCAGC TGGAGGCGCT GCAGCGCCTG ATCCAGGAGC AGGAG CAGGA GCTGGTGGGC 180 

GCGCTGGCCG CAGACCTGCA CAAGAATGAA TGGAACGCCT ACTATGAGGA GGTGGTGTAC 240 

GTCCTAGAGG AGATCGAGTA CATGATCCAG AAGCTCCCTG AGTGGGCCGC GGATGAGCCC 300 

GTGGAGAAGA CGCCCCAGAC TCAGCAGGAC GAGCTCTACA TCCACTCGGA GCCACTGGGC 360 

GTGGTCCTCG TCATTGGCAC CTGGAACTAC CCCTTCAACC TCACCATCCA GCCCATGGTG 420 

GGCGCCATCG CTGCAGGGAA CGCAGTGGTC CTCAAGCCCT CGGAGCTGAG TGAGAACATG 480 

GCGAGCCTGC TGGCTACCAT CATCCCCCAG TACCTGGACA AGGATCTGTA CCCAGTAATC 540 

AATGGGGGTG TCCCTGAGAC CACGGAGCTG CTCAAGGAGA GGTTCGACCA TATCCTGTAC 600 

ACGGGCAGCA CGGGGGTGGG G AAGAT CATC ATGACGGCTG CTGCCAAGCA CCTGACCCCT 660 

GTCACGCTGG AGCTGGGAGG GAAGAGTCCC TGCTACGTGG ACAAGAACTG TGACCTGGAC 720 

GTGGCCTGCC GACGCATCGC CTGGGGGAAA TTCATGAACA GTGGCCAGAC CTGCGTGGCC 780 

CCAGACTACA TCCTCTGTGA CCCCTCGATC CAGAACCAAA TTGTGGAGAA GCTCAAGAAG 840 

TCACTGAAAG AGTTCTACGG GGAAGATGCT AAGAAATCCC GGGACTATGG AAGAATCATT 900 

AGTGCCCGGC ACTTCCAGAG GGTGATGGGC CTGATTGAGG GCCAGAAGGT GGCTTATGGG 960 

GGCACCGGGG ATGCCGCCAC TCGCTACATA GCCCCCACCA TCCTCACGGA CGTGGACCCC 1020 
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CAGTCCCCGG TGATGCAAGA GGAGAT CTTC GGGGCTGTGC TGCCCATCGT GTGCGTGCGC 1080 

AGCCTGGAGG AGGCCATCCA GTTCAT CAAC CAGCGTGAGA AGCCCCTGGC CCTCTACATG 1140 

TTCTCCAGCA ACGACAAGGT GATTAAGAAG ATGATTGCAG AGACATCCAG TGGTGGGGTG 12 00 

GCGGCCAACG ATGTCAT CGT CCACATCACC TTGCACTCTC TGCCCTTCGG GGGCGTGGGG 1260 

AACAGCGGCA TGGGATCCTA CCATGGCAAG AAGAGCTTCG AGACTTTCTC TCACCGCCGC 1320 

TCTTGCCTGG TGAGGCCTCT GATGAATGAT GAAGGCCTGA AGGTCAGATA CCCCCCGAGC 1380 

CCGGCCAAGA TGACCCAGCA CTGAGGAGGG GTTGCTCCGC CTGGCCTGGC CATACTGTGT 14 40 

CCCATCGGAG TGCGGACCAC CCTCACTGGC TCTCCTGGCC CTGGAGAATC GCTCCTGCAG 1500 

CCCCAGCCCA GCCCCACTCC TCTGCTGACC TGCTGACCTG TGCACACCCC ACTCCCACAT 1560 

GGGCCCAGGC CTCACCATTC CAAGTCTCCA CCCCTTTCTA GACCAATAAA GAGACAAATA 162 0 
CAATTTTCTA ACTCGG 

Seq ID NO: 176 Protein sequence: 
Protein Accession #: NP_000682 

1 11 21 31 41 51 

111)11 

MSKISEAVKR ARAAFSSGRT RPLQFRFQQL EALQRLIQEQ EQELVGALAA DLHKNEWNAY 60 

YEEWYVLEE IEYMIQKLPE WAADEPVEKT PQTQQDELYI HSEPLGWLV I GTWNYPFNL 120 

TIQPMVGAIA AGNAWLKPS ELSENMASLL ATIIPQYLDK DLYPVINGGV PETTELLKER 180 

FDHILYTGST GVGKI IMTAA AKHLTPVTLE LGGKSPCYVD KNCDLDVACR RIAWGKFMNS 2 40 

GQTCVAPDYI LCDPSIQNQI VEKLKKSLKE FYGEDAKKSR DYGRI ISARH FQRVMGLIEG 300 

QKVAYGGTGD AATR Y I APT I LTDVDPQSPV MQEEIFGPVL PIVCVRSLEE AIQFINQREK 3 60 

PLALYMFSSN DKVIKKMIAE TSSGGVAAND VIVHITLHSL PFGGVGNSGM GSYHGKKSFE 420 
TFSHRRSCLV RPLMNDEGLK VRYPPSPAKM TQH 

Seq ID NO: 177 DNA sequence 

Nucleic Acid Accession #: NM_001067.1 

Coding sequence: 108-4703 

1 11 21 31 41 51 

I I ! I I 1 

CTAACCGACG CGCGTCTGTG GAGAAGCGGC TTGGTCGGGG GTGGTCTCGT GGGGTCCTGC 60 

CTGTTTAGTC GCTTTCAGGG TTCTTGAGCC CCTTCACGAC CGTCACCATG GAAGTGTCAC 12 0 

CATTGCAGCC TGTAAATGAA AATATGCAAG T CAAC AAAAT AAAGAAAAAT GAAGATGCTA 180 

AGAAAAGACT GTCTGTTGAA AGAAT CTATC AAAAGAAAAC ACAAT TGGAA CATATTTTGC 240 

TCCGCCCAGA CACCTACATT GGTTCTGTGG AATTAGTGAC CCAGCAAATG TGGGTTTACG 300 

ATGAAGATGT TGGCATTAAC TATAGGGAAG TCACTTTTGT TCCTGGTTTG TACAAAATCT 3 60 

TTGATGAGAT TCTAGTTAAT GCTGCGGACA ACAAACAAAG GGACCCAAAA ATGTCTTGTA 42 0 

TTAGAGTCAC AATTGATCCG GAAAACAATT TAATTAGTAT ATGGAATAAT GGAAAAGGTA 480 

TTCCTGTTGT TG AA C AC AAA GTTGAAAAGA TGTATGTCCC AGCTCTCATA TTTGGACAGC 540 

TCCTAACTTC TAGTAACTAT GATGATGATG AAAAGAAAGT GACAGGTGGT CGAAATGGCT 600 

ATGGAGCCAA ATTGTGTAAC ATATTCAGTA CCAAATTTAC TGTGGAAACA GCCAGTAGAG 660 

AATACAAGAA AATGTTCAAA CAGACATGGA TGGATAATAT GGGAAGAGCT GGTGAGATGG 72 0 

AACTCAAGCC CTTCAATGGA GAAGATTATA CATGTAT CAC CTTTCAGCCT GATTTGTCTA 780 

AGTTTAAAAT GCAAAGCCTG GACAAAGATA TTGTTGCACT AATGGT CAGA AGAGCATATG 840 

ATATTGCTGG ATCCACCAAA GATGTCAAAG TCTTTCTTAA TGGAAATAAA CTGCCAGTAA 90 0 

AAGGATTTCG TAGTTATGTG GACATGTATT TGAAGGACAA GTTGGATGAA ACTGGTAACT 960 

CCTTGAAAGT AATACATGAA CAAGTAAACC ACAGGTGGGA AGTGTGTTTA ACTATGAGTG 102 0 

AAAAAGGCTT TCAGCAAATT AGCTTTGTCA ACAGCATTGC TACAT CCAAG GGTGGCAGAC 1080 

ATGTTGATTA TGTAGCTGAT CAGATTGTGA CTAAACTTGT TGATGTTGTG AAGAAGAAGA 1140 

ACAAGGGTGG TGTTGCAGTA AAAGCACATC AGGTGAAAAA TCACATGTGG ATTTTTGTAA 12 00 

ATGCCTTAAT TGAAAACCCA ACCTTTGACT CTCAGACAAA AGAAAACATG ACTTTACAAC 12 60 

CCAAGAGCTT TGGATCAACA TGCCAATTGA GTGAAAAATT TATCAAAGCT GCCATTGGCT 132 0 

GTGGTATTGT AGAAAGCATA CTAAACTGGG TGAAGTTTAA GGCCCAAGTC CAGTTAAACA 1380 

AGAAGTGTTC AGCTGTAAAA CATAATAGAA TCAAGGGAAT TCCCAAACTC GATGATGCCA 1440 

ATGATGCAGG GGGCCGAAAC TCCACTGAGT GTACGCTTAT CCTGACTGAG GGAGAT T C AG 1500 

CCAAAACTTT GGCTGTTTCA GGCCTTGGTG TGGTTGGGAG AGACAAATAT GGGGTTTTCC 1560 

CTCTTAGAGG AAAAATACTC AATGTTCGAG AAGCTTCTCA TAAGCAGATC ATGGAAAATG 1620 

CTGAGATTAA CAATAT CATC AAGATTGTGG GTCTTCAGTA CAAGAAAAAC TATGAAGATG 1680 

AAGATTCATT GAAGACGCTT CGTTATGGGA AGATAATGAT TATGACAGAT CAGG AC CAAG 1740 

ATGGTTCCCA CATCAAAGGC TTGCTGATTA ATTTTATCCA TCACAACTGG CCCTCTCTTC 1800 

TGCGACATCG TTTTCTGGAG GAATTTATCA CTCCCATTGT AAAGGTATCT AAAAACAAGC 1860 

AAGAAATGGC ATTTTACAGC CTTCCTGAAT TTGAAGAGTG GAAGAGTTCT ACTCCAAATC 1920 

ATAAAAAATG GAAAGTCAAA TATTACAAAG GTTTGGGCAC CAGCACATCA AAGGAAGCTA 1980 

AAGAATACTT TGCAGATATG AAAAGACATC GTATCCAGTT CAAATATTCT GGTCCTGAAG 2040 

ATGATGCTGC TATCAGCCTG GCCTTTAGCA AAAAACAGAT AGATGATCGA AAGGAATGGT 2100 

TAACTAATTT CATGGAGGAT AGAAGACAAC GAAAGTTACT TGGGCTTCCT GAGGATTACT 2160 

TGTATGGACA AACTACCACA TATCTGACAT ATAATGACTT CAT CAAC AAG GAACTTATCT 2220 

TGTTCT C AAA TTCTGATAAC GAGAGATCTA TCCCTTCTAT GGTGGATGGT TTGAAACCAG 2280 

GTCAGAGAAA GGTTTTGTTT ACTTGCTTCA AACGGAATGA CAAGCGAGAA GTAAAGGTTG 2340 

CCCAATTAGC TGGATCAGTG GCTGAAATGT CTTCTTATCA TCATGGTGAG ATGTCACTAA 240 0 

TGATGACCAT TATCAATTTG GCTCAGAATT TTGTGGGTAG CAATAATCTA AACCTCTTGC 2460 

AGCCCATTGG TCAGTTTGGT ACCAGGCTAC ATGGTGGCAA GGATTCTGCT AGTCCACGAT 252 0 

ACATCTTTAC AATGCTCAGC TCTTTGGCTC GATTGTTATT TCCACCAAAA GATGATCACA 2580 

CGTTGAAGT T TTTATATGAT GACAACCAGC GTGTTGAGCC TGAATGGTAC ATTCCTATTA 2640 

TTCCCATGGT GCTGATAAAT GGTGCTGAAG GAATCGGTAC TGGGTGGTCC TGCAAAATCC 2700 

CCAACTTTGA TGTGCGTGAA ATTGTAAATA ACATCAGGCG TTTGATGGAT GGAGAAGAAC 2760 

CTTTGCCAAT GCTTCCAAGT TACAAGAACT TCAAGGGTAC TATTGAAGAA CTGGCTCCAA 2820 

AT CAATATGT GATTAGTGGT GAAGTAGCTA TTCTTAATTC TACAACCATT GAAATCTCAG 2880 

AGCTTCCCGT CAGAACATGG ACCCAGACAT ACAAAGAACA AGTTCTAGAA CCCATGTTGA 2 940 

ATGGCACCGA GAAGACACCT CCTCTCATAA CAGACTATAG GGAATACCAT ACAGATACCA 3000 

CTGTGAAATT TGTTGTGAAG ATGACTGAAG AAAAACTGGC AGAGGCAGAG AGAGTTGGAC 3060 

TACACAAAGT CTTCAAACTC CAAACTAGTC TCACATGCAA CTCTATGGTG CTTTTTGACC 3120 

ACGTAGGCTG TTTAAAGAAA TATGACACGG TGTTGGATAT TCTAAGAGAC TTTTTTGAAC 3180 

TCAGACTTAA ATATTATGGA TTAAGAAAAG AATGGCTCCT AGGAATGCTT GGTGCTGAAT 3240 

CTGCTAAACT GAATAATCAG GCTCGCTTTA TCTTAGAGAA AATAGATGGC AAAAT AATCA 330 0 
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TTGAAAATAA GCCTAAGAAA GAATTAATTA AAGTT CTGAT TCAGAGGGGA TATGATTCGG 3 360 

ATCCTGTGAA GGCCTGGAAA GAAGCCCAGC AAAAGGTTCC AGATGAAGAA GAAAATGAAG 342 0 

AGAGTGACAA CGAAAAGGAA ACTGAAAAGA GTGACTCCGT AACAGATTCT GGACCAACCT 3 480 

TCAACTATCT TCTTGATATG CCCCTTTGGT ATTTAACCAA GGAAAAGAAA GATGAACTCT 3540 

5 GCAGGCTAAG AAATGAAAAA GAACAAGAGC TGGACACATT AAAAAGAAAG AGT CCATCAG 3600 

ATTTGTGGAA AGAAGACTTG GCTACATTTA TTGAAGAATT GGAGGCTGTT GAAGCCAAGG 3 660 

AAAAACAAGA TGAACAAGTC GGACTTCCTG GGAAAGGGGG GAAGGCCAAG GGGAAAAAAA 3 72 0 

CACAAATGGC TGAAGTTTTG CCTTCTCCGC GTGGTCAAAG AGTCATTCCA CGAATAACCA 3 780 

TAGAAATGAA AGCAGAGGCA GAAAAGAAAA ATAAAAAGAA AATTAAGAAT GAAAATACTG 3840 

10 AAGGAAGCCC TCAAGAAGAT GGTGTGGAAC TAGAAGGCCT AAAACAAAGA TTAGAAAAGA 3900 

AACAGAAAAG AGAACCAGGT ACAAAGACAA AGAAACAAAC TACATTGGCA TTTAAGCCAA 3960 

TCAAAAAAGG AAAGAAGAGA AATCCCTGGC CTGATT CAGA ATCAGATAGG AGCAGTGACG 4020 

AAAGTAATTT TGATGTCCCT CCACGAGAAA C AG AG CCACG GAGAGCAGCA ACAAAAACAA 4080 

AATTCACAAT GGATTTGGAT TCAGATGAAG ATTTCTCAGA TTTTGATGAA AAAACTGATG 4140 

15 ATGAAGATTT TGTCCCATCA GATGCTAGTC CACCTAAGAC CAAAACTTCC CCAAAACTTA 42 00 

GTAACAAAGA ACTGAAACCA CAGAAAAGTG TCGTGTCAGA CCTTGAAGCT GATGATGTTA 4260 

AGGGCAGTGT ACCACTGTCT TCAAGCCCTC CTGCTACACA TTTCCCAGAT GAAACTGAAA 4320 

TTACAAACCC AGTTCCTAAA AAGAATGTGA CAGTGAAGAA GACAGCAGCA AAAAGTCAGT 4380 

CTTCCACCTC CACTACCGGT GCCAAAAAAA GGGCTGCCCC AAAAGGAACT AAAAGGGATC 4440 

20 CAGCTTTGAA TTCTGGTGTC TCTCAAAAGC CTGATCCTGC CAAAACCAAG AATCGCCGCA 4500 

AAAGGAAGCC ATCCACTTCT GATGATTCTG ACTCTAATTT TGAGAAAATT GTTT CGAAAG 4560 

CAGTCACAAG CAAGAAATCC AAGGGGGAGA GTGATGACTT CCATATGGAC TTTGACTCAG 4620 

CTGTGGCTCC TCGGGCAAAA TCTGTACGGG CAAAGAAACC TATAAAGTAC CTGGAAGAGT 4680 

CAGATGAAGA TGATCTGTTT TAAAATGTGA GGCGATTATT TTAAGTAATT ATCTTACCAA 4 740 

25 GCCCAAGACT GGTTTTAAAG TTACCTGAAG CTCTTAACTT CCTCCCCTCT GAATTTAGTT 4800 

TGGGGAAGGT GTTTTTAGTA CAAGACATCA AAGTGAAGTA AAGCCCAAGT GTTCTTTAGC 48 60 

TTTTTATAAT ACTGTCTAAA TAGTGACCAT CTCATGGGCA TTGTTTTCTT CTCTGCTTTG 4 92 0 

TCTGTGTTTT GAGTCTGCTT TCTTTTGTCT TTAAAACCTG ATTTTTAAGT TCTTCTGAAC 4 980 

TGTAGAAATA GCTATCTGAT CACTTCAGCG TAAAGCAGTG TGTTTATTAA CCATCCACTA 5040 

30 AGCTAAAACT AGAGCAGTTT GATTTAAAAG TGTCACTCTT CGTCCTTTTC TACTTTCAGT 5100 

AGATATGAGA TAG AG CATAA TTATCTGTTT TATCTTAGTT TTATACATAA TTTACCATCA 5160 

GATAGAACTT TATGGTTCTA GTACAGATAC TCTACTACAC TCAGCCTCTT ATGTGCCAAG 522 0 

TTTTTCTTTA AG C AATGAGA AATTGCTCAT GTTCTTCATC TTCTCAAATC ATCAGAGGCC 52 80 

AAAGAAAAAC ACTTTGGCTG TGTCTATAAC TTGACACAGT CAATAGAATG AAGAAAATTA 5340 

35 GAGTAGTTAT GTGATTATTT CAGCTCTTGA CCTGTCCCCT CTGGCTGCCT CTGAGT CTGA 5400 

ATCTCCCAAA GAGAGAAACC AATTTCTAAG AGGACTGGAT TGCAGAAGAC TCGGGGACAA 5460 

CATTTGATCC AAGATCTTAA ATGTTATATT GATAACCATG CTCAGCAATG AGCTATTAGA 552 0 

TTCATTTTGG GAAATCTCCA TAATTTCAAT TTGTAAACTT TGTTAAGACC TGT GTACATT 5580 

GTTATATGTG TGTGACTTGA GTAATGTTAT CAACGTTTTT GTAAATATTT ACTATGTTTT 5640 

40 TCTATTAGCT AAATTCCAAC AATTTTGTAC TTTAATAAAA TGTTCTAAAC ATTGC 
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Seq ID NO: 178 Protein sequence: 
Protein Accession #: NP_001058.1 



45 



50 



55 



60 



65 



70 



1 

I 

MEVSPLQPVN 
MWVYDEDVGI 
NGKGIPWEH 
TASREYKKMF 
RRAYD I AGST 
LTMSEKGFQQ 
WIFVNALIEN 
VQLNKKCSAV 
YGVFPLRGKI 
DQDQDGSHIK 
STPNHKKWKV 
RKEWLTNFME 
GLKPGQRKVL 
IiNLLQPlGQF 
YIPIIPMVLI 
ELAPNQYVIS 
HTDTTVKFW 
DFFELRLKYY 
GYDSDPVKAW 
KDELCRLRNE 
KGKKTQMAEV 
RLEKKQKREP 
ATKTKFTMDL 
ADDVKGSVPL 
TKRDPALNSG 
DFDSAVAPRA 



11 
I 

ENMQVNKI KK 
NYREVTFVPG 
KVEKMYVPAL 
KQTWMDNMGR 
KDVKVFLNGN 
ISFVNSIATS 
PTFDSQTKEN 
KHNRIKGIPK 
LNVREASHKQ 
GLLINFIHHN 
KYYKGLGTST 
DRRQRKLLGL 
FTCFKRNDKR 
GTRLHGGKDS 
NGAEG I GTGW 
GEVAILNSTT 
KMTEEKLAEA 
GLRKEWLLGM 
KEAQQKVPDE 
KEQELDTLKR 
LPSPRGQRVI 
GTKTKKQTTL 
DSDEDFSDFD 
SSSPPATHFP 
VSQKPDPAKT 
KSVRAKKPIK 



21 
1 

NEDAKKRLSV 
LYKIFDEILV 
IFGQLLTSSN 
AGEMELKPFN 
KLPVKGFRSY 
KGGRHVDYVA 
MTLQPKSFGS 
LDDANDAGGR 
I MENAE INN I 
WPS LLRHRFL 
SKEAKEYFAD 
PEDYLYGQTT 
EVKVAQLAGS 
AS PRY I FTML 
SCKIPNFDVR 
IEISELPVRT 
ERVGLHKVFK 
LGAESAKLNN 
EENEESDNEK 
KSPSDLWKED 
PRITIEMKAE 
AFKP I KKGKK 
EKTDDEDFVP 
DETEITNPVP 
KNRRKRKPST 
YLEESDEDDL 



31 

i ' 

ERIYQKKTQL 
NAADNKQRDP 
YDDDEKKVTG 
GEDYTCITFQ 
VDMYLKDKLD 
DQ I VTKLVDV 
TCQLSEKFIK 
NSTECTLILT 
IKIVGLQYKK 
EEFITPIVKV 
MKRHRIQFKY 
TYLTYNDFIN 
VAEMSSYHHG 
SSLARLLFPP 
EIVNNIRRLM 
WTQTYKEQVL 
LQTSLTCNSM 
QARFILEKID 
ETEKSDSVTD 
LATFIEELEA 
AEKKNKKK I K 
RNPWPDSESD 
SDASPPKTKT 
KKNVTVKKTA 
SDDSDSNFEK 
F 



41 

I 

EHILLRPDTY 
KMSCIRVTID 
GRNGYGAKIiC 
PDLSKFKMQS 
ETGNSLKVIH 
VKKKNKGGVA 
AAIGCGIVES 
EGDSAKTLAV 
NYEDEDSLKT 
SKNKQEMAFY 
SGPEDDAAIS 
KELILFSNSD 
EMSLMMTIIN 
KDDHTLKFLY 
DGEEPLPMLP 
EPMLNGTEKT 
VLFDHVGCLK 
GKI I IENKPK 
SGPTFNYLLD 
VEAKEKQDEQ 
NENTEGSPQE 
RSSDESNFDV 
SPKLSNKELK 
AKSQSSTSTT 
IVSKAVTSKK 



51 
I 

IGSVELVTQQ 
PENNLISIWN 
NIFSTKFTVE 
LDKDIVALMV 
EQVNHRWEVC 
VKAHQVKNHM 
ILNWVKFKAQ 
SGLGWGRDK 
LRYGKIMIMT 
SLPEFEEWKS 
LAFSKKQIDD 
NERSIPSMVD 
LAQNFVGSNN 
DDNQRVEPEW 
SYKNFKGTIE 
PPLITDYREY 
KYDTVLDILR 
KELIKVLIQR 
MPLWYLTKEK 
VGLPGKGGKA 
DGVELEGLKQ 
PPRETEPRRA 
PQKSWSDLE 
GAKKRAAPKG 
SKGESDDFHM 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



75 Seq ID NO: 179 DNA sequence 

Nucleic Acid Accession #: Eos sequence 
Coding sequence: 148-7095 



80 



85 



CACACATACG 
CAAAAAAAAC 
CGGCGAGGGG 
CAGCTCCTCT 
CTTGTTGAAG 
AAATATCCAA 
CAAGTAAATG 



11 
I 

CACGCACGAT 
ATTTCCTTCG 
CCGCAGACCG 
GTGTTTGCCG 
AGATTGGCTG 
CATGTAATAG 
TGAATCTTAA 



21 
I 

CTCACTTCGA 
CTCCCCCTCC 
TCTGGAAATG 
CCTGGATTGG 
GTCCTATACA 
CCCAAAACAA 
GAAACTTAAA 



31 

! 

TCTATACACT 
CTCTCCACTC 
CGAATCCTAA 
GCTAATGGAT 
GGAGCACTGA 
TCTCCTATCA 
TTTCAGGGTT 



41 
I 

GGAGGATTAA 
TGAGAAGCAG 
AGCGTTTCCT 
ACTACAGACA 
ATCAAAAAAA 
ATATTGATGA 
GGGATAAAAC 



51 
I 

AACAAACAAA 
AGGAGCCGCA 
CGCTTGCATT 
ACAGAGAAAA 
TTGGGGAAAG 
AGATCTTACA 
AT CATTGGAA 



60 
120 
180 
240 
300 
360 
420 
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AACACATTCA TTCATAACAC TGGGAAAACA 
GTCAGCGGAG GAGTTTCAGA AATGGTGTTT 
AAATGCAATA TGTCAT CTGA TGGATCAGAG 
GAGATGCAAA TCTACTGCTT TGATGCGGAC 
5 GGAAAAGGGA AGTTAAGAGC TTTATCCATT 

GATTTCAAAG CGATTATTGA TGGAGTCGAA 
TTAGATCCAT TCATACTGTT GAACCTTCTG 
AATGGCTCAT TGACATCTCC TCCCTGCACA 
ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT 

10 TCTGGTTATG T CATGCTGAT GGACTACTTA 

TTGT CTAGAC AGGTGTTTTC CTCATACACT 
AGTTCAGAAC CAGAAAATGT TCAGGCTGAC 
TGGGAAAGAC CTCGAGTCGT TTATGATACC 
CAGTTGGATG GAGAGGACCA AACCAAGCAT 

15 GGTGCTATTC TCAATAATTT GCTACCCAAT 

TGCACTAATG GCTTATATGG AAAATACAGC 
AATCCTGAAC TTGATCTTTT CCCTGAATTA 
GAAGAGGGAA AAGACATTGA AGAAGGCGCT 
AACCAAATCA GGAAAAAGGA ACCCCAGATT 

20 ACGAAATACA ATGAAGCCAA GACTAACCGA 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT 
A C AG AAAAAG ATATTTCCTT GACTTCTCAG 
GAAGGTACTT CAGCCTCTTT AAATGATGGC 
AACTTGTCGG GGACTGCAGA ATCCTTAAAT 

25 AGTTTATTGA CCAGTTTCAA GCTTGATACT 

GCAACTTCTG CTATCCCATT CATCTCTGAG 
GAAAACCCAG AGACAATAAC ATATGATGTC 
GAAGATTCAA CTT CAT CAGG TTCAGAAGAA 
GTGTGGTTTC CTAGCTCTAC AGACATAACA 

30 AGCTTTCTCC AGACTAATTA CACTGAGATA 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG 
CATTATTCTA CCTTTGCCTA CTTCCCAACT 
TCCAGACAAC AGGATTTGGT CTCCACGGTC 
GTATACAATG GTGAGACACC TCTTCAACCT 

35 ACCCCTTTGT TGCTTGACAA TCAGATCCTC 

TCGGCCTTGC ATGCTACGCC TGTATTTCCC 
TCTTCCTATG ATGGTGCACC TTTGCTTCCA 
TTTCGCCATC TGCATACAGT TTCTCAAATC 
GATAAGGTGC CCTTGCATGC TTCTCTGCCA 

40 AGCCTTGCTC AGTATTCTGA TGTGCTGTCC 

TTTGGTAGTG AATCTGGTGT TCTTTATAAA 
AGCAGTGATG CCATGATGCA TGCACGTTCT 
GATAATGAGG GCTCCCAACA CATCTTCACT 
GATTCTGTGG GTGTAACTTA TCAGGGTTCC 

45 CCTAAGTCTT CGTTAATAAC CCCAACTGCA 

GGTGATGGGG AATGGTCTGG AGCCTCTTCT 
GGGCTGACAG CCCTTAACAT TTCTTCACCT 
TCTGTGTTTG GTGATGATAA TAAGGCGCTT 
ACTGAACTGC AAATTCCTTC TTTCAATGAG 

50 CCCAACATGT ATGATAATGT AAATAAGTTG 

ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG 
GATCATGAGA TTAGTCAAGT TCCAGAAAAT 
TCTCAAGCAT CTGGTGACAC TTCGCTTAAA 
TCCTCTGACC CTGCTTCTAG TGAAATGTTA 

55 ACCTCAGCTT CTTTTAGTAC TGAAGTATTG 

GACACCTTGC TTAAAACTGT TCTTCCAGCT 
CCCAAAGTTG ATAAAATTAG TTCTACAATG 
AGTGAAAACA TGCTGCACTC TACATCTGTA 
ATGCACTCTG CTTCACTTCA AGGTTTGACC 

60 GTTTTGTTAA AAAGTGAAAG TTCCCACCAA 

TTGTT CCAAA CGGCCAATTT GGAGATTAAC 
TTTGCTACAC CTGTTTTATC AATTGATGAA 
CATTCCGATG AAATTTTAAC CTCCACCAAA 
ATTCCAACAG TTGCTTCTGA TACATTTGTA 

65 GGGCATGTTG CCATTACAGC TGTTTCTCCC 

TTGCTGTTTC CTTCTAAGGC AACTTCTGAG 
TTAGTGGGTG GTGGTGAAGA TGGTGACACT 
AGAGGTAGTG ATGGCTTATC CATT CATAAG 
CAGGAAAAGG TAATGAATGA TTCAGACACC 

70 CCAATCTCAT ACTC ACT AT C TGAGAATTCT 

TCAGACAGTC AAACTGGTAT GGACAGAAGT 
TCCCAAAAGC ACAATGATGG AAAAGAGGAA 
CCTCTCAGCC CTGAATCTAA AGCATGGGCA 
GGGCAAGGTA CCTCAGATAG CCTTAATGAG 

75 GACACTAATG AAAAAGATGC TGATGGGATC 

GGATTCCCAC AGTCCCCAAC AT CATCTGTT 
TCAGAGGCAG AGGCCAGTAA TAGTAGCCAT 
GAATCCGAGA AGAAGGCAGT TATACCCCTT 
CTAGTGGTTC TTGTGGGTAT TCTCATCTAC 

80 TACTTAGAGG ACAGTACATC CCCTAGAGTT 

ATTTCAGATG ATGTCGGAGC AATTCCAATA 
CATGCAAGTA GTGGGTTTAC TGAAGAATTT 
CAGAGCTGTA CTGTTGACTT AGGTATTACA 
CACAAGAATC GATACATAAA TATCGTTGCC 

85 CTTGCTGAAA AGGATGGCAA ACTGACTGAT 

AACAGACCAA AAGCTTATAT TGCTGCCCAA 
TGGAGAATGA TATGGGAACA TAATGTGGAA 



GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

CATAGTTTAG AAGGACAAAA ATTTCCACTT 60 0 

CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

CAAAACAATT TTCGAGAGCA ACAGTACAAG 102 0 

GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

ATGATTGAGA AGTTTGCAGT TTTGTACCAG 12 00 

GAATTTTTGA CAGATGGCTA TCAAGACTTG 12 60 

ATGAGTTATG TT CTT CAG AT AGTAGCCATA 132 0 

GACCAACTGA TTGTCGACAT GCCTACTGAT 13 80 

ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

TCTACCACAA C AC AC T AC AA TCGCATAGGG 1560 

TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 162 0 

TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

TCTAAAACTG TTCTTAGATC TCCACATATG 180 0 

ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

AACATAT CCC AAGGGTATAT ATTTTCCTCC 1980 

CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 210 0 

GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

CGTGTTGATG AAT CTGAGAA GACAACCAAG 222 0 

GGTCCCTCAG TTACAGATCT GGAAATGCCA 22 80 

GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

AACGTGGTAT ACTCGCAGAC AACCCAACCG 240 0 

TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 2460 

AACACTACCC CTGCTGCTTC AAGTAGTGAT 2520 

AGTGTCGATG TGTCATTTGA AT CCAT CCTG 2580 

TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 2 640 

CTTCCACAAG TTACTTCAGC TACCGAGAGT 2700 

GTGGCTGGGG GTGATTTGCT AT T AGAGCCC 2 760 

ACT ACT CATG CTGCTTCAGA GACGCTGGAA 282 0 

ACGCTTATGT TTTCTCAAGT TGAACCACCC 2 880 

TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2 940 

GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 3000 

TTATTTAGCG GCCCTAGCCA TATACCAATA 3 0 60 

TCATTACTGC AGCCTACTCA TGCCCTCTCT 312 0 

GATAGTGAAT TTCTTTTACC TGACACAGAT 3180 

GTTTCTGTAG CTGAATTTAC ATATACAACA 3 240 

TCTAAAAGTG AAATAATATA TGGAAATGAG 33 00 

ATGGTTTACC CTTCTGAAAG CACAGTCATG 33 60 

AATGCGTCTT TACAAGAAAC CTCTGTTTCC 342 0 

TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 

AACTTTTCAG TTCAACCTAC ACATACTGTC 3540 

CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 3 60 0 

TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3 660 

CTACAACCTT CCTTTCAGGC TTCTGATGTT 3 72 0 

GTGCCCAGTG ATCCAATATT GGTTGAAACC 3 780 

TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3 840 

CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3 900 

ATTTCCTATG CAAGTGAGAA ATATGAACCA 3 960 

GTGGTACCTT CTTTGTACAG TAATGATGAG 4 02 0 

CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 

CCATTAAATA CACTAATAAA TAAGCTTATA 4140 

AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 42 00 

TCTACTGATC ATTCTGTTCC TATAGGAAAT 42 60 

CACAGAGATG GTTCTGTAAC CTCAACAAAG 432 0 

CTGAGT CAT A GTGCCAAATC TGATGCCGGT 43 80 

GATGATGATG GTGATGATGA TGATGATGAC 4440 

TGTATGTCAT GCTCATCCTA TAGAGAATCA 4500 

CACGAAAACA GTCTTATGGA TCAGAATAAT 4560 

GAAGAAGATA ATAGAGTCAC AAGTGTATCC 4620 

CCTGGTAAAT CACCATCAGC AAATGGGCTA 4680 

AATGACATTC AGACTGGTAG TGCTCTGCTT 4740 

GTTCTGACAA GTGATGAAGA AAGTGGATCA 4 800 

AATGAGACTT CCACAGATTT CAGTTTTGCA 4 860 

CTGGCAGCAG GTGACTCAGA AATAACTCCT 4920 

ACTAGCGAGA ACTCAGAAGT GTTCCACGTT 4980 

GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 5040 

GTGATCGTGT CAGCCCTGAC TTTTATCTGT 5100 

TGGAGGAAAT GCTTCCAGAC TGCACACTTT 5160 

AT AT CCAC AC CTCCAACACC TATCTTTCCA 5220 

AAGCACTTTC CAAAGCATGT TGCAGATTTA 52 80 

GAGACACTGA AAGAGTTTTA CCAGGAAGTG 5340 

GCAGACAGCT CCAACCACCC AGACAACAAG 5400 

TATGATCATA GCAGGGTTAA GCTAGCACAG 5460 

TATATCAATG CCAATTATGT TGATGGCTAC 5520 

GGCCCACTGA AATCCACAGC TGAAGATTTC 5580 

GTTATTGTCA TGATAACAAA CCTCGTGGAG 5640 
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AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 5700 

TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 5760 

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 5820 

ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 5880 

CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 5940 

CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 6000 

CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 6060 

AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 6120 

GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGT CAT A TTCATGCCTA TGTTAATGCA 6180 

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GCTCCTGAGC 6240 

CAGTCAAATA TACAGCAGAG TGACTATTCT GCAGCCCTAA AGCAATGCAA CAGGGAAAAG 6300 

AATCGAACTT CTTCTATCAT CCCTGTGGAA AGATCAAGGG TTGGCATTTC ATCCCTGAGT 63 60 

GGAGAAGGCA CAGACTACAT CAATGCCTCC TATATCATGG GCTATTACCA GAGCAATGAA 6420 

TTCATCATTA CCCAGCACCC TCTCCTTCAT ACCATCAAGG ATTTCTGGAG GATGATATGG 6480 

GACCATAATG CCCAACTGGT GGTTATGATT CCTGATGGCC AAAACATGGC AGAAGATGAA 6540 

TTTGTTTACT GGCCAAATAA AGATGAGCCT ATAAATTGTG AGAG CTTTAA GGT CACTCTT 6600 

ATGGCTGAAG AACACAAATG TCTATCTAAT GAGGAAAAAC TTATAATTCA GGACTTTATC 6660 

T TAGAAGCTA CACAGGATGA TTATGTACTT GAAGTGAGGC ACTTTCAGTG TCCTAAATGG 6720 

CCAAATCCAG ATAGCCCCAT TAGTAAAACT TTTGAACTTA TAAGTGTTAT AAAAGAAGAA 6780 

GCTGCCAATA GGGATGGGCC TATGATTGTT CATGATGAGC ATGGAGGAGT GACGGCAGGA 684Q 

ACTTTCTGTG CTCTGACAAC CCTTATGCAC CAACTAGAAA AAGAAAATTC CGTGGATGTT 6900 

TACCAGGTAG CCAAGATGAT CAAT CTGATG AGGCCAGGAG TCTTTGCTGA CATTGAGCAG 6960 

TATCAGTTTC TCTACAAAGT GAT CCTCAGC CTTGTGAGCA CAAGGCAGGA AG AG AAT CCA 7020 

TCCACCTCTC TGGACAGTAA TGGTGCAGCA TTGCCTGATG GAAATATAGC TGAGAGCTTA 70 80 

GAGTCTTTAG TTTAACACAG AAAGGGGTGG GGGGACTCAC ATCTGAGCAT TGTTTTCCTC 7140 

TTCCTAAAAT TAGGCAGGAA AATCAGTCTA GTTCTGTTAT CTGTTGATTT CCCATCACCT 7200 

GACAGTAACT TTCATGACAT AGGATTCTGC CGCCAAATTT ATATCATTAA CAATGTGTGC 7260 

CTTTTTGCAA GACTTGTAAT TTACTTATTA TGTTTGAACT AAAATGATTG AATTTTACAG 7320 

TATTTCTAAG AATGGAATTG TGGTATTTTT TTCTGTATTG ATTTTAACAG AAAATTTCAA 7380 

TTTATAGAGG TTAGGAATTC CAAACTACAG AAAATGTTTG TTTTTAGTGT CAAATTTTTA 7440 

GCTGTATTTG TAG CAATT AT CAGGTTTGCT AGAAATATAA CTTTTAATAC AGTAGCCTGT 7500 

AAATAAAACA CTCTTCCATA TGATATTCAA CATTTTACAA CTGCAGTATT CACCTAAAGT 7560 

AGAAATAATC TGTTACTTAT TGTAAATACT GCCCTAGTGT CTCCATGGAC CAAATTTATA 762 0 

TTTATAATTG TAGATTTTTA TATTTTACTA CTGAGTCAAG TTTTCTAGTT CTGTGTAATT 7 680 

GTTTAGT TTA ATGACGTAGT TCATTAGCTG GTCTTACTCT ACCAGTTTTC TGACATTGTA 7740 

TTGTGTTACC TAAGTCATTA ACTTTGTTTC AGCATGTAAT TTTAACTTTT GTGGAAAATA 7800 

GAAATACCTT CATTTTGAAA GAAGTTTTTA TGAGAATAAC ACCTTACCAA ACATTGTTCA 7860 

AATGGTTTTT ATCCAAGGAA TTGCAAAAAT AAATATAAAT ATTGCCATTA AAAAAAAAAA 792 0 
AAAAAAAAAA AAAAAAAAAA AAAA 

Seq ID NO: 180 Protein sequence: 
Protein Accession #: Eos sequence 
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45 
50 
55 
60 
65 
70 
75 
80 
85 



1 

1 

MRILKRFLAC 
QSPINIDEDL 
FKASK.IT FHW 
ILFEVGTEEN 
TDTVDWIVFK 
TGKEEIHEAV 
HEFLTDGYQD 
LIGTEEIIKE 
RSPTRGSEFS 
GSKTVLRSPH 
ENISQGYIFS 
TAQPDVGSGR 
TEVTPHAFTP 
LNTTPAASSS 
ILPQVTSATE 
KTLMFSQVEP 
SLFSGPSHIP 
PVSVAEFTYT 
LNASLQETSV 
KPVLSANSEP 
AVPSDPILVE 
TISYASEKYE 
EPLNTLINKIi 
PHRDGSVTST 
KCMSCSSYRE 
SPGKSPSANG 
ENETSTDFSF 
HESRIGLAEG 
VISTPPTPIF 
TADS SNHPDN 
QGPLKSTAED 
VLAYYTVRNF 
AKRHAVGPW 
QYVFIHDTLV 
SAALKQCNRE 
HTIKDFWRMI 
NEEKLIIQDF 
VHDEHGGVTA 
SLVSTRQEEN 



11 
1 

IQLLCVCRLD 
TQVNVNLKKL 
GKCNMSSDGS 
LDFKAIIDGV 
DTVSISESQL 
CSSEPENVQA 
LGAILNNLLP 
EEEGKDIEEG 
GKGDVPNTSL 
MNLSGTAESL 
SENPETITYD 
ESFLQTNYTE 
SSRQQDLVST 
DSALHATPVF 
SDKVPLHASL 
PSSDAMMHAR 
IPKSSLITPT 
TSVFGDDNKA 
SISSTKGMFP 
ASSDPASSEM 
TPKVDKISST 
PVLLKSESSH 
IHSDEILTST 
KLLFP SKATS 
SQEKVMNDSD 
LSQKHNDGKE 
ADTNEKDADG 
LESEKKAVIP 
PISDDVGAIP 
KHKNRYINIV 
FWRMIWEHNV 
TLRNTKI KKG 
VHCSAGVGRT 
EAILSKETEV 
KNRTSSIIPV 
WDHNAQLWM 
ILEATQDDYV 
GTFCALTTLM 
PSTSLDSNGA 



21 
1 

WANGYYRQQR 
KFQGWDKTSL 
EHSLEGQKFP 
ESVSRFGKQA 
AVFCEVLTMQ 
DPENYTSLLV 
NMSYVLQIVA 
AIVNPGRDSA 
NSTSQPVTKL 
NTVSITEYEE 
VL I PESARNA 
IRVDESEKTT 
VNWYSQTTQ 
PSVDVSFESI 
PVAGGDLLLE 
SSGPEPSYAL 
ASLLQPTHAL 
LSKSEIIYGN 
GSIiAHTTTKV 
IiSPSTQLIiFY 
MLHLIVSNSA 
QWPSLYSND 
KSSVTGKVFA 
ELSHSAKSDA 
THENSLMDQN 
ENDIQTGSAL 
ILAAGDSEIT 
LVIVSALTFI 
IKHFPKHVAD 
AYDHSRVKLA 
EVIVMITNLV 
SQKGRPSGRV 
GTYIVLDSML 
LDSHIHAYVN 
ERSRVGISSL 
I PDGQNMAED 
IjEVRHFQCPK 
HQLEKENSVD 
ALPDGNIAES 



31 

1 

KLVEEIGWSY 
ENTFIHNTGK 
LEMQIYCFDA 
ALDPFILLNL 
QSGYVMLMDY 
TWERPRWYD 
ICTNGLYGKY 
TNQIRKKEPQ 
ATEKDISLTS 
ESLLTSFKLD 
SEDSTSSGSE 
KSFSAGPVMS 
PVYNGETPLQ 
LSSYDGAPLL 
PSLAQYSDVL 
SDNEGSQHIF 
SGDGEWSGAS 
ETELQIPSFN 
FDHEISQVPE 
ETSASFSTEV 
SSENMLHSTS 
ELFQTANLEI 
GIPTVASDTF 
GLVGGGEDGD 
NPISYSLSEN 
LPLSPESKAW 
PGFPQSPTSS 
CLWLVGIIjI 
LHASSGFTEE 
QLAEKDGKLT 
EKGRRKCDQY 
VTQYHYTQWP 
QQIQHEGTVN 
ALLIPGPAGK 
SGEGTDYINA 
EFVYWPNKDE 
WPNPDSPISK 
VYQVAKMINL 
LESLV 



41 

1 

TGALNQKNWG 
TVEINLTNDY 
DRFSSFEEAV 
LPNSTDKYYI 
LQNNFREQQY 
TMIEKFAVLY 
SDQLIVDMPT 
ISTTTHYNRI 
QTVTEIjPPHT 
TGAEDSSGSS 
ESLKDPSMEG 
QGPSVTDLEM 
PSYSSEVFPL 
PFSSASFSSE 
STTHAASETL 
TVSYSSAIPV 
SDSEFLLPDT 
EMVYPSESTV 
NNFSVQPTHT 
LLQPSFQASD 
VPVFDVSPTS 
NQAHPPKGRH 
VSTDHSVPIG 
TDDDGDDDDD 
SEEDNRVTSV 
AVLTSDEESG 
VTSENSEVFH 
YWRKCFQTAH 
FETIjKEFYQE 
DYINANYVDG 
WPADGSEEYG 
DMGVPEYSLP 
IFGFLKHIRS 
TKLEKQFQLL 
SYIMGYYQSN 
PINCESFKVT 
TFELISVIKE 
MRPGVFADIE 



51 

I 

KKYPTCNSPK 
RVSGGVSEMV 
KGKGKLRALS 
YNGSLTSPPC 
KFSRQVFSSY 
QQLDGEDQTK 
DNPELDLFPE 
GTKYNEAKTN 
VEGTSASLND 
PATSAIPFIS 
NVWFPSSTDI 
PHYSTFAYFP 
VTPIjLLDNC3I 
LFRHLHTVSQ 
EFGSE SGVLY 
HDSVGVTYQG 
DGLiT ALN I S S 
MPNMYDNVNK 
VSQASGDTSL 
VDTLLKTVLP 
HMHSASLQGL 
VFATPVLSID 
NGHVAITAVS 
DRGSDGLSIH 
SSDSQTGMDR 
SGQGTSDSLN 
VSEAEASNSS 
FYLEDSTSPR 
VQSCTVDLGI 
YNRPKAY I AA 
NFLVTQKSVQ 
VLTFVRKAAY 
QRNYLVQTEE 
SQSNIQQSDY 
EFIITQHPLL 
LMAEEHKCLS 
EAANRDGPMI 
QYQFLYKVIL 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 



Seq ID NO: 181 DNA sequence 

Nucleic Acid Accession #t Eos sequence 
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Coding sequence: 148-4518 
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1 11 21 31 41 51 

* I 1 I I i I 

3 CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 12 0 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

10 AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 3 60 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 42 0 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTT CAG A AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

15 GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 72 0 

GATTTCAAAG CGATTATTGA TGGAGT CGAA AGTGT TAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 90 0 

20 ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 12 00 

25 CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 1320 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 13 80 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 144 0 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

30 AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG * 1800 

35 AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 192 0 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 198 0 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

40 GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 222 0 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

45 GTATACAATG CAGAGGCCAG TAATAGTAGC CATGAGTCTC GTATTGGTCT AGCTGAGGGG 2460 

TTGGAATCCG AGAAGAAGGC AGTTATACCC CTTGTGATCG TGTCAGCCCT GACTTTTATC 2520 

TGTCTAGTGG TTCTTGTGGG TATTCTCATC TACTGGAGGA AATGCTTCCA GACTGCACAC 2580 

TTTTACTTAG AGGACAGTAC ATCCCCTAGA GTTATATCCA CACCTCCAAC ACCTATCTTT 2 640 

CCAATTTCAG ATGATGTCGG AGCAATTCCA ATAAAGCACT TTCCAAAGCA TGTTGCAGAT 2700 

50 TTACATGCAA GTAGTGGGTT TACTGAAGAA TTTGAGACAC TGAAAGAGTT TTACCAGGAA 2 760 

GTGCAGAGCT GTACTGTTGA CTTAGGTATT ACAGCAGACA GCTCCAACCA CCCAGACAAC 282 0 

AAGCACAAGA AT CG AT AC AT AAATATCGTT GCCTATGATC ATAGCAGGGT TAAGCTAGCA 2 880 

CAGCTTGCTG AAAAGGATGG CAAACTGACT GATTATATCA ATGCCAATTA TGTTGATGGC 2940 

TACAACAGAC CAAAAGCTTA TATTGCTGCC CAAGGCCCAC TGAAATCCAC AGCTGAAGAT 3000 

55 TTCTGGAGAA TGATATGGGA ACATAATGTG GAAGTTATTG TCATGATAAC AAACCTCGTG 3060 

GAGAAAGGAA GGAGAAAATG TGATCAGTAC TGGCCTGCCG ATGGGAGTGA GGAGTACGGG 3120 

AACTTTCTGG TCACTCAGAA GAGTGTGCAA GTGCTTGCCT ATTATACTGT GAGGAATTTT 3180 

ACTCTAAGAA ACACAAAAAT AAAAAAGGGC TCCCAGAAAG GAAGACCCAG TGGACGTGTG 3240 

GTCACACAGT ATCACTACAC GCAGTGGCCT GACATGGGAG TACCAGAGTA CTCCCTGCCA 3300 

60 GTGCTGACCT TTGTGAGAAA GGCAGCCTAT GCCAAGCGCC ATGCAGTGGG GCCTGTTGTC 3360 

GTCCACTGCA GTGCTGGAGT TGGAAGAACA GGCACATATA TTGTGCTAGA CAGTATGTTG 3420 

CAGCAGATTC AACACGAAGG AACTGTCAAC ATATTTGGCT TCTTAAAACA CATCCGTTCA 3480 

CAAAGAAATT ATTTGGTACA AACTGAGGAG CAATATGTCT TCATTCATGA TACACTGGTT 3 54 0 

GAGGCCATAC TTAGTAAAGA AACTGAGGTG CTGGACAGTC ATATTCATGC CTATGTTAAT 3600 

65 GCACTCCTCA TTCCTGGACC AGCAGGCAAA ACAAAGCTAG AGAAACAATT CCAGCTCCTG 3660 

AGCCAGTCAA ATATACAGCA GAGTGACTAT TCTGCAGCCC TAAAGCAATG CAACAGGGAA 3720 

AAGAATCGAA CTTCTTCTAT CATCCCTGTG GAAAGATCAA GGGTTGGCAT TTCATCCCTG 37 80 

AGTGGAGAAG GCACAGACTA CATCAATGCC TCCTATATCA TGGGCTATTA CCAGAGCAAT 3 840 

GAATTCATCA TTACCCAGCA CCCTCTCCTT CATACCATCA AGGATTTCTG GAGGATGATA 3900 

70 TGGGACCATA ATGCCCAACT GGTGGTTATG ATTCCTGATG GCCAAAACAT GGCAGAAGAT 3 960 

GAATTTGTTT ACTGGCCAAA TAAAGATGAG CCTATAAATT GTGAGAGCTT TAAGGTCACT 4020 

CTTATGGCTG AAGAACACAA ATGTCTATCT AATGAGGAAA AACTTATAAT TCAGGACTTT 4080 

ATCTTAGAAG CTACACAGGA TGATTATGTA CTTGAAGTGA GGCACTTTCA GTGTCCTAAA 4140 

TGGCCAAATC CAGATAGCCC CATTAGTAAA ACTTTTGAAC TTATAAGTGT TATAAAAGAA 4200 

75 GAAGCTGCCA ATAGGGATGG GCCTATGATT GTT CATGATG AGCATGGAGG AGTGACGGCA 4260 

GGAACTTTCT GTGCT CTGAC AACCCTTATG CACCAACTAG AAAAAGAAAA TTCCGTGGAT 4320 

GTTTACCAGG TAGCCAAGAT GATCAATCTG ATGAGGCCAG GAGTCTTTGC TGACATTGAG 43 80 

CAGTATCAGT TTCTCTACAA AGTGATCCTC AGCCTTGTGA GCACAAGGCA GGAAGAGAAT 4440 

CCATCCACCT CTCTGGACAG TAATGGTGCA GCATTGCCTG ATGGAAATAT AGCTGAGAGC 4500 

80 TTAGAGTCTT TAGTTTAACA CAGAAAGGGG TGGGGGGACT CACATCTGAG CATTGTTTTC 4560 

CTCTTCCTAA AATTAGGCAG GAAAATCAGT CTAGTTCTGT TATCTGTTGA TTTCCCATCA 4620 

CCTGACAGTA ACTTTCATGA CATAGGATTC TGCCGCCAAA TTTATATCAT TAACAATGTG 4680 

TGCCTTTTTG CAAGACTTGT AATTTACTTA TTATGTTTGA ACTAAAATGA TTGAATTTTA 4740 

CAGTATTTCT AAGAATGGAA TTGTGGTATT TTTTTCTGTA TTGATTTTAA CAGAAAATTT 4800 

85 CAATTTATAG AGGTTAGGAA TTCCAAACTA CAGAAAATGT TTGTTTTTAG TGTCAAATTT 4860 

TTAGCTGTAT TTGTAGCAAT TATCAGGTTT GCTAGAAATA TAACTTTTAA TACAGTAGCC 4920 

TGTAAATAAA ACACTCTTCC ATATGATATT CAACATTTTA CAACTGCAGT ATTCACCTAA 4980 



258 



WO 02/086443 

AGTAGAAATA ATCTGTTACT TATTGTAAAT ACTGCCCTAG TGTCTCCATG 
ATATTTATAA TTGTAGATTT TTATATTTTA CTACTGAGTC AAGTTTTCTA 
ATTGTTTAGT TTAATGACGT AGTTCATTAG CTGGTCTTAC TCTACCAGTT 
GTATTGTGTT ACCTAAGTCA TTAACTTTGT TTCAGCATGT AATTTTAACT 
ATAGAAATAC CTTCATTTTG AAAGAAGTTT TTATGAGAAT AACACCTTAC 
TCAAATGGTT TTTATCCAAG GAATTGCAAA AATAAATATA AATATTGCCA 
AAAAAAAAAA AAAAAAAAAA AAAAAAA 
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GACCAAATTT 5040 

GTTCTGTGTA 510 0 

TTCTGACATT 5160 

TTTGTGGAAA 522 0 

CAAAC AT TGT 5280 

TTAAAAAAAA 5340 
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Seq ID NO: 182 Protein sequence: 
Protein Accession #: Eos sequence 



MRILKRFLAC 
QSPINIDEDL 
FKASKITFHW 
ILFEVGTEEN 
TDTVDW I VFK 
TGKEEIHEAV 
HEFLTDGYQD 
LIGTEEIIKE 
RSPTRGSEFS 
GSKTVLRSPH 
ENISQGYIFS 
TAQPDVGSGR 
TEVTPHAFTP 
PLVIVSALTF 
PIKHFPKHVA 
VAYDHSRVKL 
VEVIVMITNL 
GSQKGRPSGR 
TGTYIVLDSM 
VLDSHIHAYV 
VERSRVGISS 
MIPDGQNMAE 
VLEVRHFQCP 
MHQLEKENSV 
AALPDGNIAE 



11 
I 

IQLLCVCRLD 
TQVNVNLKKL 
GKCNMS SDGS 
LDFKAIIDGV 
DTVSISESQL 
CSSEPENVQA 
LGAILNNLLP 
EEEGKDIEEG 
GKGDVPNTSL 
MNLSGTAESL 
SENPETITYD 
ESFLQTNYTE 
SSRQQDLVST 
ICLWLVGIL 
DLHASSGFTE 
AQLAEKDGKL 
VEKGRRKCDQ 
WTQYHYTQW 
LQQIQHEGTV 
NALL I PGP AG 
LSGEGTDYIN 
DEFVYWPNKD 
KWPNPDSPIS 
DVYQVAKMIN 
SLESLV 



21 
I 

WANGYYRQQR 
KFQGWDKTSL 
EHSLEGQKFP 
ESVSRFGKQA 
AVFCEVLTMQ 
DPENYTSLIiV 
NMSYVLQIVA 
A IVNPGRDS A 
NSTSQPVTKL 
NTVSITEYEE 
VLIPESARNA 
IRVDESEKTT 
VNWYSQTTQ 
IYWRKCFQTA 
EFETLKEFYQ 
TDY INANYVD 
YWPADGSEEY 
PDMGVPEYSIj 
NIFGFLKHIR 
KTKLEKQFQL 
ASYIMGYYQS 
EPINCESFKV 
KTFELISVIK 
LMRPGVFAD I 



31 

I 

KLVEEIGWSY 
ENTFIHNTGK 
LEMQIYCFDA 
ALDPFILLNL 
QSGYVMLMDY 
TWERPRWYD 
ICTNGLYGKY 
TNQIRKKEPQ 
ATEKDISLTS 
ESLLTSFKLD 
SEDSTSSGSE 
KSFSAGPVMS 
PVYNAEASNS 
HFYLEDSTSP 
EVQSCTVDLG 
GYNRPKAYIA 
GNFLVTQKSV 
PVLTFVRKAA 
SQRNYLVQTE 
LSQSNIQQSD 
NEFIITQHPL 
TLMAEEHKCL 
EEAANRDGPM 
EQYQFLYKVT 



Seq ID NO: 183 DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 148-4494 



CACACATACG 
CAAAAAAAAC 
CGGCGAGGGG 
CAGCTCCTCT 
CTTGTTGAAG 
AAATAT CCAA 
CAAGTAAATG 
AACACATTCA 
GTCAGCGGAG 
AAATG C AAT A 
GAGATGCAAA 
GGAAAAGGGA 
GATTTCAAAG 
TTAGAT CCAT 
AATGGCTCAT 
ACAGTT AG CA 
TCTGGTTATG 
TTCTCTAGAC 
AGTT CAGAAC 
TGGGAAAGAC 
CAGTTGGATG 
GGTGCTATTC 
TGCACTAATG 
AATCCTGAAC 
GAAGAGGGAA 
AACCAAATCA 
ACGAAATACA 
AAGGGTGATG 
ACAGAAAAAG 
GAAGGTACTT 
AACTTGT CGG 
AGTTTATTGA 
GCAACTTCTG 
GAAAACCCAG 
GAAGATTCAA 
GTGTGGTTTC 
AGCTTTCTCC 
TCCTTTTCTG 
CATTATTCTA 
TCCAGACAAC 
GTATACAATG 



11 
1 

CACGCACGAT 
ATTTCCTTCG 
CCGCAGACCG 
GTGTTTGCCG 
AGATTGGCTG 
CATGTAATAG 
TGAATCTTAA 
TTCATAACAC 
GAGTTTCAGA 
TGTCATCTGA 
TCTACTGCTT 
AGTTAAGAGC 
CGATTAT TGA 
TCATACTGTT 
TGACATCTCC 
TCTCTGAAAG 
TCATGCTGAT 
AGGTGTTTTC 
CAGAAAATGT 
CTCGAGT CGT 
GAGAGGACCA 
TCAATAATTT 
GCTTATATGG 
TTGATCTTTT 
AAGACATTGA 
GGAAAAAGGA 
ATGAAGCCAA 
TTCCCAATAC 
ATATTTCCTT 
CAGCCTCTTT 
GGACTGCAGA 
CCAGTTTCAA 
CTATCCCATT 
AGACAATAAC 
CTTCATCAGG 
CTAGCTCTAC 
AGACTAATTA 
CAGGCCCAGT 
CCTTTGCCTA 
AGGATTTGGT 
AGGCCAGTAA 



21 
I 

CTCACTTCGA 
CTCCCCCTCC 
TCTGGAAATG 
CCTGGATTGG 
GTCCTATACA 
CCCAAAACAA 
GAAACTTAAA 
TGGGAAAACA 
AATGGTGTTT 
TGGATCAGAG 
TGATGCAGAC 
TTTATCCATT 
TGGAGTCGAA 
GAACCTTCTG 
TCCCTGCACA 
CCAGTTGGCT 
GGACTACTTA 
CTCATACACT 
TCAGGCTGAC 
TTATGATACC 
AACCAAGCAT 
GCTACCCAAT 
AAAATACAGC 
CCCTGAATTA 
AGAAGGCGCT 
ACCCCAGATT 
GACTAACCGA 
ATCTTTAAAT 
GACTTCTCAG 
AAATGATGGC 
ATCCTTAAAT 
GCTTGATACT 
CATCTCTGAG 
ATATGATGTC 
TTCAGAAGAA 
AGACATAACA 
CACTGAGATA 
GATGTCACAG 
CTTCCCAACT 
CTCCACGGTC 
TAGTAGCCAT 



31 

I 

TCTATACACT 
CTCTCCACTC 
CGAATCCTAA 
GCTAATGGAT 
GGAGCACTGA 
TCTCCTATCA 
TTTCAGGGTT 
GTGGAAATTA 
AAAGCAAGCA 
CATAGTTTAG 
CGATTTT CAA 
TTGTTTGAGG 
AGTGTTAGTC 
CCAAAGTCAA 
GACACAGTTG 
GTTTTTTGTG 
CAAAACAATT 
GGAAAGGAAG 
CCAGAGAATT 
ATGATTGAGA 
GAATTTTTGA 
ATGAGTTATG 
GACCAACTGA 
ATTGGAACTG 
ATTGTGAATC 
TCTACCACAA 
TCCCCAACAA 
TCCACTTCCC 
ACTGTGACTG 
TCTAAAACTG 
ACAGTTTCTA 
GGAGCTGAAG 
AACATATCCC 
CTTATACCAG 
TCACTAAAGG 
GCACAGCCCG 
CGTGTTGATG 
GGTCCCTCAG 
GAGGTAACAC 
AACGTGGTAT 
GAGTCTCGTA 



41 
I 

TGALNQKNWG 
TVEINLTNDY 
DRFSSFEEAV 
LPNSTDKYYI 
LQNNFREQQY 
TMI EKFAVLY 
SDQLIVDMPT 
ISTTTHYNRI 
QTVTELPPHT 
TGAEDSSGSS 
ESLKDPSMEG 
QGPSVTDLEM 
SHESRIGLAE 
RVISTPPTPI 
ITADSSNHPD 
AQGPLKSTAE 
QVLAYYTVRN 
YAKRHAVGPV 
EQYVFIHDTL 
YSAALKQCNR 
LHTIKDFWRM 
SNEEKLIIQD 
IVHDEHGGVT 
LSLVSTRQEE 



41 
I 

GGAGGATTAA 
TGAGAAGCAG 
AGCGTTTCCT 
ACTACAGACA 
ATCAAAAAAA 
ATATTGATGA 
GGGATAAAAC 
ATCTCACTAA 
AGATAACTTT 
AAGGACAAAA 
GTTTTGAGGA 
TTGGGACAGA 
GTTTTGGGAA 
CTGACAAGTA 
ACTGGATTGT 
AAGTTCTTAC 
TTCGAGAGCA 
AGATTCATGA 
ATACCAGCCT 
AGTTTGCAGT 
CAGATGGCTA 
TTCTTCAGAT 
TTGTCGACAT 
AAGAAATAAT 
CTGGTAGAGA 
CACACTACAA 
GAGGAAGTGA 
AACCAGTCAC 
AACTGCCACC 
TTCTTAGATC 
TAACAGAATA 
ATTCTTCAGG 
AAGGGTATAT 
AATCTGCTAG 
ATCCTTCTAT 
ATGTTGGATC 
AATCTGAGAA 
TTACAGATCT 
CTCATGCTTT 
ACTCGCAGAC 
TTGGTCTAGC 



51 
I 

KKYPTCNSPK 
RVSGGVSEMV 
KGKGKLRALS 
YNGSLTSPPC 
KFSRQVFSSY 
QQLDGEDQTK 
DNPELDLFPE 
GTKYNEAKTN 
VEGTSASLND 
PATSAIPFIS 
NVWFPSSTDI 
PHYSTFAYFP 
GLESEKKAVI 
FPISDDVGAI 
NKHKNRYINI 
DFWRMIWEHN 
FTIiRNTKIKK 
WHCSAGVGR 
VEAILSKETE 
EKNRTSSIIP 
IWDHNAQLW 
FILEATQDDY 
AGTFCALTTL 
NPSTSLDSNG 



51 
I 

AACAAACAAA 
AGGAGCCGCA 
CGCTTGCATT 
ACAGAGAAAA 
TTGGGGAAAG 
AGATCTTACA 
ATCATTGGAA 
TGACTACCGT 
TCACTGGGGA 
ATTTCCACTT 
AGCAGT CAAA 
AGAAAAT TTG 
GCAGGCTGCT 
TTACATTTAC 
TTTTAAAGAT 
AATGCAACAA 
ACAGTACAAG 
AGCAGTTTGT 
TCTTGTTACA 
TTTGTACCAG 
TCAAGACTTG 
AGTAGCCATA 
GCCTACTGAT 
CAAGGAGGAG 
CAGTGCTACA 
TCGCATAGGG 
ATTCTCTGGA 
TAAATTAGCC 
TCACACTGTG 
TCCACATATG 
TGAGGAGGAG 
CTCCAGTCCC 
ATTTTCCTCC 
AAATGCTTCC 
GGAGGGAAAT 
AGGCAGAGAG 
GACAACCAAG 
GGAAATGCCA 
TACCCCATCC 
AACCCAACCG 
TGAGGGGTTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
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GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 252 0 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 25 80 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2 640 

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 270 0 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGGAAGTGC AGAGCTGTAC TGTTGACTTA 2760 

GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC ACAAGAATCG ATACATAAAT 282 0 

ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC TTGCTGAAAA GGATGGCAAA 2 880 

CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA ACAGACCAAA AGCTTATATT 2 940 

GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT GGAGAATGAT ATGGGAACAT 30 0 0 

AATGTGGAAG T TAT T GT CAT GATAACAAAC CTCGTGGAGA AAGGAAGGAG AAAATGTGAT 3060 

CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT TTCTGGTCAC TCAGAAGAGT 312 0 

GTGCAAGTGC TTGCCTATTA TACT GTGAGG AATTTTACTC TAAGAAACAC AAAAATAAAA 318 0 

AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA CACAGTATCA CTACACGCAG 3240 

TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC TGACCTTTGT GAGAAAGGCA 3300 

GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC ACTGCAGTGC TGGAGTTGGA 33 60 

AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC AGATTCAACA CGAAGGAACT 342 0 

GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA GAAATTATTT GGTACAAACT 34 80 

GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG CCATACTTAG TAAAGAAACT 3540 

GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC TCCTCATTCC TGGACCAGCA 360 0 

GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC AGTCAAATAT ACAGCAGAGT 3660 

GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA ATCGAACTTC TTCTATCATC 3 72 0 

CCTGTGGAAA GAT CAAGGGT TGGCATTTCA TCCCTGAGTG GAGAAGGCAC AGACTACATC 3780 

AATGCCTCCT AT AT C ATGGG CTATTACCAG AGCAATGAAT TCATCATTAC CCAGCACCCT 3840 

CTCCTTCATA CCAT CAAGGA TTTCTGGAGG ATGATATGGG ACCATAATGC CCAACTGGTG 3 900 

GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT TTGTTTACTG GCCAAATAAA 3 960 

GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA TGGCTGAAGA ACACAAATGT 402 0 

CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT TAGAAGCTAC ACAGGATGAT 40 80 

TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC CAAATCCAGA TAGCCCCATT 4140 

AGTAAAACTT TTGAACTTAT AAGTGT T ATA AAAGAAGAAG CTGCCAATAG GGATGGGCCT 42 00 

ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA CTTTCTGTGC TCTGACAACC 42 60 

CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT ACCAGGTAGC CAAGATGATC 43 2 0 

AAT CTGATGA GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT ATCAGTTTCT CTACAAAGTG 43 80 

ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAG AAT C CAT CCACCTCTCT GGACAGTAAT 4440 

GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG AGT CTTTAGT TTAACACAGA 450 0 

AAGGGGTGGG GGGACTCACA TCTGAGCATT GTTTTCCTCT TCCTAAAATT AGGCAGGAAA 4560 

ATCAGT CT AG TTCTGTTATC TGTTGATTTC CCATCACCTG ACAGTAACTT TCATGACATA 4620 

GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC TTTTTGCAAG ACTTGTAATT 4680 

TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT ATTTCTAAGA ATGGAATTGT 4740 

GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT TTATAGAGGT TAGGAATTCC 4 800 

AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG CTGTATTTGT AGCAATTATC 48 60 

AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA AATAAAACAC TCTTCCATAT 4 920 

GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA GAAATAATCT GTTACTTATT 4 98 0 

GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT TTATAATTGT AGATTTTTAT 5040 

ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG TTTAGTTTAA TGACGTAGTT 5100 

CATTAGCTGG TCTTACTCTA CCAGTTTTCT G AC AT TGT AT TGTGTTACCT AAGTCATTAA 5160 

CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG AAATACCTTC ATTTTGAAAG 522 0 

AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA ATGGTTTTTA TCCAAGGAAT 52 80 

TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 5340 
AAA 

Seq ID NO: 184 Protein sequence: 
Protein Accession #: EOS sequence 

1 11 21 31 41 51 

I I I I I I 

MR I LKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIKNTGK TVEINLTNDY RVSGGVSEMV 120 

FKASKITFHW GKCNMS SDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 

ILiFEVGTEEN L.DFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 30 0 

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRWYD TMIEKFAVLY QQLDGEDQTK 3 60 

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 42 0 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPETITYD VL I PESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 6 60 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP .72 0 

TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP 780 

LVIVSALTFI CLWLVGILI YWRKCFQTAH FYLEDSTSPR VISTPPTPIF PISDDVGAIP 840 

IKHFPKHVAD LHASSGFTEE FEEVQSCTVD LGITADSSNH PDNKHKNRYI NIVAYDHSRV 900 

KLAQLAEKDG KLTDYINANY VDGYNRPKAY IAAQGPLKST AEDFWRMIWE HNVEVIVMIT 960 

NLVEKGRRKC DQYWPADGSE EYGNFLVTQK SVQVLAYYTV RNFTLRNTKI KKGSQKGRPS 1020 

GRWTQYHYT QWPDMGVPEY SLPVLTFVRK AAYAKRHAVG PVWHCSAGV GRTGTYIVLD 10 80 

SMLQQIQHEG TVNIFGFLKH IRSQRNYLVQ TEEQYVFIHD TLVEAILSKE TEVLDSHIHA 1140 

YVNALLIPGP AGKTKLEKQF QLLSQSNIQQ SDYSAALKQC NREKNRTSSI IPVERSRVGI 12 00 

SSLSGEGTDY INASYIMGYY QSNEFIITQH PLLHTIKDFW RMIWDHNAQL WMIPDGQNM 12 60 

AEDEFVYWPN KDEPINCESF KVTLMAEEHK CLSNEEKLII QDFILEATQD DYVLEVRHFQ 1320 

CPKWPNPDSP ISKTFELISV I KEEAANRDG PMIVHDEHGG VTAGTFCALT TLMHQLEKEN 13 80 

SVDVYQVAKM INLMRPGVFA DIEQYQFLYK VILSLVSTRQ EENPSTSLDS NGAALPDGN I 1440 
AESLESLV 



Seq ID NO: 185 DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 501-4514 
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CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 12 0 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAT TGGGGAAAGA 3 00 

AATATCCAAC ATGTAATAGC CCAAAACAAT CTCCTATCAA TATTGATGAA GATCTTACAC 360 

AAGTAAATGT GAATCTTAAG AAACTTAAAT TTCAGGGTTG GGATAAAACA TCATTGGAAA 420 

ACACATTCAT T CAT AACACT GGGAAAACAG TGGAAATTAA TCTCACTAAT GACTACCGTG 480 

TCAGCGGAGG AGTTTCAGAA ATGGTGTTTA AAGCAAGCAA GATAACTTTT CACTGGGGAA 540 

AATGCAATAT GTCATCTGAT GGATCAGAGC ATAGTTTAGA AGGACAAAAA TTTCCACTTG 600 

AGATGCAAAT CTACTGCTTT GATGCGGACC GATTTTCAAG TTTTGAGGAA GCAGT CAAAG 660 

GAAAAGGGAA GTTAAGAGCT TTATCCATTT TGTTTGAGGT TGGGACAGAA GAAAATT TGG 720 

ATTTCAAAGC GATTATTGAT GGAGT CGAAA GTGTTAGTCG TTTTGGGAAG CAGGCTGCTT 780 

TAGATCCATT CATACTGTTG AACCTTCTGC CAAACTCAAC TGACAAGTAT TACATTTACA 840 

ATGGCTCATT GACATCTCCT CCCTGCACAG ACACAGTTGA CTGGATTGTT TTTAAAGATA 900 

CAGTTAGCAT CTCTGAAAGC CAGTTGGCTG TTTTTTGTGA AGTTCTTACA ATGCAACAAT 960 

CTGGTTATGT CATGCTGATG GACTACTTAC AAAACAATTT TCGAGAGCAA CAGTACAAGT 1020 

TCTCTAGACA GGTGTTTTCC TCATACACTG GAAAGGAAGA GATTCATGAA GCAGTTTGTA 1080 

GTTCAGAACC AGAAAATGTT CAGGCTGACC CAGAGAATTA TACCAGCCTT CTTGTTACAT 1140 

GGGAAAGACC TCGAGTCGTT TATGATACCA TGATTGAGAA GTTTGCAGTT TTGTACCAGC 1200 

AGTTGGATGG AGAGGACCAA ACCAAGCATG AATTTTTGAC AGATGGCTAT CAAGACTTGG 1260 

GTGCTATTCT CAATAATTTG CTACCCAATA • TGAGTTATGT TCTTCAGATA GTAGCCATAT 1320 

GCACTAATGG CTTATATGGA AAATACAGCG ACCAACTGAT TGTCGACATG CCTACTGATA 13 80 

AT CCTGAACT TGATCTTTTC CCTGAATTAA TTGGAACTGA AGAAATAATC AAGGAGGAGG 1440 

AAGAGGGAAA AGAGATTGAA GAAGGCGCTA TTGTGAATCC TGGTAGAGAC AGTGCTACAA 1500 

ACCAAATCAG GAAAAAGGAA CCCCAGATTT CTACCACAAC ACACTACAAT CGCATAGGGA 1560 

CGAAATACAA TGAAGCCAAG ACTAACCGAT CCCCAACAAG AGGAAGTGAA TTCTCTGGAA 1620 

AGGGTGATGT TCCCAATACA TCTTTAAATT CCACTTCCCA ACCAGTCACT AAATTAGCCA 1680 

CAGAAAAAGA TATTTCCTTG ACTTCTCAGA CTGTGACTGA ACTGCCACCT CACACTGTGG 1740 

AAGGTACTTC AGCCTCTTTA AATGATGGCT CTAAAACTGT TCTTAGATCT CCACATATGA 180 0 

ACTTGTCGGG GACTGCAGAA TCCTTAAATA CAGTTTCTAT AACAGAATAT GAGGAGGAGA 1860 

GTTTATTGAC CAGTTTCAAG CTTGATACTG GAGCTGAAGA TTCTTCAGGC TCCAGTCCCG 1920 

CAACTTCTGC TATCCCATTC ATCTCTGAGA ACATATCCCA AGGGTATATA TTTTCCTCCG 1980 

AAAACCCAGA GACAATAACA TATGATGTCC TTATACCAGA ATCTGCTAGA AATGCTTCCG 2 040 

AAGATTCAAC TTCATCAGGT TCAGAAGAAT CACTAAAGGA TCCTTCTATG GAGGGAAATG 2100 

TGTGGTTTCC TAGCTCTACA GACATAACAG CACAGCCCGA TGTTGGATCA GG C AG AG AGA 2160 

GCTTTCTCCA GACTAATTAC ACTGAGATAC GTGTTGATGA ATCTGAGAAG ACAACCAAGT 222 0 

CCTTTTCTGC AGGCCCAGTG ATGTCACAGG GTCCCTCAGT TACAGATCTG GAAATGCCAC 22 80 

ATTATTCTAC CTTTGCCTAC TTCCCAACTG AGGTAACACC TCATGCTTTT ACCCCATCCT 23 40 

CCAGACAACA GGATTTGGTC TCCACGGTCA ACGTGGTATA CTCGCAGACA ACCCAACCGG 2400 

TATACAATGA GGCCAGTAAT AGTAGCCATG AGTCTCGTAT TGGTCTAGCT GAGGGGTTGG 2460 

AATCCGAGAA GAAGGCAGTT ATACCCCTTG TGAT CGTGTC AGCCCTGACT TTTATCTGTC 252 0 

TAGTGGTTCT TGTGGGTATT CTCATCTACT GGAGGAAATG CTTCCAGACT GCACACTTTT 2580 

ACTTAGAGGA CAGTACATCC CCTAGAGTTA TATCCACACC TCCAACACCT ATCTTTCCAA 2 640 

TTTCAGATGA TGTCGGAGCA ATTCCAATAA AGCACTTTCC AAAGCATGTT GCAGATTTAC 2 700 

ATGCAAGTAG TGGGTTTACT GAAGAATTTG AGACACTGAA AGAGTTTTAC CAGGAAGTGC 2 760 

AGAGCTGTAC TGTTGACTTA GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC 2820 

ACAAGAATCG ATACATAAAT ATCGTTGCCT ATG AT CAT AG CAGGGTTAAG CTAGCACAGC 2 880 

TTGCTGAAAA GGATGGCAAA CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA 2 940 

ACAGACCAAA AGCTTATATT GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT 3000 

GGAGAATGAT ATGGGAACAT AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA 3 0 60 

AAGGAAGGAG AAAATGTGAT CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT 3120 

TTCTGGTCAC TCAGAAGAGT GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC 3180 

TAAGAAACAC AAAAATAAAA AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA 3240 

CACAGTATCA CTACACGCAG TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC 3300 

TGACCTTTGT GAGAAAGGCA GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC 3 3 60 

ACTGCAGTGC TGG AGT TGG A AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC 3420 

AGATTCAACA CGAAGGAACT GTCAACATAT TTGGCTTCTT AAAACACAT C CGTTCACAAA 3480 

GAAATTATTT GGTACAAACT GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG 3540 

CCATACTTAG TAAAGAAACT GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC 3600 

TCCTCATTCC TGGACCAGCA GGCAAAACAA AGCTAGAGAA ACAATTCCAG CTCCTGAGCC 3 660 

AGTCAAATAT ACAGCAGAGT GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA 3720 

ATCGAACTTC TTCTAT CAT C CCTGTGGAAA GAT CAAGGGT TGGCATTTCA TCCCTGAGTG 3780 

GAGAAGGCAC AG ACT ACAT C AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT 3840 

T CAT CATTAC CCAGCACCCT CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG 3900 

ACCATAATGC CCAACTGGTG GTTATGATT C CTGATGGCCA AAACATGGCA GAAGATGAAT 3960 

TTGTTTACTG GCCAAATAAA GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA 4020 

TGGCTGAAGA ACACAAATGT CTAT CTAATG AGGAAAAACT TATAATTCAG GACTTTATCT 4080 

TAGAAGCTAC ACAGGATGAT TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC 4140 

CAAATCCAGA TAGCCCCATT AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG 4200 

CTGCCAATAG GGATGGGCCT ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA 4260 

CTTTCTGTGC TCTGACAACC CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT 4320 

ACCAGGTAGC CAAGATGATC AATCTGATGA GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT 4380 

AT CAGTTTCT CTACAAAGTG ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT 4440 

CCACCTCTCT GGACAGTAAT GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG 4500 

AGTCTTTAGT TTAACACAGA AAGGGGTGGG GGG ACT CACA TCTGAGCATT GTTTTCCTCT 4560 

TCCTAAAATT AGG CAGGAAA ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG 4620 

ACAGTAACTT TCATGACATA GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC 4680 

TTTTTGCAAG ACTTGTAATT TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT 4740 

ATTTCTAAGA ATGGAATTGT GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT 480 0 

TTATAGAGGT TAGGAATTCC AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG 4860 

CTGTATTTGT AG CAAT TAT C AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA 4920 

AATAAAACAC TCTTCCATAT GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA 4980 

GAAATAATCT GTTACTTATT GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT 5040 

TTATAATTGT AGATTTTTAT ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG 5100 
TTTAGTTTAA TGACGTAGTT CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT 5160 
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TGTGTTACCT AAGTCATTAA CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG 522 0 

AAATACCTTC ATTTTGAAAG AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTTCAA 52 80 

ATGGTTTTTA TCCAAGGAAT TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA 5340 
AAAAAAAAAA AAAAAAAAAA AAA 

Seq ID NO: 186 Protein sequence: 
Protein Accession #: EOS sequence 

1 11 21 31 41 51 

1 1 I 1 I i 

MVFKASKITF HWGKCNMSSD GSEHSLEGQK FPLEMQIYCF DADRFSSFEE AVKGKGKLRA 60 

LSILFEVGTE ENLDFKAI ID GVESVSRFGK QAALDPFILL NLLPNSTDKY YIYNGSLTSP 120 

PCTDTVDWIV FKDTVSISES QLAVFCEVLT MQQSGYVMLM DYLQNNFREQ QYKFSRQVFS 180 

SYTGKEEIHE AVCSSEPENV QADPENYTSL LVTWERPRW YDTMIEKFAV LYQQLDGEDQ 240 

TKHEFLTDGY QDLGAILNNL LPNMSYVLQI VAX CTNGLYG KYSDQLIVDM PTDNPELDLF 300 

PELIGTEEII KEEEEGKDIE EGAIVNPGRD SATNQIRKKE PQISTTTHYN RIGTKYNEAK 360 

TNRSPTRGSE FSGKGDVPNT SLNSTSQPVT KLATEKDISL TSQTVTELPP HTVEGTSASL 42 0 

NDGSKTVLRS PHMNLSGTAE SLNTVSITEY EEESLLTSFK LDTGAEDSSG SSPATSAIPF 480 

ISENISQGYI FSSENPETIT YDVLI PESAR NASEDSTSSG SEESLKDPSM EGNVWFPSST 540 

DITAQPDVGS GRESFLQTNY TEIRVDESEK TTKSFSAGPV MSQGPSVTDL EMPHYSTFAY 600 

FPTEVTPHAF TPSSRQQDLV STVNWYSQT TQPVYNEASN SSHESRIGLA EGLESEKKAV 660 

IPLVIVSALT FICLWLVGI LIYWRKCFQT AHFYLEDSTS PRVISTPPTP IFPISDDVGA 720 

IPIKHFPKHV ADLHASSGFT EEFETLKEFY QEVQSCTVDL GITADSSNHP DNKHKNRYIN 78 0 

IVAYDHSRVK LAQLAEKDGK LTDYINANYV DGYNRPKAYI AAQGPLKSTA EDFWRMIWEH 840 

NVEVIVMITN LVEKGRRKCD QYWPADGSEE YGNFLVTQKS VQVLAYYTVR NFTLRNTKIK 900 

KGSQKGRPSG RWTQYHYTQ WPDMGVPEYS LPVLTFVRKA AYAKRHAVGP VWHCSAGVG 960 

RTGTYIVLDS MLQQIQHEGT VNIFGFLKHI RSQRNYLVQT EEQYVFIHDT LVEAILSKET 1020 

EVLDSHIHAY VNALLIPGPA GKTKLEKQFQ LLSQSNIQQS DYSAALKQCN REKNRTSSII 10 80 

PVERSRVGIS SLSGEGTDYI NASYIMGYYQ SNEFIITQHP LiIiHTIKDFWR MIWDHNAQLV 1140 

VMIPDGQNMA EDEFVYWPNK DEPINCESFK VTLMAEEHKC LSNEEKIjIIQ DFILEATQDD 1200 

YVLEVRHFQC PKWPNPDSPI SKTFELISVI KEEAANRDGP MIVHDEHGGV TAGTFCALTT 1260 

LMHQLEKENS VDVYQVAKMI NLMRPGVFAD IEQYQFIiYKV ILSLVSTRQE ENPSTSLDSN 1320 
GAALPDGN I A ESLESLV 

Seq ID NO: 187 DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 148-4632 

1 11 21 31 41 51 

I I I I I I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 12 0 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AACGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA AT CTCACT AA TGACTACCGT 48 0 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGAT CAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTT CAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 78 0 

TTAGAT CCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGG CT CAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTT CTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 10 8 0 

AGT TCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 132 0 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 13 80 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 162 0 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

.GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 192 0 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 204 0 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGT CACAG GGTCCCTCAG TTACAGATCT GGAAATG CC A 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 240 0 

GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGT CTCGTA TTGGTCTAGC TGAGGGGTTG 2460 

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 2520 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 2580 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2640 

ATTT CAGATG ATGT CGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700 

CATG CAAGT A GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 2760 

CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 282 0 
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CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 2 880 

CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 2 940 

AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 3000 

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 3060 

AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 3120 

TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 3180 

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 3240 

ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 3300 

CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 3360 

CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 3 420 

CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 3480 

AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 3540 

GCCATACTTA GT AAAG AAA C TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 3 600 

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GGGTCTCACT 3 660 

CTGTCACCCA GGCTGGAGTG CAGAGGCACA ATCTCGGCTC ACTGCAACCT TCCTCTCCCT 372 0 

GGCTTAACTG ATCCTCCTAC CTCAGCCTCC CGAGTGGCTG GG AC TAT ACT CCTGAGCCAG 3780 

TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATG CAAC AG GGAAAAGAAT 3840 

CGAACTTCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 3 900 

GAAGGCACAG ACTACATCAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATGAATTC 3 960 

ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GAT ATGG G AC 4020 

CATAATGCCC AACTGGTGGT TATGATTCCT GATGG CCAAA ACATGGCAGA AGATGAATTT 40 80 

GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTCTTATG 4140 

GCTGAAGAAC AC AAATGT CT ATCTAATGAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 42 00 

GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATGGCCA 42 60 

AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 432 0 

GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT 4380 

TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATGTTTAC 4440 

CAGGTAGCCA AGATGAT CAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TGAGCAGTAT 4500 

CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGGGCACAA GGCAGGAAGA GAATCCATCC 4560 

ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 462 0 

TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACT CACATC TGAGCATTGT TTTCCTCTTC 4680 

CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACCTGAC 4740 

AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA TGTGTGCCTT 480 0 

TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 4860 

TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT T T AAC AG AAA ATTTCAATTT 4920 

ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT TTAGTGTCAA ATTTTTAGCT 4980 

GTATTTGTAG CAATT AT CAG GTTTGCTAGA AATATAACTT TTAATACAGT AGCCTGTAAA 5040 

TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 5100 

AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 5160 

ATAATTGTAG ATTTTTATAT TTTACTACTG AGTCAAGTTT TCTAGTTCTG TGTAATTGTT 5220 

TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTG 52 80 

TGTTACCTAA GTCATTAACT TTGTTTCAGC ATGTAATTTT AACTTTTGTG GAAAATAGAA 5340 

ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACACC TTACCAAACA TTGTT CAAAT 5400 

GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA AAAAAAAAAA 5460 
AAAAAAAAAA AAAAAAAAAA A 

Seq ID NO: 188 Protein sequence: 
Protein Accession #: EOS sequence 

1 11 21 31 41 51 

1111)1 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTF I HNTGK TVEINLTNDY RVSGGVSEMV 12 0 

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRWYD TMIEKFAVLY QQLDGEDQTK 360 

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPET I TYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQ PDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 72 0 

TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP 780 

LVIVSALTFI CLWLVGILI YWRKCFQTAH FYLEDSTSPR VISTPPTPIF PISDDVGAIP 840 

IKHFPKHVAD LHASSGFTEE FETLKEFYQE VQSCTVDLGI TADSSNHPDN KHKNRYINIV 900 

AYDHSRVKLA QLAEKDGKLT DYINANYVDG YNRPKAY I AA QGPLKSTAED FWRMIWEHNV 960 

EVIVMITNLV EKGRRKCDQY WPADGSEEYG NFLVTQKSVQ VLAYYTVRNF TLRNTKI KKG 102 0 

SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY AKRHAVGPW VHCSAGVGRT 1080 

GTYIVLDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE QYVFIHDTLV EAILSKETEV 1140 

LDSHIHAYVN ALLIPGPAGK TKLEKQFQGL TLSPRLECRG TISAHCNLPL PGLTDPPTSA 1200 

SRVAGT ILLS QSNIQQSDYS AALKQCNREK NRTSSIIPVE RSRVGISSLS GEGTDYINAS 12 60 

YIMGYYQSNE FIITQHPLLH TIKDFWRMIW DHNAQLWMI PDGQNMAEDE FVYWPNKDEP 1320 

INCESFKVTL MAEEHKCLSN EEKLIIQDFI LEATQDDYVL EVRHFQCPKW PNPDSPISKT 1380 

FELISVIKEE AANRDGPMIV HDEHGGVTAG TFCALTTLMH QLEKENSVDV YQVAKMINLM 1440 
RPGVFADIEQ YQFLYKVILS LVGTRQEENP STSLDSNGAA LPDGNIAESL ESLV 



Seq ID NO: 189 DNA sequence 
Nucleic Acid Accession #: NM_002820 
Coding sequence: 304.. 831 

1 11 21 31 41 51 

1 I I I ! I 

CCGGTTCGCA AAGAAGCTGA CTT CAGAGGG GGAAACTTTC TTCTTTTAGG AGGCGGTTAG 60 

CCCTGTTCCA CGAACCCAGG AGAACTGCTG GCCAGATTAA TTAGACATTG CTATGGGAGA 120 

CGTGTAAACA CACTACTTAT CATTGATGCA TATATAAAAC CATTTTATTT TCGCTATTAT 180 
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TTCAGAGGAA GCGCCTCTGA TTTGTTTCTT TTTTCCCTTT TTGCTCTTTC TGGCTGTGTG 240 

GTTTGGAGAA AGCACAGTTG GAGTAGCCGG TTGCTAAATA AGTCCCGAGC GCGAGCGGAG 300 

ACGATGCAGC GGAGACTGGT TCAGCAGTGG AG CGTCGCGG TGTTCCTGCT GAGCTACGCG 360 

GTGCCCTCCT GCGGGCGCTC GGTGGAGGGT CTCAGCCGCC GCCTCAAAAG AGCTGTGTCT 42 0 

GAACATCAGC TCCTCCATGA CAAGGGGAAG TCCATCCAAG ATTTACGGCG ACGATTCTTC 480 

CTTCACCATC TGAT CGCAGA AATCCACACA GCTGAAATCA GAGCTACCTC GGAGGTGTCC 540 

CCTAACTCCA AGCCCTCTCC CAACACAAAG AACCACCCCG TCCGAT TTGG GTCTGATGAT 60 0 

GAGGGCAGAT ACCTAACTCA GGAAACTAAC AAGGTGGAGA CGTACAAAGA GCAGCCGCTC 660 

AAGACACCTG GGAAGAAAAA GAAAGGCAAG CGCGGGAAAC GCAAGGAGCA GGAAAAGAAA 720 

AAACGGCGAA CTCGCTCTGC CTGGTTAGAC TCTGGAGTGA CTGGGAGTGG GCTAGAAGGG 780 

GACCACCTGT CTGACACCTC CACAACGTCG CTGGAGCTCG ATTCACGGTA ACAGGCTTCT 840 

CTGGCCCGTA GCCTCAGCGG GGTGCTCTCA GCTGGGTTTT GGAGCCTCCC TTCTGCCTTG 900 

GCTTGGACAA ACCTAGAATT TTCTCCCTTT ATGTATCTCT ATCGATTGTG TAGCAATTGA 960 

CAGAGAATAA CTCAGAATAT TGTCTGCCTT AAAGCAGTAC CCCCCTACCA CACACACCCC 1020 

TGTCCTCCAG CACCATAGAG AGGCGCTAGA GCCCATTCCT CTTTCTCCAC CGTCACCCAA 10 80 

CATCAATCCT TTACCACTCT ACCAAATAAT TTCATATTCA AGCTTCAGAA GCTAGTGACC 1140 

AT CTTCAT AA TTTGCTGGAG AAGTGTATTT CTTCCCCTTA CTCTCACACC TGGGCAAACT 1200 

TTCTTCAGTG TTTTTCATTT CTTACGTTCT TTCACTTCAA GGGAGAATAT AGAAGCATTT 1260 

GAT ATT AT CT ACAAACACTG C AG AAC AG C A TC ATGT CAT A AACGATTCTG AGCCATTCAC 1320 

ACTTTTTATT TAATTAAATG TATTTAATTA AATCTCAAAT TTATTTTAAT GTAAAGAACT 13 80 

TAAATTATGT TTTAAACACA TGCCTTAAAT TTGTTTAATT AAATTTAACT CTGGTTTCTA 1440 

CCAGCTCATA CAAAATAAAT GGTTTCTGAA AATGTTTAAG TATTAACTTA CAAGGATATA 1500 

GGTTTTTCTC ATGTATCTTT TTGTTCATTG GCAAGATGAA ATAATTTTTC TAGGGTAATG 1560 
CCGTAGGAAA AATAAAACTT CACATTTAAA AAAAA 
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Seq ID NO: 190 Protein sequence: 
Protein Accession #: NP 002811 



1 11 21 31 41 51 

111)11 
MQRRLVQQWS VAVFLLSYAV PSCGRSVEGL SRRLKRAVSE HQLLHDKGKS IQDLRRRFFL 
HHLIAEIHTA EIRATSEVSP NSKPSPNTKN HPVRFGSDDE GRYLTQETNK VETYKEQPLK 
TPGKKKKGKP GKRKEQEKKK RRTRSAWLDS GVTGSGLEGD HLSDTSTTSL ELDSR 



60 
120 



40 
45 
50 
55 
60 
65 



Seq ID NO: 19,1 DNA sequence 
Nucleic Acid Accession #: XM_059328 
Coding sequence: 52.. 1023 



GGGCTGTCCG 
CCTCGCATGC 
GGTATCGTGG 
GCGGCCACGG 
GCCAACCTGT 
GGCCCGGAAG 
GTGGATTTGC 
CTGGGCAGGG 
TGCCAGGTGT 
GAGCGCGGTG 
GTGGAGCGCG 
GACGCCTTCG 
GCCCTGGCGC 
GCGCACCCCG 
TTCTCTTGCT 
GCCCAGCTTG 
CCAGGGGAGG 
TGACCCCCTA 
TTAGTCCTGG 
GGACACTGCC 
AGCCTTCTTG 
TGGTGCCCCT 
CTATATTAAT 



11 
I 

GCCCACTCCC 
GCCTGGTGGT 
AGGCCTTTCT 
AGAGCGCGGC 
CCGAGGGCCG 
GCTTCTTCCT 
CTCAGGTGCG 
CCCCCACGCA 
TCGCCGAGGC 
TGGGTGGCTG 
ACGCCCGGGC 
TGGGCCTGAG 
GGGTCCTGGA 
GCTACCCCAG 
CTTGGGAGCG 
CCCAGGATGG 
AGGTCCCCTG 
CAGACAACCA 
CCCAGCCCAG 
ACCTCTGGGC 
GCTGCAGGCA 
CCATGTTGCA 
AAAATAACGT 



21 
I 

CTGGGAGCGC 
CACCGCGGAC 
GGCCGGGGCT 
GGAGCTGGCC 
CCCCGTGGGT 
TGGCAAGATG 
GGAGGAGCTC 
CGCGGACGGG 
GCTGCAGGCC 
CACTTGGCTG 
CGCCGTGGGC 
CACTTGCGGC 
AGGTACCCTA 
TGTGCCTCCC 
GCTGCATGAG 
CGTGCAGCTT 
TGAGCCCACT 
AGCACTAATC 
AGCTGGGACC 
TCAGGTCCTC 
GGCCTAGCCT 
ATGCAAACAC 
GTGTCTTTC 



31 

I 

GAGCGGTGGA 
GACTTTGGTT 
GTGACCAGCG 
CGCAGGCACA 
CCGGCCCGCC 
GGATTCCGGG 
GAGGCCCAAC 
CACCAGCACG 
TATGGGGTGC 
GAGGCCCCCG 
CCCTTCTCCC 
CGGCACATGT 
GCGGGCCACA 
ACCGGCGGCT 
CTGCGCGTCC 
TGCGCCCTCG 
CTGGAACCCT 
CCCTTAGTAC 
TGGAGCACGA 
ATGCCTCCAA 
GTGGCAGCGG 
CTTCACCACT 



41 
1 

CCCAGGCGGC 
ACTGCCCGCG 
TGTCCCTGCT 
GCATCCCCAC 
GTGGCGCCTC 
AGGCGGTGGC 
TAAGCTGCTT 
TGCACGTGCT 
GCTTTACGCG 
CGCGTGCCTT 
GCCACGGCCT 
CCGCTCACCG 
CCCTGACAGC 
GCGGTGAAGG 
TCACCGCGCC 
ACGACCTGGA 
TCCTGGAACC 
CAAGAAAGGG 
TCTGTTGACT 
ATGGCATCTA 
GCTAGGGCCC 
GGGGCAGTGG 



51 
I 

CATGTCCCGC 
ACGCGATGAG 
GGT CAACGGT 
GGGCCTCCAC 
ATCGCTGCTC 
GGCCGGAGAC 
CCGGGAGCTG 
CCCAGGCGTG 
ACTGCCGCTG 
CGCCTGCGCC 
GCGGTGGACA 
CGTGTCCGGG 
CGAGCTGATG 
CCCCGACGCT 
CACGCTGCGG 
CTCCAAGAGG 
CTCCCTACTC 
GAGCCAGGAT 
TCCCTGGGTA 
GAGTTTGAGC 
GCAGAGCATT 
GGAGAGATGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



70 



75 



80 



Seq ID NO: 192 Protein sequence: 
Protein Accession #: XP 059328 



1 
I 

MSRPRMRLW 
GLHANLSEGR 
RELLGRAPTH 
ACAVERDARA 
ELMAHPGYPS 
SKRPGEEVPC 



11 
I 

TADDFGYCPR 
PVGPARRGAS 
ADGHQHVHVL 
AVGPFSRHGL 
VPPTGGCGEG 
EPTLEPFLEP 



21 
I 

RDEGIVEAFL 
SLLGPEGFFL 
PGVCQVFAEA 
RWTDAFVGLS 
PDAFSCSWER 
SLL 



31 
I 

AGAVTSVSLL 
GKMGFREAVA 
LQAYGVRFTR 
TCGRHMSAHR 
LHELRVLTAP 



41 
I 

VNGAATESAA 
AGDVDLPQVR 
LPLERGVGGC 
VSGALARVLE 
TLRAQLAQDG 



51 
I 

ELARRHSIPT 
EELEAQLSCF 
TWLEAPARAF 
GTLAGHTLTA 
VQLCALDDLD 



Seq ID NO: 193 DNA sequence 

Nucleic Acid Accession #: NM_005688.1 

Coding sequence: 126.. 4439 



60 
120 
180 
240 
300 



„ 1 11 21 31 41 51 

85 | j i i i I 

CCGGGCAGGT GGCTCATGCT CGGGAGCGTG GTTGAGCGGC TGGCGCGGTT GTCCTGGAGC 
AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCGCTCAG 



60 
120 
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AGAAGATGAA GGATATCGAC ATAGGAAAAG 
GTGTGAGGGA GAGAACCAGC ACTTCTGGGA 
GGAGAACTCG ACCGTTGGAA TGCCAAGATG 
TCTCTCTTGA TGCCTCCATG CATTCTCAGC 
5 GAAAGTACCA TCATGGCTTG AGTGCTCTGA 

ACCCAGTGGA CAATGCTGGG CTTTTTTCCT 
CCCGTGTGGC CCACAAGAAG GGGGAGCTCT 
ACGAGTCTTC TGACGTGAAC TGCAGAAGAC 
AAGT TGGGCG AGACGCTGCT TCCCTGCGAA 

10 TCATCCTGTC CAT CGTGTGC CTGATGATCA 

TCATGGTGAA ACACCTCTTG GAGTATACCC 
TGTTGTTAGT GCTGGGCCTC CTCCTGACGG 
CTTGGGCATT GAATTACCGA ACCGGTGTCC 
TTAAGAAGAT CCTTAAGTTA AAGAACATTA 

15 TTTGCTCCAA CGATGGGCAG AGAATGTTTG 

GAGGACCCGT TGTTGCCATC TTAGGCATGA 
GCTTCCTGGG ATCAGCTGTT TTTATCCTCT 
TCACAGCATA TTTCAGGAGA AAATGCGTGG 
ATGAAGTTCT TACTTACATT AAATTTATCA 

20 AGAGTGTTCA AAAAATCCGC GAGGAGGAGC 

AGGGTATCAC TGTGGGTGTG GCTCCCATTG 
CTGTTCATAT GACCCTGGGC TTCGATCTGA 
TCTTCAATTC CATGACTTTT GCTTTGAAAG 
AAGCCTCAGT GGCTGTTGAC AGATTTAAGA 

25 TAAAGAACAA ACCAG CCAGT CCTCACATCA 

GGGACTCCTC CCACTCCAGT ATCCAGAACT 
ACAAGAGGGC TTCCAGGGGC AAGAAAGAGA 
AGGCGGTGCT GGCAGAGCAG AAAGGCCACC 
CCGAAGAGGA AGAAGGCAAG CACATCCACC 

30 ACAGCATCGA TCTGGAGATC CAAGAGGGTA 

GTGGAAAAAC CTCTCTCATT TCAGCCATTT 
TTGCAAT CAG TGGAACCTTC GCTTATGTGG 
TGAGAGACAA CATCCTGTTT GGGAAGGAAT 
ACAGCTGCTG CCTGAGGCCT GACCTGGCCA 

35 GAGAGCGAGG AGCCAACCTG AGCGGTGGGC 

TGTATAGTGA CAGGAGCATC TACATCCTGG 
TGGGCAACCA CATCTTCAAT AGTGCTATCC 
TTGTTACCCA CCAGTTACAG TACCTGGTTG 
GCTGTATTAC GGAAAGAGGC ACCCATGAGG 

40 CCATTTTTAA TAACCTGTTG CTGGGAGAGA 

AAACCAGTGG TTCACAGAAG AAGTCACAAG 
AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC 
GTTCAGTGCC CTGGTCAGTA TATGGTGTCT 
TCCTGGTTAT TATGGCCCTT TTCATGCTGA 

45 GGTTGAGTTA CTGGAT CAAG CAAGGAAGCG 

CCTCGGTGAG TGACAGCATG AAGGACAATC 
CCCTCTCCAT GGCAGTCATG CTGATCCTGA 
GCACGCTGCG AGCTTCCTCC CGGCTGCATG 
CTATGAAGTT TTTTGACACG ACCCCCACAG 

50 TGGATGAAGT TGACGTGCGG CTGCCGTTCC 

TGGTGTTCTT CTGTGTGGGA ATGATCGCAG 
GGCCCCTTGT CATCCTCTTT TCAGTCCTGC 
TGAAGCGTCT GGACAATATC ACGCAGTCAC 
AGGGCCTTGC CACCATCCAC GCCTACAATA 

55 AGCTGCTGGA TGACAACCAA GCTCCTTTTT 

CTGTGCGGCT GGACCTCATC AGCATCGCCC 
TTATGCACGG GCAGATTCCC CCAGCCTATG 
TAACGGGGCT GTTCCAGTTT ACGGTCAGAC 
CGGTGGAGAG GATCAATCAC TACATTAAGA 

60 AGAACAAGGC TCCCTCCCCT GACTGGCCCC 

AGATGAGGTA CCGAGAAAAC CTCCCTCTTG 
CTAAAGAGAA GATTGGCATT GTGGGGCGGA 
CCCTCTTCCG TCTGGTGGAG TT AT CTGGAG 
GTGATATTGG CCTTGCCGAC CTCCGAAGCA 

65 TGTTCAGTGG CACTGTCAGA TCAAATTTGG 

TTTGGGATGC CCTGGAGAGG ACACACATGA 
TTGAATCTGA AGTGATGGAG AATGGGGATA 
GCATAGCTAG AGCCCTGCTC CGCCACTGTA 
CCATGGACAC AGAGACAGAC TTATTGATTC 

70 GTACCATGCT GACCATTGCC CATCGCCTGC 
TGCTGGCCCA GGGACAGGTG GTGGAGTTTG 
GTTCCCGATT CTATGCCATG TTTGCTGCTG 
TCCTCCCTGT TGACGAAGTC TCTTTTCTTT 
CCCCTCATCG CGTCCTCCTA CCGAAACCTT 

75 GTTCCGGATT GGCTTGTGTG TTTCACTTTT 

ATTCCATATT CATGTAAACA AAATTTAGTT 
GGGAACCGTT ATTATAATTG TATCAGAGGC 
TCTATATATA ATTCTGTACA TAGCCTATAT 
TATTAAAATA AGCACTGTGC TAATAACAGT 

80 TTGCTGTACT AGAGATCTGG TTTTGCTATT 

CTCTAGCTGG TGGTTTCACG GTGCCAGGTT 
ATAGTGGGCC CTCCGACAGC CCCCTCTGCC 
GAGACGGGTG GGCGGCTGGA GACCATGCAG 
CTGTCCTGGT GTCACTTACT GTTTCTGTCA 

85 TTTCACTCCC TC CAT CAAG A ATGGGGATCA 

TTTCCTGCCT TCTTCTTTTT GCTGTTGTTT 
TCCCACTGCC TCAGGTTCCT ATGGCTGGCC 



AGTATATCAT CCCCAGTCCT GGGTATAGAA 180 

CGCACAGAGA CCGTGAAGAT TCCAAGTTCA 240 

CCTTGGAAAC AGCAGCCCGA GCCGAGGGCC 300 

TCAGAATCCT GGATGAGGAG CATCCCAAGG 360 

AGCCCATCCG GACTACTTCC AAACACCAGC 42 0 

GTATGACTTT TTCGTGGCTT TCTTCTCTGG 480 

CAATGGAAGA CGTGTGGTCT CTGTCCAAGC 540 

TAGAGAGACT GTGGCAAGAA GAGCTGAATG 600 

GGGTTGTGTG GATCTTCTGC CGCACCAGGC 660 

CGCAGCTGGC TGGCTTCAGT GGACCAGCCT 72 0 

AGGCAACAGA GTCTAACCTG CAGTACAGCT 780 

AAATCGTGCG GTCTTGGTCG CTTGCACTGA 840 

GCTTGCGGGG GGCCATCCTA ACCATGGCAT 900 

AAGAGAAATC CCTGGGTGAG CT CATC AAC A 960 

AGGCAGCAGC CGTTGGCAGC CTGCTGGCTG 1020 

TT TATAATGT AATTATTCTG GGACCAACAG 10 80 

TTTACCCAGC AATGATGTTT GCATCACGGC 1140 

CCGCCACGGA TGAACGTGTC CAGAAGATGA 1200 

AAATGTATGC CTGGGTCAAA GCATTTTCTC 12 60 

GTCGGATATT GGAAAAAGCC GGGTACTTCC 132 0 

TGGTGGTGAT TGCCAGCGTG GTGACCTTCT 1380 

CAGCAGCACA GGCTTTCACA GTGGTGACAG 1440 

TAACACCGTT TTCAGTAAAG TCCCTCTCAG 1500 

GTTTGTTTCT AATGGAAGAG GTTCACATGA 1560 

AGATAGAGAT GAAAAATGCC ACCTTGGCAT 1620 

CGCCCAAGCT GACCCCCAAA ATGAAAAAAG 1680 

AGGTGAGGCA GCTGCAGCGC ACTGAGCATC 1740 

TCCTCCTGGA CAGTGACGAG CGGCCCAGTC 1800 

TGGGCCACCT GCGCTTACAG AGGACACTGC 1860 

AACTGGTTGG AATCTGCGGC AGTGTGGGAA 192 0 

TAGGCCAGAT GACGCTTCTA GAGGGCAGCA 1980 

CCCAGCAGGC CTGGATCCTC AATGCTACTC 2040 

ATGATGAAGA AAGATACAAC TCTGTGCTGA 2100 

TTCTTCCCAG CAGCGACCTG ACGGAGATTG 2160 

AGCGCCAGAG GATCAGCCTT GCCCGGGCCT 2220 

ACGACCCCCT CAGTGCCTTA GATGCCCATG 2280 

GGAAACATCT CAAGTCCAAG ACAGTTCTGT 2340 

ACTGTGATGA AGTGATCTTC ATGAAAGAGG 2400 

AACTGATGAA TTTAAATGGT GACTATGCTA 2460 

CACCGCCAGT TGAGATCAAT TCAAAAAAGG 2 520 

ACAAGGGTCC TAAAACAGGA TCAGTAAAGA 2580 

AGCTTGTGCA GCTGGAAGAG AAAGGGCAGG 2640 

ACATCCAGGC TGCTGGGGGC CCCTTGGCAT 2700 

ATGTAGGCAG CACCGCCTTC AGCACCTGGT 2 760 

GGAACACCAC TGTGACTCGA GGGAACGAGA 2 82 0 

CTCATATGCA GTACTATGCC AGCATCTACG 2 88 0 

AAGCCATTCG AGGAGTTGTC TTTGTCAAGG 2 940 

ACGAGCTTTT CCGAAGGATC CTTCGAAGCC 3000 

GGAGGATTCT CAACAGGTTT TCCAAAGACA 3 060 

AGGCCGAGAT GTTCATCCAG AACGTTATCC 3120 

GAGTCTTCCC GTGGTTCCTT GTGGCAGTGG 3180 

ACATTGTCTC CAGGGTCCTG ATTCGGGAGC 3240 

CTTTCCTCTC CCACATCACG TCCAGCATAC 3300 

AAGGGCAGGA GTTTCTGCAC AGATACCAGG 3360 

TTTTGTTTAC GTGTGCGATG CGGTGGCTGG 342 0 

TCATCACCAC CACGGGGCTG ATGATCGTTC 34 80 

CGGGTCTCGC CATCTCTTAT GCTGTCCAGT 3540 

TGGCATCTGA GACAGAAGCT CGATTCACCT 3 600 

CTCTGTCCTT GGAAGCACCT GCCAGAAT T A 3 660 

AGGAGGGAGA GGTGACCTTT GAGAACGCAG 3 72 0 

TCCTAAAGAA AGTATCCTTC ACGATCAAAC * 3780 

CAGGATCAGG GAAGTCCTCG CTGGGGATGG 3840 

GCTGCATCAA GATTGATGGA GTGAGAATCA 3 900 

AACTCTCTAT CATTCCTCAA GAGCCGGTGC 3 960 

ACCCCTTCAA CCAGTACACT GAAGACCAGA 402 0 

AAGAATGTAT TGCTCAGCTA CCTCTGAAAC 4080 

ACTTCTCAGT GGGGGAACGG CAGCTCTTGT 4140 

AGATTCTGAT TTTAGATGAA GCCACAGCTG 42 0 0 

AAGAGACCAT CCGAGAAGCA TTTGCAGACT 4260 

ACACGGTTCT AGGCTCCGAT AGGATTATGG 4320 

ACACCCCATC GGTCCTTCTG TCCAACGACA 4380 

CAGAGAACAA GGTCGCTGTC AAGGGCTGAC 4440 

AGAG CATTGC CATTCCCTGC CTGGGGCGGG 4500 

GCCTTTCTCG ATTTTATCTT TCGCACAGCA 4560 

AGGGAGAGTC ATATTTTGAT TATTGTATTT 462 0 

TTTGTTCTTA ATTGCACTCT AAAAGGTTCA 4680 

CTATAATGAA GCTTTATACG TGTAGCTATA 4740 

TTACAGTGAA AATGTAAGCT GTTTATTTTA 4800 

GCATATTCCT TTCTATCATT TTTGTACAGT 4860 

AGACTGTAGG AAGAGTAGCA TTT CATTCTT 4920 

TTCTGGGTGT CCAAAGGAAG ACGTGTGGCA 4980 

GCCTCCCCAC AGCCGCTCCA GGGGTGGCTG 5040 

AGCGCCGTGA GTTCTCAGGG CTCCTGCCTT 5100 

GGAGAGCAGC GGGGCGAAGC CCAGGCCCCT 5160 

CAGAGACATT CCTCCGAGCC GGGGAGTTTC 5220 

CTAAACAAGA ATCAGT CTAT CCACAGAGAG 52 80 

ACTGCACAGA GCTCTCCAGC TCCAAGACCT 5340 
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GTTGGTTCCA AGCCCTGGAG CCAACTGCTG CTTTTTGAGG TGGCACTTTT TCATTTGCCT 5400 

ATTCCCACAC CTCCACAGTT CAGTGGCAGG GCTCAGGATT TCGTGGGTCT GTTTTCCTTT 5460 

CTCACCGCAG TCGTCGCACA GTCTCTCTCT CTCTCTCCCC TCAAAGTCTG CAACTTTAAG 5520 

CAGCTCTTGC TAATCAGTGT CTCACACTGG CGTAGAAGTT TTTGTACTGT AAAGAGACCT 5580 

ACCTCAGGTT GCTGGTTGCT GTGTGGTTTG GTGTGTTCCC GCAAACCCCC TTTGTGCTGT 5640 

GGGGCTGGTA GCTCAGGTGG GCGTGGTCAC TGCTGTCATC AGTTGAATGG TCAGCGTTGC 5700 

ATGTCGTGAC CAACTAGACA TTCTGTCGCC TTAGCATGTT TGCTGAACAC CTTGTGGAAG 5760 

CAAAAATCTG AAAATGTGAA TAAAATTATT TTGGATTTTG TAAAAAAAAA AAAAAAAAAA 5820 
AAAAAAAAAA AAAAAAAA 

Seq ID NO: 194 Protein sequencer 
Protein Accession #: NP_0 05679.1 



1 11 21 31 41 51 

; I | 1 I I 

MKDIDIGKEY IIPSPGYRSV RERTSTSGTH RDREDSKFRR TRPLECQDAL ETAARAEGLS 60 

LDASMHSQLR ILDEEHPKGK YHHGLSALKP IRTTSKHQHP VDNAGLFS CM TFSWLSSLAR 120 

VAHKKGELSM EDVWSLSKHE SSDVNCRRLE RLWQEELNEV GPDAASLRRV VWIFCRTRLI 180 

LSIVCLMITQ LAGFSGPAFM VKHLLEYTQA TESNLQYSLL LVLGLLLTEI VRSWSLALTW 240 

ALNYRTGVRL RGAILTMAFK KILKLKNIKE KSLGELINIC SNDGQRMFEA AAVGSLLAGG 300 

PWAILGMIY NVIILGPTGF LGSAVFILFY PAMMFASRKT AYFRRKCVAA TDERVQKMNE 360 

VLTYIKFIKM YAWVKAFSQS VQKIREEERR ILEKAGYFQG ITVGVAPIW VIASWTFSV 420 

HMTLGFDLTA AQAFTWTVF NSMTFALKVT PFSVKSLSEA SVAVDRFKSL FLMEEVHMIK 480 

NKPASPHIKI EMKNATLAWD SSHSSIQNSP KLTPKMKKDK RASRGKKEKV RQLQRTEHQA 540 

VLAEQKGHIiLi LDSDERPSPE EEEGKHIHLG HLRLQRTLHS IDLEIQEGKL VGICGSVGSG 600 

KTSIiISAILG QMTIiLEGSIA ISGTFAYVAQ QAWILNATLR DNILFGKEYD EERYNSVLNS 660 

CCLRPDLAIL PSSDLTEIGE RGANLSGGQR QRISLARALY SDRSIYILDD PLSALDAHVG 72 0 

NHIFNSAIRK HLKSKTVLFV THQLQYLVDC DEVIFMKEGC ITERGTHEEL MNLNGDYATI 780 

FNNIiLLGETP PVEINSKKET SGSQKKSQDK GPKTGSVKKE KAVKPEEGQL VQLEEKGQGS 840 

VPWSVYGVYI QAAGGPLAFL VIMALFMLNV GSTAFSTWWL SYWIKQGSGN TTVTRGNETS 900 

VSDSMKDNPH MQYYASIYAL SMAVML I LKA IRGWFVKGT LRASSRLHDE LFRRILRSPM 960 

KFFDTTPTGR ILNRFSKDMD EVDVRLPFQA EMFIQNVILV FFCVGMIAGV FPWFIA7AVGP 1020 

LVILFSVLHI VSRVLIRELK RLDNITQSPF LSHITSSIQG L AT I HAYNKG QEFLHRYQEL 1080 

LDDNQAPFFL FTCAMRWLAV RLDLISIALI TTTGLMIVLM HGQIPPAYAG LAI S YAVQIjT 1140 

GLFQFTVRLA SETEARFTSV ERINHYIKTL SLEAPARIKN KAPSPDWPQE GEVTFENAEM 1200 

RYRENLPLVli KKVSFTIKPK EKIGIVGRTG SGKSSLGMAL FRLVELSGGC IKIDGVRISD 1260 

IGLADLRSKL SIIPQEPVLF SGTVRSNLDP FNQYTEDQIW DALERTHMKE CIAQLPLKLE 1320 

SEVMENGDNF SVGERQLI.CI ARALLRHCKI LILDEATAAM DTETDLLIQE TIREAFADCT 1380 
MLTIAHRLHT VLGSDRIMVL AQGQWEFDT PSVLLSNDSS RFYAMFAAAE NKVAVKG 

Seq ID NO: 195 DNA sequence 
Nucleic Acid Accession #: NM_006470 
Coding sequence: 2 2 8.. 192 2 



1 11 21 31 41 51 

GCTGTCCTGA GCCTGAGTAC TCTAGCTGCC TTGTCGCCAT CGCATCTGGC TGCCATCCAG 60 

CGCCAGCACA CAGTAATGAG TGGCCGAGCT TCCTCTGGGA GGGAGGAAAC AGTTAAAATC 12 0 

TTGCAGCAGC TGCAATCATC TAGGCGTGGT TCTCTTGTCT GACTTGGGCT GCACAGATCC 180 

TGGGCCAAGG GACAGAAGAA AGACAGCCTA GGAGCAGAGC CTCCCAGATG GCTGAGTTGG 240 

ATCTAATGGC TCCAGGGCCA CTGCCCAGGG CCACTGCTCA GCCCCCAGCC CCTCTCAGCC 300 

CAGACTCTGG GTCACCCAGC CCAGATTCTG GGTCAGCCAG CCCAGTGGAA GAAGAGGACG 360 

TGGGCTCCTC GGAGAAGCTT GGCAGGGAGA CGGAGGAACA GGACAGCGAC TCTGCAGAGC 420 

AGGGGGATCC TGCTGGTGAG GGGAAAGAGG TCCTGTGTGA CTTCTGCCTT GATGACACCA 480 

GAAGAGTGAA GGCAGTGAAG TCCTGTCTAA CCTGCATGGT GAATTACTGT GAAGAGCACT 540 

TGCAGCCGCA TCAGGTGAAC AT C AAACTGC AAAGCCACCT GCTGACCGAG CCAGTGAAGG 600 

ACCACAACTG GCGATACTGC CCTGCCCACC ACAGCCCACT GTCTGCTTTC TGCTGCCCTG 660 

ATCAGCAGTG CATCTGCCAG GACTGTTGCC AGGAGCACAG TGGCCACACC ATAGTCTCCC 720 

TGGATGCAGC CCGCAGGGAC AAGGAGGCTG AACTCCAGTG CACCCAGTTA GACTTGGAGC 780 

GGAAACTCAA GTTGAATGAA AATGCCATCT CCAGGCTCCA GGCTAACCAA AAGTCTGTTC 840 

TGGTGTCGGT GTCAGAGGTC AAAGCGGTGG CTGAAATGCA GTTTGGGGAA CTCCTTGCTG 900 

CTGTGAGGAA GGCCCAGGCC AATGTGATGC TCTTCTTAGA GGAGAAGGAG CAAGCTGCGC 960 

TGAGCCAGGC CAACGGTATC AAGGCCCACC TGGAGTACAG GAGTGCCGAG ATGGAGAAGA 102 0 

GCAAGCAGGA GCTGGAGAGG ATGGCGGCCA TCAGCAACAC TGTCCAGTTC TTGGAGGAGT 1080 

ACTGCAAGTT TAAGAACACT G AAGACAT CA CCTTCCCTAG TGTTTACGTA GGGCTGAAGG 1140 

ATAAACT CTC GGGCATCCGC AAAGTTATCA CGGAATCCAC TGTACACTTA ATCCAGTTGC 1200 

TGGAGAACTA TAAGAAAAAG CTCCAGGAGT TTTCCAAGGA AGAGGAGTAT GACATCAGAA 1260 

CTCAAGTGTC TGCCGTTGTT CAGCGCAAAT ATTGGACTTC CAAACCTGAG CCCAGCACCA 1320 

GGGAACAGTT CCTCCAATAT GCGTATGACA TCACGTTTGA CCCGGACACA GCACACAAGT 1380 

ATCTCCGGCT GCAGGAGGAG AACCG CAAGG TCACCAACAC CACGCCCTGG GAGCATCCCT 1440 

ACCCGGACCT CCCCAGCAGG TTCCTGCACT GGCGGCAGGT GCTGTCCCAG CAGAGTCTGT 1500 

ACCTGCACAG GTACTATTTT GAGGTGGAGA TCTTCGGGGC AGGCACCTAT GTTGGCCTGA 1560 

CCTGCAAAGG CATCGACCGG AAAGGGGAGG AGCGCAACAG TTGCATTTCC GGAAACAACT 1620 

TCTCCTGGAG CCTCCAATGG AACGGGAAGG AGTTCACGGC CTGGTACAGT GACATGGAGA 1680 

CCCCACTCAA AGCTGGCCCT TTCCGGAGGC TCGGGGT CTA TATCGACTTC CCGGGAGGGA 1740 

TCCTTTCCTT CTATGGCGTA GAGTATGATA CCATGACTCT GGTTCACAAG TTTGCCTGCA 180 0 

AATTTTCAGA ACCAGTCTAT GCTGCCTTCT GGCTTTCCAA GAAGGAAAAC GCCATCCGGA 1860 

TTGTAGATCT GGGAGAGGAA CCCGAGAAGC CAGCACCGTC CTTGGGGGTG ACTGCTCCCT 1920 

AGACT CCAGG AGCCATATCC CAGACCTTTG CCAGCTACAG TGATGGGATT TGCATTTTAG 1980 

GGTGATTTGT GGGCAGAAAT AACTGCTGAT GGTAGCTGGC TTTTGAAATC CTATGGGGTC 2040 

TCTGAATGAA AACATTCTCC AGCTGCTCTC TTTTGCTCCA TATGGTGCTG TTCTCTATGT 2100 

GTTTGCAGTA ATTCTTTTTT TTTTTTTTGA GACGGAGTCT CGCACTGTTG CCCAGGCTGG 2160 

AGAGCAGTGG CGCGATCTTG GCTCACTGCA AGCTCCGCCT CCCGAGTTCA AGCAATTCTC 2220 

CTGCCTCAGC CTCCCGAGTA GCTGGGATTA CAGGTGCCTG CCACCACACC CAGCTAATGT 2280 

TTTGTATTTT TAGTAGAGAT GGGGTTTCAC CATGTTGGCC AGGCAGATCT CAAACT CCTG 2340 
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ACCTCGTGAT GCACCCACCT CGGCCTCCCA AAGTGCTGGG ATTACATGCG TGAGCCACTG 
CGCCCTGCCT GTTTGTAGTA ATTTTTAGGC ACCAAATCTC CCTCATCTTC TAGTGCCATT 
CTCCTCTCTG TTCAGGTAAA TGTCACACTG TGCCCAGAAT GGATGACCAG GAACCTTAAA 
GAGTGGCTGA AAAGATTGCA GAGTTATCAT AATAAATTGC TAACTTGCGT 



Seq ID NO: 196 Protein sequence: 
Protein Accession #: NP_006461 
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MAELDLMAPG 
DSAEQGDPAG 
EPVKDHNWRY 
LDLERKLKLN 
EQAAIiSQANG 
VGLKDKLSGI 
EPSTREQFLQ 
QQSLYLHRYY 
SDMETPLKAG 
NAIRIVDLGE 



PLPRATAQPP 
EGKEVLCDFC 
CPAHHSPLSA 
ENAISRLQAN 
IKAHLEYRSA 
RKVITESTVH 
YAYDITFDPD 
FEVEI FGAGT 
PFRRLGVYID 
EPEKPAPSLG 



APLSPDSGSP 
LDDTRRVKAV 
FCCPDQQCIC 
QKSVLVSVSE 
EMEKSKQELE 
LIQLLENYKK 
TAHKYLRLQE 
YVGLTCKGID 
FPGGILSFYG 
VTAP 



SPDSGSASPV 
KSCLTCMVNY 
QDCCQEHSGH 
VKAVAEMQFG 
RMAAISNTVQ 
KLQEFSKEEE 
ENRKVTNTTP 
RKGEERNSC I 
VEYDTMTLVH 



EEEDVGSSEK 
CEEHLQPHQV 
TIVSLDAARR 
ELLAAVRKAQ 
FLEEYCKFKN 
YDIRTQVSAV 
WEHPYPDLPS 
SGNNFSWSLQ 
KFACKFSEPV 



LGRETEEQDS 
NIKLQSHLLT 
DKEAELQCTQ 
ANVMLFLEEK 
TEDITFPSVY 
VQRKYWTSKP 
RFLHWRQVLS 
WNGKEFTAWY 
YAAFWLSKKE 



2400 
2460 
2520 



60 
120 
180 
240 
300 
360 
420 
480 
540 



25 
30 
35 
40 
45 
50 
55 



Seq ID NO: 197 DNA sequence 
Nucleic Acid Accession #: NM_004316 
Coding sequence: 433-1149 



CCCGAGACCC 
GCGCGTTCAG 
GCGCCAGCGG 
GGAGGAGGGG 
TTGCTCCCAC 
GCTCCCGCTT 
GTCCCCCTCG 
CCGCCGCTGC 
CAGCCGCAGC 
GCCGCGGCGG 
CAGCAGCAGC 
TCAGGGGGCG 
GAACTGATGC 
CAGCAGCCGG 
AACCTGGGCT 
AGTAAGGTGG 
GACGAGCATG 
CCCAACTACT 
GACGAGGGCT 
TGGTTCTGAG 
ATCGCACAAC 
AAAAGAAAAA 
CCAACCCCAT 
AGCGCTCAGA 
ACCTGAGTCA 
GAGCAGCACA 
GCTCGGGTCC 
GAGTTGGTGT 



11 
I 

GGCGCAAGAG 
CACTGACTTT 
CAGCCTCACA 
AGGGAGGAGG 
TCTAAGAAGT 
CATATTTCCT 
CGGGCCCCGC 
GCATGGAAAG 
CCCAGCAGCG 
CCGCAGCCGC 
AGCAGCAGCA 
GTCACAAGTC 
GCTGCAAACG 
CCGCCGTGGC 
TTGCCACCCT 
AGACACTGCG 
ACG CGGTG AG 
CCAACGACTT 
CTTACGACCC 
GGGCTCGGCC 
CTGCATCTTT 
AAAAGAAGAA 
CGCCAACTAA 
ACAGTAT CTT 
ATGCGCAAAA 
CGCGTTATAG 
CTTCACCTCC 
CTTTC 



21 
I 

AGCGCAGCCT 
TGCTGCTGCT 
CGCGAGCGCC 
AGGCGGCGTG 
CTCCCGGGGA 
TTTCTTTCCC 
ACCTCGCGTC 
CTCTGCCAAG 
CTTCCTGCCG 
CGCAGCGGCA 
GCAGGCGCCG 
AG CGCCCAAG 
CCGGCTCAAC 
GCGCCGCAAC 
TCGGGAGCAC 
CTCGGCGGTC 
CGCCGCCTTC 
GAACT CCATG 
GCTCAGCCCC 
TGGTCAGGCC 
AGTGCTTTCT 
GAAGAAGAAA 
GCGAGGCATG 
TGCACTCCAA 
TGCAGCTTGT 
TAACTCCCAT 
CCGCCCTTTC 



31 

I 

TAGTAGGAGA 
TCTGCTTTTT 
ACGCGAGGCT 
CAGGGAGGAG 
TTTTGTATAT 
TCTCTGTTCC 
CCGGATCGCT 
ATGGAGAGCG 
CCCGCAGCCT 
GCGCAGAGCG 
CAGCTGAGAC 
CAAGTCAAGC 
TTCAGCGGCT 
GAGCGCGAGC 
GTCCCCAACG 
GAGTACATCC 
CAGGCAGGCG 
GCCGGCTCGC 
GAGGAGCAGG 
CTGGTGCGAA 
TGTCAGTGGC 
AGAGAAGAAG 
CCTGAGAGAC 
TCATTCACGG 
GTGCAAAAGC 
CACCTCTAAC 
TTAGAGTGCA 



41 
I 

GGAACGCGAG 
TTTTTCTTAG 
CCCGAAGCCA 
AAAAAGCATT 
ATTTTTTAAC 
TGCACCCAAG 
CTGATTCCGG 
GCGGCGCCGG 
GTTTCTTTGC 
CGCAGCAGCA 
CGGCGGCCGA 
GACAGCGCTC 
TTGGCTACAG 
GCAACCGCGT 
GCGCGGCCAA 
GCGCGCTGCA 
TCCTGTCGCC 
CGGTCTCATC 
AGCTTCTCGA 
TGGACTTTGG 
GTTGGGAGGG 
AAAAAAACGA 
ATGGCTTTCA 
AGATATGAAG 
AGTGGGCTCC 
ACGCACAGCT 
GTTCTTAGCC 



51 

1 

ACGCGGCAGA 
AAACAAGAAG 
ACCCGCGAAG 
TTCACCTTTT 
TTCCGTCAGG 
TTCTCTCTGT 
GACTCCTTGG 
CCAGCAGCCC 
CACGGCCGCA 
GCAGCAGCAG 
CGGCCAGCCC 
GTCTTCGCCC 
CCTGCCGCAG 
CAAGTTGGTC 
CAAGAAGATG 
GCAGCTGCTG 
CACCATCTCC 
CTACTCGTCG 
CTTCACCAAC 
AAGCAGGGTG 
GGAGAAAAGG 
AAACAGTCAA 
GAAAACGGGA 
AGCAACTGGG 
TGGCAGAAGG 
GAAAGTTCTT 
CTCTAGAAAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
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Seq ID NO: 198 Protein sequence: 
Protein Accession #: NP__004307 
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MESSAKMESG GAGQQPQPQP QQPFLPPAAC FFATAAAAAA AAAAAAAQSA QQQQQQQQQQ 
QQQQAPQLRP AADGQPSGGG HKSAPKQVKR QRSSSPELMR CKRRLNFSGF GYSLPQQQPA 
AVARRNERER NRVKLVNLGF ATLREHVPNG AANKKMSKVE TLRSAVEYIR ALQQLLDEHD 
AVSAAFQAGV LSPTISPNYS NDLNSMAGSP VSSYSSDEGS YDPLSPEEQE LLDFTNWF 

Seq ID NO: 199 DNA sequence 
Nucleic Acid Accession #: NM_007015 
Coding sequence: 1-1005 



ATGACAGAGA 
TGCAGCCCCC 
AAGGTGGGAG 
GCCTTCTACT 
ATCAATGGGA 
TTTAAAATGG 
ACAGGAATTC 
ATTCCTGAGG 
ATGCCAGTCA 
GACAACAGCT 
CTTAAACCAA 
GTTCCAACTA 
CTGAATAATG 



11 

I 

ACTCCGACAA 
CGGCGTACGC 
CCGTGGT CCT 
TCTGGAAGGG 
AACTACAAGA 
GAAGTGGAGC 
GTTTTGCTGG 
TGGGCGCCGT 
AATATGAAGA 
TCTTGAGTTC 
CCTATCCAAA 
CCACAAAAAG 
AAACCAGACC 



21 

I 

AGTTCCCATT 
TACGCTGACG 
CATTTCGGGA 
GAG CG ACAGT 
TGGGTCAATG 
TGAAGAAGCA 
AGGAGAGAAG 
GACCAAACAG 
AAATTCTCTT 
TAAGGTGTTA 
AGAAATCCAG 
ACCACACAGT 
CAGTGTTCAA 



31 
I 

GCCCTGGTGG 
GTGAAGCCCT 
GCTGTGCTGC 
CACATTTACA 
GAAATAGACG 
ATTGCAGTTA 
TGCTACATTA 
AGCATCTCCT 
ATCTGGGTGG 
GAACT CTGCG 
AGGGAAAGAA 
GGACCACGGA 
GAGGACTCAC 



41 
I 

GACCTGATGA 
CCAGCCCCGC 
TGCTCTTTGG 
ATGTCCATTA 
CTGGGAACAA 
ATGATTTCCA 
AAGCGCAAGT 
CCAAACTGGA 
CTGTAGATCA 
GTGACCTTCC 
GAGAAGTGGT 
GCAACCCAGG 
AAGCCTTCAA 



51 
I 

CGTGGAATTC 
GCGGCTGCTC 
GGCCATCGGG 
CACCATGAGT 
CTTGGAGACC 
GAATGGCATC 
GAAGGCTCGT 
AGGCAAGATC 
GCCTGTGAAG 
TATTTTCTGG 
AAGAAAAATT 
CGCTGGAAGA 
TCCTGATAAT 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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CCTTATCATC AGCAGGAAGG GGAAAGCATG ACATTCGACC CTAGACTGGA TCACGAAGGA 840 

ATCTGTTGTA TAGAATGTAG GCGGAGCTAC ACCCACTGCC AGAAGATCTG TGAACCCCTG 900 

GGGGGCTATT ACCCATGGCC TTATAATTAT CAAGGCTGCC GTTCGGCCTG CAGAGTCATC 960 

ATGCCATGTA GCTGGTGGGT GGCCCGTATC TTGGGCATGG TGTGAAATCA CTTCATATAT 1020 

CACGTGCTGT AAAATAAGAA CTAGCTGAAG AGACAACCAA AGAAGCATTA AGGCAGGTTG 1080 

ATGCTGATGG GACCATAAAA TATTTTTACA CGCAGCCTGA GCGGTTATTC TTGACACTCT 1140 

TAACAGAATT TTTTTAATCG TTTTCCAGAA CTTTAGTATA TGCAAATGCA CTGAAAGGGT 1200 

AGTTCAAGTC TAAAATGCCA TAACCCCGTT ATTTGTTATT TTTTATTTGC ATTGATTTGC 12 60 

CATAAGTCTT CCCTTGCTTG CATCTTCCAA AGCTATTTCG AAATAAACAC GAAAATTTAC 132 0 
AGTTTGCC 



Seq ID NO: 200 Protein sequence: 
Protein Accession #: NP_008946 

1 11 21 31 41 51 

I I | ] I I 

MTENSDKVPI ALVGPDDVEF CSPPAYATLT VKPSSPARLL KVGAWLISG AVLLLFGAIG 60 
AFYFWKGSDS H I YNVH YTMS INGKLQDGSM EIDAGNNLET FKMGSGAEEA IAVNDFQNGI 120 
TGIRFAGGEK CYIKAQVKAR IPEVGAVTKQ SISSKLEGKI MPVKYEENSL IWVAVDQPVK 180 
DNSFLSSKVL ELCGDLPIFW LKPTYPKEIQ RERREWRKI VPTTTKRPHS GPRSNPGAGR 240 
LNNETRPSVQ EDSQAFNPDN PYHQQEGESM TFDPRLDHEG ICCIECRRSY THCQKICEPL 300 
GGYYPWPYNY QGCRSACRVI MPCSWWVARI LGMV 



Seq ID NO: 201 DNA sequence 

Nucleic Acid Accession NM_000728.2 

Coding sequence: 112.. 495 

1 11 21 31 41 51 

GTAATAAGAG CGGGGTCTCC GCGGGGAAGG CGCCCACAGC AGGTGTGGTG TTCATCCCGG 60 

GTCGACCGGC CGCTCGCGCT GCCCTGAAAC TCTAGTCGCC AGAGAGGCGG CATGGGTTTC 120 

CGGAAGT TCT CCCCCTTCCT GGCTCTCAGT ATCTTGGTCC TGTACCAGGC GGGCAGCCTC 180 

CAGGCGGCGC CATT CAGGTC TGCCCTGGAG AGCAGCCCAG ACCCGGCCAC ACTCAGTAAA 240 

GAGGACGCGC GCCTCCTGCT GGCTGCACTG GTGCAGGACT ATGTGCAGAT GAAGGCCAGT 300 

GAGCTGAAGC AGGAGCAGGA GACACAGGGC TCCAGCTCCG CTGCCCAGAA GAGAGCCTGC 360 

AACACTGCCA CCTGTGTGAC T CAT CGGCTG GCAGGCTTGC TGAGCAGATC AGGGGGCATG 42 0 

GTGAAGAGCA ACTTCGTGCC CACCAATGTG GGTT CCAAAG CCTTTGGCAG GCGCCGCAGG 480 

GACCTTCAAG C C TG AGC AG A TGAATGACTC CAGGAAGAAG GTGTGTCCTA AATCCAATGA 540 

CATATCCTTA TAAGAGATTC ACTCAGAAGA CACATGTGGA GAAGGTGACA TGACAGAGGC 60 0 

AAGGAGGCAC AAGCCAAGGA AGTCTGTGTC TACCAGAAGC CAGAATCACA GAACAGTCTC 660 

TGGAAGAAGA GCAGCCCTGC TGACACCTAG AGTTTGGACT TCCAGCTTCC AGAACTGTGA 720 

GAGAATAATT TCTGTTGTTT TAAGCCACAA AGTTTGTGGT AATTTGTTAT GACAGCCCTA 780 

GGAAACTAAT ACAATACATT TTCATTTATT TTGGGTAAAT GCCTTGGAGT GGGATTGCTG 840 

GGTTATTTGG AAAGTGTGTA TTTAACTCTG TAAGAAACTG CCAAACTATT TTCTGAAGTG 900 

ACTGTACCAC TTCGCCTTCT TGCCAGCCAC ATATGAGAGC TCTAGTATTT CCACAAATAG 960 

GTATGTAGCA GTATCTCATT GCTGTTTTAA TTTGTATTTC CCCAATGACT AATGACGTTG 102 0 

AGCATCTATT TTACCATATG TTTATCACCT TTATTGAAGG GTCTGTTTAA ATCTTCTGCT 1080 

AAATTTTTGT TGGCTTGCTT GCTTTATTAG TGTTGAGTTT TTAGAGCTCT TTATATGTTG 1140 

TGGATGCAAG ATTGTTTTCA GATATATAGT TTGGAAACTT CCTTCCCCTG AATCTGCGGA 1200 

TTGCTTTTTC ATTTTCTTAG CAGTGTCTCT CACAGAGAAA AAGTTGTAAT TTGAATAAGA 1260 

TCCAATTCAT CTTTTTTTTT CTTTTATGTA TTGTGCTTTT AGTTCATGTC TAAGAACTCT 132 0 

TTGCCTAACT AAGGTCCCAA GGTCACAATA ACCTTATTCT ATACTTTCTT GTAAAAGTTT 13 80 

TATAGTTTTA T ATT TTAT AT GTAGATTAGT GATCTATTTT GAGTTAATTT TTGTATAAGG 1440 

TGAGAGGTGT AGGTTGAAAT T C AT ACCTGT GAATATAGAT ACCCAATTGT TTCAGTGCCA 1500 

TTTGTTAAAA AGACTGTTAT TTCACCATTT AATTGCCCCT GCACCTTTGT CAAAAAGCAA 1560 

CTGATCATAT TTGTGTGGGT ATATTTCTGG GTTCTCAATT CTGTCTCATT GATTGATTTG 1620 

ACCATTCTTT TGCCAATGTC ATACTGCCTT GATTAGTGTA GTGTTAAAGT GAAT CTCAAA 1680 

ACCAGATAAT GTGGGTCTAC CAACATTGTT CATTCTTGTT CAAAAAGATT TTAGCTACAT 1740 

CTAAAATATT TTCTACATCT TTTATACATT TTAGAATCAG TGTGTTACTA TCTACAAAAT 1800 

TTCTGATGAG ATTTTTAATG GGATTGTGTT AAATCAGTGG GTTAATTTTG GGAGAATTAG 1860 

CATATTAATA AT AT T AAGT C GTTCAATTCA TGAACACAAT ACATGTTTTC ACTTATTTAG 1920 

GTTTTCTCTG TTTTTTTTTT TTTAACAGTG TTCTCAGTTT TCAACAGAAA TATTCTACAC 1980 

ATATCTTGTT AGATTTTTAA CTATTTTATT TTTTGGTGCT AATGTAAATG GTACTTAAAC 2 040 

ATTTTTGTTT TTAATTGTTC ATTGCTAGTA GATAGAAATA CAATATTTAA AATATTAGGA 2100 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



Seq ID NO: 202 Protein sequence: 
Protein Accession #: NP_000719.1 

1 11 21 31 41 51 

MGFRKFSPFL ALSILVLYQA GSLQAAPFRS ALESSPDPAT LSKEDARLLL AALVQDYVQM 60 
KASELKQEQE TQGSSSAAQK RACNTATCVT HRLAGLLSRS GGMVKSNFVP TNVGSKAFGR 120 
RRRDLQA 



Seq ID NO: 203 DNA sequence 
Nucleic Acid Accession #: NM_001741 
Coding sequence: 71.. 496 

1 11 21 31 41 51 

I ! 1 I I 1 

CTCTGGCTGG ACGCCGCCGC CGCCGCTGCC ACCGCCTCTG ATCCAAGCCA CCTCCCGCCA 60 
GAGAGGTGTC ATGGGCTTCC AAAAGTTCTC CCCCTTCCTG GCTCTCAGCA TCTTGGTCCT 120 
GTTGCAGGCA GGCAGCCTCC ATGCAGCACC ATTCAGGTCT GCCCTGGAGA GCAGCCCAGC 180 
AGACCCGGCC ACGCT CAGTG AGGACGAAGC GCGCCTCCTG CTGGCTGCAC TGGTGCAGGA 240 
CTATGTGCAG AT G AAGGCCA GTGAGCTGGA GCAGGAGCAA GAGAGAGAGG GCTCCAGCCT 300 



268 



10 



WO 02/086443 

GGACAGCCCC AGATCTAAGC GGTGCGGTAA TCTGAGTACT TGCATGCTGG GCACATACAC 3 60 

GCAGGACTTC AACAAGTTTC ACACGTTCCC CCAAACTGCA ATTGGGGTTG GAGCACCTGG 420 

AAAGAAAAGG GATATGTCCA GCGACTTGGA GAGAGACCAT CGCCCTCATG TTAGCATGCC 480 

CCAGAATGCC AACTAAACTC CTCCCTTTCC TTCCTAATTT CCCTTCTTGC ATCCTTCCTA 540 

TAACTTGATG CATGTGGTTT GGTTCCTCTC TGGTGGCTCT TTGGGCTGGT ATTGGTGGCT 600 

TTCCTTGTGG CAGAGGATGT CTCAAACTTC AGATGGGAGG AAAGAGAGCA GGACT CACAG 660 

GTTGGAAGAG AATCACCTGG GAAAATACCA GAAAATGAGG GCCGCTTTGA GTCCCCCAGA 720 

GATGTCATCA GAGCTCCTCT GTCCTGCTTC TGAATGTGCT GATCATTTGA GGAATAAAAT 780 
TATTTTTCCC C 



PCT/US02/12476 



Seq ID NO: 2 04 Protein sequence: 
Protein Accession #: NP 001732 



15 



20 



25 



30 



35 



40 



ii 



21 



31 

! 



41 



51 



MGFQKFSPFL ALSILVLLQA GSLHAAPFRS ALESSPADPA TLSEDEARLL LAALVQDYVQ 
MKASELEQEQ EREGSSLDSP RSKRCGNLST CMLGTYTQDF NKFHTFPQTA IGVGAPGKKR 
DMSSDLERDH RPHVSMPQNA N 

Seq ID NO: 205 DNA sequence 
Nucleic Acid Accession #: NM_005361 
Coding sequence: 1-945 



ATGCCT CTTG 
GAGGCCCTGG 
TCCTCTTCTA 
CCTCCCCACA 
AGACAATCCG 
CTGGAGTCCG 
CTCCTCAAGT 
AGAAATTGCC 
GTCTTTGGCA 
TGCCTGGGCC 
CTCCTGATAA 
ATCTGGGAGG 
CATCCCAGGA 
GTGCCCGGCA 
ACCAGCTATG 
TACCCACCCC 



11 
I 

AGCAGAGGAG 
GCCTGGTGGG 
CTCTAGTGGA 
GTCCTCAGGG 
ATGAGGGCTC 
AGTTCCAAGC 
ATCGAGCCAG 
AGGACTTCTT 
TCGAGGTGGT 
TCTCCTACGA 
TCGTCCTGGC 
AG CTGAGTAT 
AGCTGCTCAT 
GTGATCCTGC 
TGAAAGTCCT 
TGCATGAACG 



21 
I 

TCAGCACTGC 
TGCGCAGGCT 
AGTTACCCTG 
AGCCTCCAGC 
CAGCAACCAA 
AGCAATCAGT 
GGAGCCGGTC 
TCCCGTGATC 
GGAAGTGGTC 
TGGCCTGCTG 
CATAAT CGCA 
GTTGGAGGTG 
GCAAGATCTG 
ATGCTACGAG 
GCACCATACA 
GGCTTTGAGA 



31 
I 

AAGCCTGAAG 
CCTGCTACTG 
GGGGAGGTGC 
TTCTCGACTA 
GAAGAGGAGG 
AGGAAGATGG 
ACAAAGGCAG 
TTCAGCAAAG 
CCCATCAGCC 
GGCGACAATC 
ATAGAGGGCG 
TTTGAGGGGA 
GTGCAGGAAA 
TTCCTGTGGG 
CTAAAGATCG 
GAGGGAGAAG 



41 
I 

AAGGCCTTGA 
AGGAGCAGCA 
CTGCTGCCGA 
CCATCAACTA 
GGCCAAGAAT 
TTGAGTTGGT 
AAATGCTGGA 
CCTCCGAGTA 
ACTTGTACAT 
AGGTCATGCC 
ACTGTGCCCC 
GGGAGGACAG 
ACTACCTGGA 
GTCCAAGGGC 
GTGGAGAACC 
AGTGA 



51 

I 

GGCCCGAGGA 
GACCGCTTCT 
CTCACCGAGT 
CACTCTTTGG 
GTTTCCCGAC 
TCATTTTCTG 
GAGTGTCCTC 
CTTGCAGCTG 
CCTTGTCACC 
CAAGACAGGC 
TGAGGAGAAA 
TGTCTTCGCA 
GTACCGGCAG 
CCTCATTGAA 
TCACATTTCC 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



45 



Seq ID NO: 206 Protein sequence: 
Protein Accession #: NP 005352 



50 
55 
60 
65 
70 
75 
80 
85 



MPLEQRSQHC 
PPHSPQGASS 
LLKYRAREPV 
CLGLSYDGLL 
HPRKLLMQDL 
YPPLHERALR 



11 
I 

KPEEGLEARG 
FSTTINYTLW 
TKAEMLESVL 
GDNQVMPKTG 
VQENYLEYRQ 



21 

I 

EALGLVGAQA 
RQSDEGSSNQ 
RNCQDFFPVI 
LLIIVLAIIA 
VPGSDPACYE 



31 

I 

PATEEQQTAS 
EEEGPRMFPD 
FSKASEYLQL 
IEGDCAPEEK 
FLWGPRALIE 



Seq ID NO: 207 DNA sequence 
Nucleic Acid Accession #: NM_021115 
Coding sequence; 743-2893 



AAAGGAAGGG 
GGCACCGCCC 
CCCAAACTAA 
CCCTTTGGGT 
GCACCCTGAA 
GGGCGAGCTG 
ACCGCTGCTT 
TTCGCTCAAG 
CACTGTCCAA 
CACGGAGAAG 
AGAAGTGCCC 
GCAAATCTCC 
ACCCGGGGAG 
GGCCCTGATG 
GACCACTACC 
CTGCAGTGTG 
GCCCCTCAAC 
GGAGCTCCAG 
GGACGGCCCT 
CCGAAGCCCC 
GACCTTCCAG 
CTCTGGGGAT 
CCTGGGCTAT 
CTGGAGCAGC 
CATCGGCCGC 
CTGGACGATT 



11 

I 

AGGGAGGGAG 
TTAGGAGGGC 
CTGGTGTCTT 
CCTTACCTCC 
GAGAGAGTGG 
GTGCTGGATG 
CCAGAGGAGG 
CAGGTGAACT 
AGGGCAGGGT 
CCTGGCCCAC 
CTTTGGCTGG 
CCCTTCACTT 
CCTGGGCCTG 
GACAAAGGTG 
TCCACCATTA 
AGCTTCTCCA 
AACTTTCTGG 
GTGAAGAGTG 
ACCCTGACCG 
ACCAACACCA 
CTTCACTACC 
GTCACGGTGA 
GAGCTCCAGG 
CAGGAGCCCA 
GTCCTCTCCC 
GAAGCTCCAG 



21 

I 

AAAGGAGAAG 
CACCCTCAGA 
TTCTCCTCTT 
TGCCCTCAGG 
TAACAGCGCC 
GGACCGCACC 
CCCGCCCCAA 
CTGCCAGGAA 
CCCAGCCAGC 
CGGGGGACCC 
ACCGAAAGGA 
CGCAGCCCTA 
ACATGGCCCA 
AGAATGAGCT 
TCACCACCAC 
AT CCTGAGGG 
AGTG CACAT A 
TGAACCTGTC 
TCCTGGCCAA 
TCTCCGTCTA 
AGGCCTTCAT 
TGGACCTGCA 
GCGCTAAGAT 
TCTGCTCAGC 
CAAGTTACCC 
AGGGCCAGAA 



31 
1 

TTGGTTTAGA 
GTCTGACAGC 
CCAAGATGCT 
AGCCCCGGAG 
CCCCAGTTCC 
CTCTGCACAT 
GCACGCCTTG 
GCAGCTGAGG 
GTCCCAGGGC 
GGACCCCATC 
GAGTGCGGTC 
TGTGGCCCAC 
GGAGGCCCCC 
GACTGGGTCA 
GGTCATCACC 
GTACATTGAC 
CAACGTGACA 
CGATGGGGAA 
CCAGACACTC 
CTTCCGGACC 
GCTGAGCTGC 
CTCAGGTGGG 
GCTGACATGC 
TCCTTGTGGA 
TGAAAACACA 
GCTGCACCTG 



41 
I 

SSSTLVEVTL 
IiESEFQAAIS 
VFGIEWEW 
IWEELSMLEV 
TSYVKVLHHT 



41 

I 

GGCCAGCCGG 
AGGTGAAGGT 
CTTCCCGAGG 
AGAGGCAGTC 
TCACAGTCGG 
CACGACATCC 
CCCCCCAAGA 
CCCAAGGCCA 
CTAGATCTCC 
GTGGCCTCCG 
CCTACAACAC 
ACACTCCCCC 
CAGGAGGACA 
GCCTCAGAGG 
ACCGAGCAGG 
TCCAGCGACT 
GTCTACACTG 
CTGCTCTCCA 
CTGGTGGAGG 
TTCCAGGACG 
AACTTTCCCC 
GTGGCCCACT 
ATCAATGCCT 
GGGGCAGTGC 
AATGGGAGCC 
CACTTTGAGA 



51 
I 

GEVPAADSPS 
RKMVELVHFL 
PISHLYILVT 
FEGREDSVFA 
LKIGGEPHIS 



51 
I 

ACGAGCTTTG 
CCTAAATCTC 
GAGATGCTAG 
CTGGCAAAGA 
CGGAAGTGCT 
CAGCCCTGTC 
AGAAACTGCC 
CCTCCGCAGC 
TCTCCTCCTC 
AGGAGGCATC 
CCGCACCCCT 
AGAGGCCAGA 
CCAGCCCCAT 
AGAGCCAGGA 
CACCAGCTCT 
ACCCACTGCT 
GCTATGGGGT 
TCCGCGGGGT 
GGCAGGTAAT 
ACGGCCTTGG 
GCCGGCCTGA 
TTCACTGCCA 
CCAAGCCGCA 
ACAATGCCAC 
AATTCTGCAT 
GGCTGTTGCT 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



269 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

GCATGACAAG GACAGGATGA CGGTTCACAG 
CGACTCCCTT CAAACCGAGA GTGTCCCTTT 
CCGCATCGAG TTCACGTCCG ACCAGGCCCG 
AGCGTTTGAG AAAGGCCACT GCTATGAGCC 
CGACCCGACC TATAACATTG GGACTATAGT 
GGAGCAGGGC CCGGCCATCA TCGAATGCAT 
AGAGCCCCTG TGCAGAGCCA TGTGTGGTGG 
GTCCCCAAAC TGGCCCGAGC CCTACGTGGA 
GGGAGAAGAG AAACGGATCT TCTTAGATAT 
CTTGACCATC TACGATGGCG ACGAGGT CAT 
CAGTGGCCCC CAGAAACTGT ACTCCTCCAC 
CCCTGCTGGC CTCATCTTTG GAAAGGGCCA 
AAGGAATGAC TCCTGCTCGG ATTTACCCGA 
CACGGAGTTG GTG CGGGGAG CCAGAATCAC 
GGGGAGTGAC ACCCTCACCT GCCAGTGGGA 
TGAGAAAATT ATGTACTGCA CCGACCCCGG 
GGATCCTGTG CTGCTGGTGG GGACCACCAT 
TGAAGGGAGT TCTCTTCTGA CCTGCTACAG 
TCGCCTGCCC CACTGCGTTT CAGAAGCGGC 
GGCCCTGGCT AT CTT CAT CC CGGTCCTCAT 
TTACATCACA AGATGTCGCT ACTATTCCAA 
CTACAGCCAG ATCACCGTGG AAACCGAGTT 
CCAAAAGGTT TAGGGTTTCA TTTAAAAAGA 
AACCCCAATT TCCCCGAGAC ATTT AT CCAA 
AAAGGCGGCT GTTTTTTGGT TAAACTTTTT 
TTTATAAATT TTAAAAGTG 



Seq ID NO: 208 Protein sequence: 
Protein Accession #: NP 066938 



CGGGCAGACC 
TGAGGGCCTG 
GGCGGCCTCC 
CTACATCCAG 
GGAGTT CACC 
CAATGTGCGG 
GGAGCTCTCT 
AGGTGAAGAT 
CCAGTTCCTG 
GCCCCACATC 
GCCAGACTTA 
GGGATTTATC 
GAT CCAGAAT 
CTACCAGTGT 
CCTCAGCTGG 
AGAGGTGGAT 
CCAATACACC 
CCGTGAAACA 
AGCAGAGACG 
CATCTCCTTA 
CCTCCGCCTG 
TGACAACCCC 
GGTACCCTTT 
AGGCCCTGGG 
AACAAAGGGT 



I 

MAQEAPQEDT 
PEGYIDSSDY 
LANQTLLVEG 
DLHSGGVAHF 
SYPENTNGSQ 
VPFEGIiLSEG 
TIVEFTCDPG 
YVEGEDCIWK 
SSTPDLTIQF 
RITYQCDPGY 
TTIQYTCNPG 
VIiIISLLLGG 



11 
I 

SPMALMDKGE 
PLLPLNNFLE 
QVIRSPTNTI 
HCHLGYELQG 
FCIWTIEAPE 
NTIRIEFTSD 
HSIiEQGPAII 
I HVGEEKR I F 
HSDPAGLIFG 
D I VGSDTLTC 
FVLEGSSLLT 
AYIYITRCRY 



21 
I 

NELiTGSASEE 
CTYNVTVYTG 
SVYFRTFQDD 
AKMLTCINAS 
GQKLHLHFER 
QARAASTFNI 
ECINVRDPYW 
LDIQFLNLSN 
KGQGFIMNYI 
QWDLSWSSDP 
CYSRETGTP I 
YSNLRLPLMY 



31 



I 

SQETTTSTII 
YGVELQVKSV 
GLGTFQLHYQ 
KPHWSSQEPI 
IiLLHDKDRMT 
RFEAFEKGHC 
NDTEPLCRAM 
SDILTIYDGD 
EVSRNDSCSD 
PFCEKIMYCT 
WTSRLPHCVS 
SHPYSQITVE 



Seq ID NO: 209 DNA sequence 

Nucleic Acid Accession #: NM_001327. 

Coding sequence: 89-631 



AG CAGGGGGC 
CTGAGAGCCG 
GACGGGCGAT 
TGGCGGCCCA 
AAGGGCCTCG 
GCTGAATGGA 
CCTCGCCATG 
GGATGCCCCA 
CATACTGACT 
CTGTCTCCAG 
GGCTCAGCCT 
GCCTCCTCCC 
GTTTGTCGCT 



11 

I 

GCTGTGTGTA 
GGCAGAGGCT 
GCTGATGGCC 
GGAGAGGCGG 
GGGCCGGGAG 
TGCTGCAGAT 
CCTTTCGCGA 
CCGCTTCCCG 
ATCCGACTGA 
CAGCTTTCCC 
CCCTCAGGGC 
CTAGGGAATG 
GGAGGAGGAC 



21 

I 

CCGAGAATAC 
CCGGAGCCAT 
CAGGAGGCCC 
GTGCCACGGG 
GAGGCGCCCC 
GCGGGGCCAG 
CACCCATGGA 
TGCCAGGGGT 
CTGCTGCAGA 
TGTTGATGTG 
AGAGGCGCTA 
GTCCCAGCAC 
GGCTTACATG 



31 
I 

GAGAATACCT 
GCAGGCCGAA 
TGGCATTCCT 
CGGCAGAGGT 
GCGGGGTCCG 
GGGGCCGGAG 
AGCAGAGCTG 
GCTTCTGAAG 
CCACCGCCAA 
GATCACGCAG 
AGCCCAGCCT 
GAGTGGCCAG 
TTTGTTTCTG 



Seq ID NO: 210 Protein sequence: 
Protein Accession #: NP_001318.1 

1 11 21 31 



Seq ID NO: 211 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 52-45 9 



CCTCGTGGGC 
GAAGGCCAGG 
CCTGATGGCC 
GGTCCCCGGG 
CCGCATGGCG 



11 

I 

CCTGACCTTC 
GCACAGGGGG 
CAGGGGGCAA 
GCGCAGGGGC 
GTGCCGCTTC 



21 
1 

TCTCTGAGAG 
TTCGACGGGC 
TGCTGGCGGC 
AGCAAGGGCC 
TGCGCAGGAT 



31 
I 

CCGGGCAGAG 
GATGCTGATG 
CCAGGAGAGG 
TCGGGGCCGA 
GGAAGGTGCC 



AACAAGTCAG 


CTCTTCTCTA 


1620 


CTGAGCGAAG 


GCAACACCAT 


1680 


ACCTTCAACA 


TCCGATTTGA 


1740 


AATGGGAACT 


TCACTACATC 


1800 


TGCGACCCCG 


GCCACTCCCT 


1860 


GACCCATACT 


GGAATGACAC 


1920 


GCTGTGGCTG 


GGGTGGTATT 


1980 


TGTATCTGGA 


AGATCCACGT 


2040 


AATCTGAGCA 


ACAGTGACAT 


2100 


TTGGGGCAGT 


ACCTTGGGAA 


2160 


ACCATCCAGT 


TCCATTCGGA 


2220 


ATGAACTACA 


TAGAGGTATC 


2280 


GGCTGGAAAA 


CCACTTCTCA 


2340 


GACCCCGGCT 


ATGACATCGT 


2400 


AGCAGCGACC 


CCCCATTTTG 


2460 


CACTCGACCC 


GCTTAATTTC 


2520 


TGCAACCCCG 


GTTTTGTGCT 


2580 


GGGACTCCCA 


TCTGGACGTC 


2640 


TCG CTGGAAG 


GGGGGAACAT 


2700 


CTGCTGGGAG 


GAGCCTACAT 


2760 


CCTCTGATGT 


ACTCCCACCC 


2820 


ATTTACGAGA 


CAGGGGGAAC 


2880 


AAAAAGGGGC 


TTGTGAACTC 


2940 


GGCCTTGATT 


TAAACCCCCA 


3000 


TACGGGTTTT 


TTCCCCGGAT 


3060 


41 


51 




I 

TTTVITTEQA 


1 

PALCSVSFSN 


60 


NLSDGELLSI 


RGVDGPTLTV 


120 


AFMLSCNFPR 


RPDSGDVTVM 


180 


CSAPCGGAVH 


NATIGRVLSP 


240 


VHSGQTNKSA 


LLYDSLQTES 


300 


YEPYIQNGNF 


TTSDPTYNIG 


360 


CGGELSAVAG 


WLSPNWPEP 


420 


EVMPHILGQY 


LGNSGPQKLY 


480 


LPEIQNGWKT 




540 


DPGEVDHSTR 


LISDPVLLVG 


600 


EAAAETSLEG 


GNMALAIFIP 


660 


TEFDNPIYET 


GGTQKV 




41 


51 




I 

CGTGGGCCCT 


j 

GACCTTCTCT 


60 


GGCCGGGGCA 


CAGGGGGTTC 


120 


GATGGCCCAG 


GGGGCAATGC 


180 


CCCCGGGGCG 


CAGGGGCAGC 


240 


CATGGCGGCG 


CGGCTTCAGG 


300 


AGCCGCCTGC 


TTGAGTTCTA 


360 


GCCCGCAGGA 


GCCTGGCCCA 


420 


GAGTTCACTG 


TGTCCGGCAA 


480 


CTGCAGCTCT 


CCATCAGCTC 


540 


TGCTTTCTGC 


CCGTGTTTTT 


600 


GGCGCCCCTT 


CCTAGGTCAT 


660 


TTCATTGTGG 


GGGCCTGATT 


720 


TAGAAAATAA 






41 


51 




I 

GGRGPRGAGA 


i 

ARASGPGGGA 


60 


EAELARRSLA 


QDAPPLPVPG 


120 


WITQCFLPVF 


LAQPPSGQRR 




41 


51 




1 

GCTCCGGAGC 


1 

CATGCAGGCC 


60 


GCCCAGGAGG 


CCCTGGCATT 


120 


CGGGTGCCAC 


GGGCGGCAGA 


180 


GAGGAGGCGC 


CCCGCGGGGT 


240 


CCTGCGGGGC 


CAGGAGGCCG 


300 



PCT/US02/12476 



270 



WO 02/086443 

GACAGCCGCC TGCTTCAGTT CCGACTGACT GCTGCAGACC ACCGCCAACT GCAGCTCTCC 360 

ATCAGCTCCT GTCTCCAGCA GCTTTCCCTG TTGATGTGGA TCACGCAGTG CTTTCTGCCC 420 

GTGTTTTTGG CTCAGGCTCC CTCAGGGCAG AGGCGCTAAG CCCAGCCTGG CGCCCCTTCC 480 

TAGGTCATGC CTCCTCCCCT AGGGAATGGT CCCAGCACGA GTGGCCAGTT CATTGTGGGG 540 

GCCTGATTGT TTGTCGCTGG AGGAGGACGG CTTACATGTT TGTTTCTGTA GAAAATAAAG 60 0 
CTGAGCTA 

Seq ID NO: 212 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I 1 1 I I I 

MQAEGQGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPRGGA 60 

PRGPHGGAAS AQDGRCPCGA RRPDSRDLQF RLTAADHRQL QLSISSCLQQ LSLLMWITQC 120 
FLPVFLAQAP SGQRR 

Seq ID NO: 213 DNA sequence 
Nucleic Acid Accession #: NM_000555 
Coding sequence: 416.. 1498 

1 11 21 31 41 51 

I I I 1 I I 

CTTATTTTTT ATGAATGTCG GATAGCTGCA CCAGCTTGGT GGGGAAAGGG TTTGATGAAT 60 

AG CACAAAG A CACTGGCTGT TCCCTGGAGG CTGTCCCTTT AAAGGAGAAT CTTAGTTTAT 120 

TCTGGGGGGA GGGGATGCAC ACATTAGAGT AGGAAAGAGG GCTTGGAATA AAATGAAAAC 18 0 

ACTCCCCCTT CATAGTCATT GTACTGAAAT GCAAAGACTG CTTCCTAAGC TGGAGATGCT 240 

AACCTTGGGT AGCTCCTTCT GTTCTCTTCA AGGGGAATTT TGTCAGGCTA TGGATTCATT 300 

TACAACTGTT AGTCATGTGG GCATGTGTGA GGAAACAGAT GCCAGTTTTA ATGTATTTAG 360 

CCCGAAGTTC CAATTTGATA GGAGCCACTG TCAGTCTCTG AGGTTCCACC AAAATATGGA 42 0 

ACTTGATTTT GGACACTTTG ACGAAAGAGA TAAGACATCC AGGAACATGC GAGGCTCCCG 480 

GATGAATGGG TTGCCTAGCC GCACT CACAG CGCCCACTGT AGCTTCTACC GAACCAGAAC 540 

CTTGCAGGCA CTGAGTAATG AGAAGAAAGC CAAGAAGGTA CGTTTCTACC GCAATGGGGA 600 

CCGCTACTTC AAGGGGATTG TGTACGCTGT GTCCTCTGAC CGTTTTCGCA GCTTTGACGC 660 

CTTGCTGGCT GACCTGACGC GATCTCTGTC TGACAACATC AACCTGCCTC AGGGAGTGCG 72 0 

TTACATTTAC ACCATTGATG GATCCAGGAA GATCGGAAGC ATGGATGAAC TGGAGGAAGG 780 

GGAAAGCTAT GTCTGTTCCT CAGACAACTT CTTTAAAAAG GTGGAGTACA CCAAGAATGT 840 

CAATCCCAAC TGGTCTGTCA ACGTAAAAAC ATCTGCCAAT ATGAAAG CCC CCCAGTCCTT 900 

GGCTAGCAGC AACAGTGCAC AGGCCAGGGA GAACAAGGAC TTTGTGCGCC CCAAGCTGGT 960 

TACCAT CAT C CGCAGTGGGG TGAAGCCTCG GAAGGCTGTG CGTGTGCTTC TGAACAAGAA 102 0 

GACAGCCCAC TCTTTTGAGC AAGTCCTCAC TGATATCACA GAAGCCATCA AACTGGAGAC 1080 

CGGGGTTGTC AAAAAACTCT ACACTCTGGA TGGAAAACAG GTAACTTGTC TCCATGATTT 1140 

CTTTGGTGAT GATGATGTGT TTATTGCCTG TGGTCCTGAA AAATTTCGCT ATGCTCAGGA 1200 

TGATTTTTCT CTGGATGAAA ATGAATGCCG AGT CATGAAG GGAAACCCAT CAGCCACAGC 12 60 

TGGCCCAAAG GCATCCCCAA CACCTCAGAA GACTT CAGCC AAGAGCCCTG GTCCTATGCG 1320 

CCGAAGCAAG TCTCCAGCTG ACTCAGCAAA CGGAACCTCC AGCAGCCAGC TCTCTACCCC 13 80 

CAAGTCTAAG CAGTCTCCCA TCTCTACGCC CACCAGTCCT GGCAGCCTCC GGAAGCACAA 1440 

GGACCTGTAC CTGCCTCTGT CCTTGGATGA CTCGGACTCG CTTGGTGATT CCATGTAAAG 150 0 

GAGGGGAGAG TGCTCAGAGT CCAGAGTACA AATCCAAGCC TATCATTGTA GTAGGGTACT 1560 

TCTGCTCAAG TGTCCAACAG GGCTATTGGT GCTTTCAAGT TTTTATTTTG TTGTTGTTGT 1620 

TATTTTGAAA AACACATTGT AATATGTTGG GTTTATTTTC CTGTGATTTC TCCTCTGGGC 1680 

CACTGATCCA CAGTTACCAA TTATGAGAGA TAGATTGATA ACCATCCTTT GGGGCAGCAT 1740 

TCCAGGGATG CAAAATGTGC TAGT CCATGA CCTTTCAATG GAAAGCTTAG GGGCCTGGGG 1800 

TAAATTTGCC CCGTTTAAAT TTGCCCAAAC AGTTTTCCTT TTGTAGAGGG GTGTTTAAAT 1860 

ATACAGCAAT TAAAAAGTTT GTGTGGGGAA AAAAAAAACT CATTGGCAGA TCCAAGAATG 192 0 

ACAAACACAA GTGCCCCTTT TCTCTGGATC TCAAGAATGG TGGAGGACCC TGGAAGGACA 1980 

GCAAGGCAGC TCCCCAGCCT CACTCTTCAC TCCTGATTGA GGCCCGGGTT TGTTGTCCAG 204 0 

CACCAATTCT GGCTGTCAAT GGGGAGAAAT AAACCAACAA CTTATAATTG TGACACCAGA 2100 

TGCTTAGGAT CCTGGTGCTG GGTTAGCTAA GAGAATAGAC AGAATTGGAA AATACTGCAG 2160 

ACATTTCCGA AGAGTTTATA AAGCACAGTG AATTCCTGGT CAATCTCTCC ACTGAGGCAA 2220 

TTTGGAATCA ATAAGCAATT GATAATAGTT TGGAGTAAGG GACTTCATAT ACCTGATTCC 2280 

TCTAGAAGGC TGTCTAACAT ACCACATGAT TACATGAACT GTATGGTATC CATCTATCTC 2340 

TGTTCTATTG AATGCCTTGT TAACAG CCAA CACTGAAAAC ACTGTGAGAA TTTGTTTTCA 240 0 

GGTCTGACAC CTTTCAGTCT CTTTTTATAG CAAGAAATCA ATATCCTTTT TATAAAAATT 2460 

CATGTCTGTA TTTCAGGAGC AAACTCTTCA GGCTCCTTTT TTATAAACTG GTGATTTTTC 2520 

TTTTGTCTAA AAAACACATG AAGAAAATTT ACCAGAAAAA AAAAAAAAAG CCGAAGAATA 2580 

ATGTTATTTA GAAATTATGC TGT CACTGCC AAACAGTAAC CTCCAGGAGA AAACAAGATG 2 640 

AATAGCAGAG GCCAATTCAA TAGAATCAGT TTTTTGATAG CTTTTTAACA GTTATGCTTG 2700 

CATTAATAAT TTCAATGTGG ACCAGACATT CTAATTATAT TTTAAATGAA ATGTTACAGC 2760 

ATATTTTAAG CAACTCTTTT TATCTATAAT CCTAATATTT CATACTGAAG ACACAGAAAT 2820 

CTTTCACTTG TCTTTAACAT TAGAAAGGAT TTCTCTTTAC TAAGGACTGA TCATTTGAAA 2880 

TAGTTTT CAG TCTTTTGAGA TACAGGTTTA TAACACTGCT TTTTTTTTCC TGTAAACATA 2 940 

GCCCATAATG GCAAAAACAA CTAATTTTAA TTGAAGGTCT TGCTTGCCAN TCCTGTGTTG 3000 

GCTTTNACCA AATATAAAAA TTCCCTTATT CCTTGGTAAT GGTGCAAATN TTTGGAAAGG 3060 

CACAGCATCC AAACCAAGCT GCTGTTTGGC TACTGAATGG CTTGCAGTTG TTCCTCCACT 3120 

CTAAATGGAA TGAGCTTGCT GTGTGTGTGT GTGGTGGTGG TGGGAGGGGG TGGTGCATGT 3180 

GTGTGTGTGT GTGTGCATCT GCAGCTGCTT CAAAATTAAG AAATACTACA AGACACCCCT 3240 

GTAATGGATT GGTGGCAACT GGGTGGCACT GCTGATGTGC ACTGTGTAGG GGGGAACCCA 3300 

GTGGTGGTGG GGTATCTCAA ATGCCCCTAG ACAAGCTTCA GATGTCTGTA GCTACCAAAA 3360 

ACATTTTCGG TTCAAGAAAA GTGAGATGAT GGTAGTACTG GTTTCTGGTG AAATTGAAAA 3420 

ACCCCAAATG ATGAGGATCT CTTTTTGCCC CCTCTCCTTT TTTTGTAAAC CCATTCAAAA 3480 

CCATTAATAA GCCCATTTTA CTAANCCCCT ATTTCTTTCT AGAAGCTCAG GGTTTNCTTA 3540 

GTGCCTCCCA NAACATTTTG TAGTTAATTG GGAAAAAGTG ATACTTGGAT TAGGGGGTGT 3600 

GGGCATAAAG AATGGTGGGA GGCCTGATTT TAAAATTCAG GCCAGAACCC CCAATGACTC 3660 

CACCCATAGT NTCACTTTAG GTCTCATTTA GTCCATCACC TTTATTTTAA GTTGAGGAAG 372 0 

TGGAGGCTGG TAAAGAGCAG GACCAGAGGA AGAATCCAGA TTTCCTTATG CTTGGGCCTC 37 80 

ACACTAGCTC TNTGAGTATT TCCTTGATTG CGGTATATGT ACTACTAGAA AATACCAAAT 3840 

GGATATATTT T CTTTAGG AT AACCTTTGAA CCAACAATNT TCAATAACAA TAGTACATCT 3 900 
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TCCATCTTAC TTTTAATCGA GTATAAGGAA 
AGGGGATGAG GCTTGGCATA GTCCAAAATT 
TTGTTTTAAA TTGGCCCACT TTCAAGGCAA 
CCACCCCTGT CATTCACTTC CAATTTTACC 
5 TGTTAATTTT TGCACGGTCT ACACACATCA 

ACTTCTCCCT CTTTTTTACA CACACACACA 
TTCCTACCTC CCTGATTTTT CTTCCCTACA 
ATGTATATAT TGGGGCTGGG CTGAACAACT 
AT TG AGAGAA AAGCTCCTTT TCTCTTCACT 

10 GCTTTGTGTC CTTATGGACT TTAGTATTAG 

GAAGGAACCT TAAGATCACA TCATCTACTC 
AGGAAACCGA GACACAGAGG TAAAGTAATT 
GATTGGGTTT ACAACCCACA TCTCCTGGCT 
TATTGCCTTC CATTAGGCTC CTGAGAGTTA 

15 ACATGCTGCT GCCCTGATCT CAGTGGGAAA 

CCCTGCATTC ACCTGGTTCC CAT CC ACATG 
ATTGAGGGCC AATAGGAGCA ATGGGGTCCC 
TGCTCCATCG TGAGGAGCCC TCTGAATAGC 
GAATGGAGGA AGATTGATTT TCTCCATCAG 

20 CTTTCCAGGC TGAGGGAAAT GTTTCTTGTT 

ATANCTTTGT TCATCCTAAC TTTCTGAGAT 
GGTACCATTC ACTGGCANGA TTTNTTTTAG 
AACCTGGTCT CACTGTGGTT GCCCTCATCC 
GAGGACAATT TCCAGGTATA AGCAAGGGGC 

25 TAAACATTGG CTCCTGTGTT TGCACCAAAA 

CATCGTCTTG TGTACACTGC TCCTGTGGCC 
C AAACAC AT G GTTTTCCTTG CTGCAAGGCT 
TTCAGTTNTA AGAGACCTCC TTCTGGGCTT 
CCTCCTTCTC CTCCACAGTC ACAAGTAACC 

30 TGAAGAAGGC AAGGAACCCT GAGATTCTTC 
GTGATTGGTG CTTACCTTGA ACAAAATTTT 
GGTACAATGC TCCCAATCAC CCTGCACATT 
TCCATATCCC TAGGACAAGA NAACAGGATG 
TGATGGGAAT GATCCCAANG ATCACCCCAC 

35 CCAGATAGAA NCACTGGGAC AGTGGTTTGA 

ATGGAAATAA AAGGCATTGA TTTTTTAAAA 
AGGGCCACTT GGATCCATTT CCAGGCCTTA 
GCTTTAAGTC CCAGACTGGT CTCCCAAGTG 
TGAGGCATGA GAATGTTGCC CCATCTATCC 

40 TCCTAAAGCC TGGTCCCCAA AAATTGTTTT 

CACCCANACT CTTAGTGTTG CGTCCTGCCT 
CCCGCTTTGG CTTAGCTAGC GTGACATTGG 
TTTTTTTTTG ACTGAGTCTC CCTCTGTCAC 
CTCGCTGCAA CCTTCACCCT TCACCTCCCA 

45 CGAGTAGCTG GGATTACAGG CGTGCGCCAC 

TTT TAGTAGA GATGGGGTTT CACCATGTTG 
ATTATCTGCC CACCTCGGCC TCCCAAAGTG 
GCTGACAAGA CTAATTTTTT ATCCCTTGGT 
GGTGATTTTT TCTTACCTTG GATGCCTGAG 

50 TTAAGGCATC TTTCTGCTCC TGATCAGAAG 

CAACAGAAGT CACCTTGTAA GTAAGGCAAA 
TTAGGTCAAT AACCTTGAGG GAATCAATGG 
TCTTTGACTT TTCTTTCTCT GTCTAGTTTC 
AGTCTCTCTT TCCACAGTAC AAACAT CCAT 

55 CTTATTATCT TTATTTGTAC TTTTTCCTTC 

TTCTTAGCCT GTGATTTTGC CTTGGGACTG 
CTGGTCCTAC CCCAGTCCAA TCAGAAGTAT 
TTTCTTCTTC T CCAT TTT CA TTCGTAATCC 
TATAGCTCAT GTATCTTTAG GTCTTTGCCT 

60 CCTTTTTAGT CTGACATTTT GTGGAGCAGT 

GAGAAAAAAT CCACCCATGG ATTTAT AT CA 
GTG CCCATAA TTTTTAAAGC TGCAATATAA 
GTCATTTGTT TTGGCTGGAT GGGGGTGGGG 
ATTAAACTCT CTATAATAAT CTTGTTTGGG 

65 ACTGGTAGGC AATCGGAGTT GATTTAAATG 

AAAGAGACAT TTGCTTAATT GACATGTATT 
CTTGACACCA ACTGTTCATG ATACTGAATA 
TAAAGAAGCC AGATTGTAGG TGTTAATTTA 
TCACTGCTTG GCAATACCAT ATGGCATGCC 

70 ATCCAAAAGG GATTTGAACA AGTAAGAGGT 

GTAATATATT GCAGCTTGAA GCCAATGATC 
TTTATTGAAT TTTGCATTTC CCACGTGTGG 
NTGTGCCATT AAACTTGTAC AGAAAATGTT 
AAAATGGAAA CAGCCCACCC TTTCTGCCCT 

75 CAAAACAGCT GTAATTGGTG GTTGTAGTGT 

TGGAGAGTAA ATGCATGGTA TTGTACATCA 
GAATATATTC TTCTTTGTAG TCCTTCTTCC 
CCAGTTGTCT TACAGTTGTA AATATCTGAT 
TCAGCAAACA ACAAACAAAC CAAAATGTGG 

80 AGTTATTGAT CATTTCTTAA GGAACAGCAT 

TCAGTGGTAA ATTGGGGTTG TATTGGCCAT 
AATCACATGT AATCCAAAGA CAGTAGGTAG 
GATAGAGACC TCAGAAGACT CTGCTTGACC 
AAAAATGAGA GAAATAAAAC AGATATTTAA 

85 AG CCAGAAAA AAAAACAAGG GCATGAGTTC 

ACCTAACCTA CTCTGAAATT GTGATT CAAA 
TGGTTTGCTG ACCCCACTTG GACTGGTAGG 



ATGTTTCTTT ATGGCCATTT TGGAGGGAGC 3960 

TAAGNCT CC A ATAATTAATT GCATTTTAAA 4020 

TTTTTTTTGT GTGTCTGTAA CTGAGCTCCT 4080 

CAATCCAATT TTAGCACTCA AGTTCCATTG 4140 

AGTCAGCAAG CATTTGCCAC CACTCCCTAT 42 00 

CACACACACA CACAAT C CAT CTCTTGCTTG 42 60 

GAAATAGAAA TAGGGACAAA GAAGGGGAAA 432 0 

AACTTCATAA GTAGTATTAA CTAGGGGTAA 4380 

GTTTTGGAAA GGATAGCCAT TAGCATGACT 4 440 

CCTAGATTGA ATTATAGCGT TTTTCTAGCT 4500 

CTCTACTCCA AATTTCTCAT TCTTCAGGCC 4560 

TCCCCAAGGT CACACAGCTG GCTGGGGCAG 4620 

CTTATTCCAG GGCCTTTTCC CACTAAGTAG 4680 

TTTCTCAGGG TCATGTTGCA TCTTGGAGCC 4740 

TNCACCCAGC AACCTAATAC AGCCCCTTTT 4800 

GGTTGCAGAT GTCCTTGAAG AGAGTGAGGC 4860 

TGGCCTTGTC CATCTGATTC AGGAGATCAC 4 920 

CCCCCACTGA ATGCTTGCCT TGCCCAAATG 4 980 

TTCACCTTGT GTCATCTCAT AATGGTTGGT 5040 

TCCANAGTAN AAAAAAGAAA GAGTGGAACA 5100 

GGCTTTTCAA CATTTAAAAA AAACTAGTGT 5160 

AATATGGGAG TAAGATGAGG TAGAGAAAAT 522 0 

ACAATGTCCC CAAAGCCATC CTGCTNTGAT 5280 

TTTGTGACAA AAATGTACCC TGGCTGATGT 5340 

TAGCAAGCTG TGTGCTCTAT ACACTCTTCC 540 0 

TTCCACAGCA GAAACCAGGG CAAAAGGGTC 5460 

NTTCCTGGGA ACTAAGGGGG TATTTATTAG 5520 

ACCCCACTCC TCAGGTACTT CTCTCTCCTT 5580 

AAGGAACCTG AAAGTGGATG TGTAGCTATT 5640 

TTTGAATCCT TTAGTCCAAG TCTTAGACCA 570 0 

GTCTGTGTTC CTAATCCCTT CAATACTNTG 5760 

TGATTCTAAA TGGCTTTTAT TTTTTAAAAA 5820 

CCTATATCCC CAAAATGAGC TCCAGGACAC 5880 

CTCAGAAAAC GTCTGTGCCA ANAGACTT CC 5940 

ACGACTTCTT TTATGGTTGT CCAGTTTGCT 600 0 

AAGATGATTG GAACCTGTCT TTGGCCACAT 6060 

CTCATATATT GCCTTCACTG AAGGGCTTTG 6120 

AACCATAAGT GTTTTGGAGC TCATCTGGGG 6180 

CTTCAGGAAA AGGTGCCTTC CCTCCCTTTC 6240 

TGTCTCCAAA AGTCTAGTAT GGTCTTTATA 63 00 

TGTTTCCTTG TTAAGGAT CT ATGCANACCT 63 60 

CTATCATTTG ACAAGACTAA CTTTTTTTTT 6420 

CTAGGCTGGA GTGCAGTGGC ACAATCTTGG 6480 

GGTCGAAGCG ATTCTCCTGC CTCAGTCTCC 6540 

CAAATCTGGC TATTTTTTTA TTATTATTAT 6600 

GCCAGACTGG TCTTGAACTC TTGGCCTCAA 6660 

CTGGGATTAC AGGCATGAGC ACCATGCCCA 672 0 

TTATTGGCTT CAACATCTTC TGGAAT CAGA 6780 

ACTAGGGGAG TATAGAATTC CAATTGGTAA 6840 

GGCAGGTTAG TTGGGAGAGG TCAGATGGCA 690 0 

GACTTTGAAG GCATTAGCGT TTCTCATTAC 6 960 

CTTTTTTGCC GCTCTACCTC TTTGTGTATC 7 02 0 

CTCTGTTCTC AGTTTATATT CTATGTTATC 7080 

CCTTTCTCCT GTGCAATTCT GTCTCTCCCT 7140 

CTCCCTGTCT AGGCATTGGG CATGTGCCTC 72 00 

ATGATAAATT ATTTCCAGAT TCAATCAGCC 7260 

GTTGGTGGGG AATCAACCTG ATCCTGGCCC 732 0 

CCCTCAGCAG ATCTTTACAA GCAGTTTCCT 73 80 

TCCAAGCACT GTACAGAATA CTTTGTGGTT 7440 

GAAGCGTGCT CAGAGACATA ATCAGCTGAA 7500 

GCTAAATACT AATAATTGAT TTTGTTTGAT 7560 

TATAATGAGG GACCACAGGT AATTTCTCCT 762 0 

GAGTAATTGC TTAAAGTTTT ACCATTACAC 7 680 

GCTTGCTAAC TGTTGAGCTG TTTTAACTAA 7740 

AAAAGATAAT TTAACAAATC TATACTATAA 780 0 

TTTTCCTTCT GAGTCACCTA AACATTTACT 7 860 

GACAGTCCAT ATAAGAGAAA TTAGTGGACC 7 92 0 

TTAAACAGAA TTGCAAAGCC CTTGGAAATG 7 980 

AAAATTTACA ATGACTTTTC TTTATAAGTT 8 040 

TATGCCAAAA TGTCTCCAAT GTATGGTCCT 810 0 

CCTTATGACT TGTATACAAC TAATGCATGT 8160 

TAAGTCTTTA AAATGTTTTT GATCACCTTT 822 0 

TTTATGGCCA TTTTCAAAGG GAGAAAGTTT 8280 

ATAGCTGTAG TTAGAATTGA GTACCTGTAG 8340 

TAGAGGTGTT AGCTTGCTAG TGACTAGCTT 840 0 

CATTTCTTAA CTCGTTTTAA CCTCTGAAAA 8460 

CACCCCCTTG CCCTCTCCCT CTCCCTGCTC 852 0 

TTGAGGCCCA ATAACTCTTG CCAAGTAAAG 8580 

GGAAAAGGCA TTTCTCAACC ATCTCTCAGC 8640 

TGTGATCAAA GACTCAACTT TACGTAAAAA 870 0 

TGATTACATT CAGGATTGAA TAG TTTTCAG 8760 

TGATGT CCCT TATCCCTGCA GCTGTTTTAA 882 0 

GATGACCAAT AATTATTTGA AAAAAAAAGA 8880 

GAACTTTAGC CACCTATTTA GAATAGTTAT 8940 

AAATGCATTA CTAT CAGTGT CCTAGGCAAT 9000 

AGCAGTATTT CAAGAGGCAT TCTCCTTTTT 9060 

TTTGGTGAGG CCCCCATAAA CCAGCTGGAG 912 0 
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CAGACCCTTT TCATCTCCTG TGCCTGTAAC 
AATGAGGGCT TTCTTGGGTC AGAGGACTTC 
TAAGGTGCTG TGAACTGTGA GTGCTGAAGA 
AGCTGCTCTT GCAATTATTT TGGCTCTCAA 
TGTACATTTG CAAGATGTGT GTAATGTCAT 



Seq ID NO: 214 Protein sequence: 
Protein Accession #: NP_000546 



ACCCCTCTTC CCCCACCCCC TCCGCAATT C 
AAGGTTGTCT AGAGAAGTTT GCCATGTGTG 
TTCGCAGCAT TCAATACCAG GCAGCCAAAG 
GCTCTGTTCT TCATCGCATT CTCATTTCTG 
TTTCCAAAAA TAAAATTTGA TTTCAAT 



1 11 21 



MELDFGHFDE 




GDRYFKGIVY AVSSDRFRSF DALLADLTRS 
EGESYVCSSD NFFKKVEYTK NVNPNWSVNV 
LVTIIRSGVK PRKAVRVLLN KKTAHSFEQV 
DFFGDDDVFI ACGPEKFRYA QDDFSLDENE 
MRRSKSPADS ANGTSSSQLS TPKSKQSPIS 



31 41 51 

I I I 

HSAHCSFYRT RTLQALSNEK KAKKVRFYRN 
LSDNINLPQG VRYIYTIDGS RKI GSMDELE 
KTSANMKAPQ SLASSNSAQA RENKDFVRPK 
LTD I TEA I KL ETGWKKLYT LDGKQVTCLH 
CRVMKGNP S A TAGPKASPTP QKT SAKS PGP 
TPTSPGSLRK HKDLYLPLSL DDSDSLGDSM 



Seq ID NO: 215 DNA sequence 
Nucleic Acid Accession #: NM__130467 
Coding sequence: 312.. 644 

1 11 21 31 41 51 

I I I I I I 

GGCACGAGGC AGAGCTCTGC AAGGAGAGGT TGTGTCTTCG TTCTTTCCGC CATCTTCGTT 
CTTTCCAACA TCTTCGTTCT TT CTCACTGA CCGAGACTCA GCCGGTAGGT CTGCAGAGTG 
GTCTTCCTGG TAATTTAGTT GTGAGTGAAT GTGTGGAGGA GCCAGCGGGC TTAGGACAGG 
TCCTGTGGCA CAGTCCGTGG CTTTGAGGGA AAAGGGCCTC GCGGTGGTCC TCCGCCTTCC 
CCCAGGTCGT GATGCAGGCG CCATGGGCCG GTAATCGTGG CTGGGCTGGA ACGAGGGAGG 
AAGTGAGAGA TATGAGTGAG CATGTAACAA GATCCCAATC CTCAGAAAGA GGAAATGACC 
AAGAGTCTTC CCAGCCAGTT GGACCTGTGA TTGTCCAGCA GCCCACTGAG GAAAAACGTC 
AAGAAGAGGA ACCACCAACT GATAATCAGG GTATTGCACC TAGTGGGGAG AT CAAAAATG 
AAGGAGCACC TGCTGTTCAA GGGACTGATG TGGAAGCTTT TCAACAGGAA CTGGCTCTGC 
TTAAGATAGA GGATGCACCT GGAGATGGTC CTGATGTCAG GGAGGGGACT CTGCCCACTT 
TTGATCCCAC TAAAGTGCTG GAAGCAGGTG AAGGGCAACT ATAGGTTTAA ACCAAGACAA 
ATGAAGACTG AAACCAAGAA TATTGTTCTT ATGCTGGAAA TTTGACTGCT AACATTCTCT 
TAATAAAGTT TTACAGTTTT CTGCAAAAAA AAAAAAAAAA AAA 



Seq ID NO«. 216 Protein sequence: 
Protein Accession #: NP_569734 

1 11 21 31 41 51 

i I 1 I I I 

MSEHVTRSQS SERGNDQESS QPVGPVIVQQ PTEEKRQEEE PPTDNQGIAP SGEIKNEGAP 
AVQGTDVEAF QQELALLKIE DAPGDGPDVR EGTIjPTFDPT KVLEAGEGQL 

Seq ID NO: 217 DNA sequence 

Nucleic Acid Accession #: NM_0 01476.1 

Coding sequence: 82.. 435 

1 11 21 31 41 51 

I I 1 I I I 

GCCAGGGAGC TGTGAGG CAG TGCTGTGTGG TTCCTGCCGT CCGGACTCTT TTTCCTCTAC 
TGAGATTCAT CTGTGTGAAA TATGAGTTGG CGAGGAAGAT CGACCTATTA TTGGCCTAGA 
CCAAGGCGCT ATGTACAGCC TCCTGAAGTG ATTGGGCCTA TGCGGCCCGA GCAGTTCAGT 
GATGAAGTGG AACCAGCAAC ACCTGAAGAA GGGGAACCAG CAACTCAACG TCAGGATCCT 
GCAGCTGCTC AGGAGGGAGA GGATGAGGGA GCATCTGCAG GTCAAGGGCC GAAGCCTGAA 
GCTGATAGCC AGGAACAGGG TCACCCACAG ACTGGGTGTG AGTGTGAAGA TGGTCCTGAT 
GGGCAGGAGG TGGACCCGCC AAATCCAGAG GAGGTGAAAA CGCCTGAAGA AGGTGAAAAG 
CAATCACAGT GTTAAAAGAA GACACGTTGA AATGATGCAG GCTGCTCCTA TGT TGGAAAT 
TTGTTCATTA AAATTCTCCC AATAAAGCTT TACAGCCTTC TGCAAAA 

Seq ID NO: 218 Protein sequence: 
Protein Accession #: NP_001467.1 

1 11 21 31 41 51 

l I I I l l 

MSWRGRSTYY WPRPRRYVQP PEVIGPMRPE QFSDEVEPAT PEEGEPATQR QDPAAAQEGE 
DEGASAGQGP KPEADSQEQG HPQTGCECED GPDGQEVDPP NPEEVKTPEE GEKQSQC 

Seq ID NO: 219 DNA sequence 
Nucleic Acid Accession #: NM_001476 
Coding sequence: 90-3671 

1 11 21 31 41 51 

I 1 ! I I I 

ACAGCGGAGC GCAGAGTGAG AACCACCAAC CGAGGCGCCG GGCAGCGACC CCTGCAGCGG 
AGACAGAGAC TGAGCGGCCC GGCACCGCCA TGCCTGCGCT CTGGCTGGGC TGCTGCCTCT 
GCTTCTCGCT CCTCCTGCCC GCAGCCCGGG CCACCTCCAG GAGGGAAGTC TGTGATTGCA 
ATGGGAAGTC CAGGCAGTGT ATCTTTGATC GGGAACTTCA CAGACAAACT GGTAATGGAT 
TCCGCTGCCT CAACTGCAAT GACAACACTG ATGGCATTCA CTGCGAGAAG TGCAAGAATG 
GCTTTTACCG GCACAGAGAA AGGGACCGCT GTTTGCCCTG CAATTGTAAC TCCAAAGGTT 
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CTCTTAGTGC TCGATGTGAC AACTCTGGAC GGTGCAGCTG TAAACCAGGT GTGACAGGAG 420 
CCAGATGCGA CCGATGTCTG CCAGGCTTCC ACATGCTCAC GGATGCGGGG TGCACCCAAG 480 
ACCAGAGACT GCTAGACTCC AAGTGTGACT GTGACCCAGC TGGCATCGCA GGGCCCTGTG 540 
ACGCGGGCCG CTGTGTCTGC AAGCCAGCTG TTACTGGAGA ACGCTGTGAT AGGTGTCGAT 600 
CAGGTTACTA TAATCTGGAT GGGGGGAACC CTGAGGGCTG TACCCAGTGT TTCTGCTATG 660 
GGCATTCAGC CAGCTGCCGC AGCTCTGCAG AATACAGTGT CCATAAGATC ACCTCTACCT 720 
TTCATCAAGA TGTTGATGGC TGGAAGGCTG TCCAACGAAA TGGGTCTCCT GCAAAGCTCC 780 
AATGGT CACA GCGCCATCAA GATGTGTTTA GCTCAGCCCA ACGACTAGAC CCTGT CTATT 840 
TTGTGGCTCC TGCCAAATTT CTTGGGAATC AACAGGTGAG CTATGGGCAA AGCCTGTCCT 900 
TTGACTACCG TGTGGACAGA GGAGGCAGAC ACCCATCTGC CCATGATGTG ATTCTGGAAG 960 

GTGCTGGTCT ACGGATCACA GCTCCCTTGA TGCCACTTGG CAAGACACTG CCTTGTGGGC 102 0 

TCACCAAGAC TTACACATTC AGGTTAAATG AGCATCCAAG CAATAATTGG AGCCCCCAGC 10 80 

TGAGTTACTT TGAGT AT CGA AGGTTACTGC GGAATCTCAC AGCCCTCCGC ATCCGAGCTA 1140 

CATATGGAGA ATACAGTACT GGGTACATTG ACAATGTGAC CCTGATTTCA GCCCGCCCTG 1200 

TCTCTGGAGC CCCAGCACCC TGGGTTGAAC AGTGTATATG TCCTGTTGGG TACAAGGGGC 12 60 

AATTCTGCCA GGATTGTGCT TCTGGCTACA AGAGAGATTC AGCGAGACTG GGGCCTTTTG 132 0 

GCACCTGTAT TCCTTGTAAC TGTCAAGGGG GAGGGGCCTG TGAT CCAGAC ACAGGAGATT 1380 

GTTATTCAGG GGATGAGAAT CCTGACATTG AGTGTGCTGA CTGCCCAATT GGTTTCTACA 1440 

ACGATCCGCA CGACCCCCGC AGCTGCAAGC CATGTCCCTG T CAT AACGGG TTCAGCTGCT 1500 

CAGTGATGCC GGAGACGGAG GAGGTGGTGT GCAATAACTG CCCTCCCGGG GTCACCGGTG 1560 

CCCGCTGTGA GCTCTGTGCT GATGGCTACT TTGGGGACCC CTTTGGTGAA CATGGCCCAG 162 0 

TGAGGCCTTG TCAGCCCTGT CAATGCAACA ACAATGTGGA CCCCAGTGCC TCTGGGAATT 1680 

GTGACCGGCT GACAGGCAGG TGTTTGAAGT GTATCCACAA CACAGCCGGC ATCTACTGCG 1740 

ACCAGTG CAA AGCAGGCTAC TTCGGGGACC CATTGGCTCC CAACCCAGCA GACAAGTGTC 1800 

GAGCTTGCAA CTGTAACCCC ATGGGCTCAG AGCCTGTAGG ATGTCGAAGT GATGGCACCT 1860 

GTGTTTGCAA GCCAGGATTT GGTGGCCCCA ACTGTGAGCA TGGAGCATTC AGCTGTCCAG 192 0 

CTTGCTATAA TCAAGTGAAG ATTCAGATGG AT C AGTTT AT GCAGCAGCTT CAGAGAATGG 1980 

AGGCCCTGAT TTCAAAGGCT CAGGGTGGTG ATGGAGTAGT ACCTGATACA GAGCTGGAAG 2040 

GCAGGATGCA GCAGGCTGAG CAGGCCCTTC AGGACATTCT GAGAGATGCC CAGATTT CAG 210 0 

AAGGTGCTAG CAGATCCCTT GGTCTCCAGT TGGCCAAGGT GAGGAGCCAA GAGAACAGCT 2160 

ACCAGAGCCG CCTGGATGAC CTCAAGATGA CTGTGGAAAG AGTT CGGGCT CTGGGAAGTC 222 0 

AGTACCAGAA CCGAGTTCGG GATACTCACA GGCTCATCAC T CAGATGC AG CTGAGCCTGG 22 80 

CAGAAAGTGA AGCTTCCTTG GGAAACACTA ACATTCCTGC CTCAGACCAC TACGTGGGGC 2340 

CAAATGGCTT TAAAAGTCTG GCT CAGGAGG CCACAAGATT AGCAGAAAGC CACGTTGAGT 2400 

CAGCCAGTAA CATGGAGCAA CTGACAAGGG AAACTGAGGA CTATTCCAAA CAAGCCCTCT 2460 

CACTGGTGCG CAAGGCCCTG CATGAAGGAG TCGGAAGCGG AAGCGGTAGC CCGGACGGTG 2520 

CTGTGGTGCA AGGGCTTGTG GAAAAATTGG AGAAAACCAA GTCCCTGGCC CAGCAGTTGA 25 80 

CAAGGGAGGC CACTCAAGCG GAAATTGAAG CAGATAGGTC TTATCAGCAC AGTCTCCGCC 2 640 

TCCTGGATTC AGTGTCTCGG CTTCAGGGAG TCAGTGATCA GTCCTTTCAG GTGGAAGAAG 2700 

CAAAGAGGAT CAAACAAAAA GCGGATTCAC TCTCAACGCT GGTAACCAGG CATATGGATG 2760 

AGTTCAAGCG TACACAAAAG AATCTGGGAA ACTGGAAAGA AGAAGCACAG CAGCTCTTAC 2 820 

AGAATGGAAA AAGTGGGAGA GAGAAATCAG ATCAGCTGCT TTCCCGTGCC AATCTTGCTA 2 880 

AAAGCAGAGC ACAAGAAGCA CTGAGTATGG GCAATGCCAC TTTTTATGAA GTTGAGAGCA 2 940 

TCCTTAAAAA CCTCAGAGAG TTTGACCTGC AGGTGGACAA CAGAAAAGCA GAAGCTGAAG 3000 

AAGCCATGAA GAGACT CTCC TACATCAGCC AGAAGGTTTC AGATGCCAGT GACAAGACCC 3 0 60 

AGCAAGCAGA AAGAGCCCTG GGGAGCGCTG CTGCTGATGC ACAGAGGGCA AAGAATGGGG 312 0 

CCGGGGAGGC CCTGGAAATC TCCAGTGAGA TTGAACAGGA GATTGGGAGT CTGAACTTGG 3180 

AAGCCAATGT GACAGCAGAT GGAGCCTTGG CCATGGAAAA GGGACTGGCC TCTCTGAAGA 3240 

GTGAGATGAG GGAAGTGGAA GGAGAGCTGG AAAGGAAGGA GCTGGAGTTT GACACGAATA 3300 

TGGATGCAGT ACAGATGGTG ATTACAGAAG CCCAGAAGGT TGATACCAGA GCCAAGAACG 3 3 60 

CTGGGGTTAC AATCCAAGAC ACACTCAACA CATTAGACGG CCTCCTGCAT CTGATGGACC 342 0 

AGCCTCTCAG TGTAGATGAA GAGGGGCTGG TCTTACTGGA GCAGAAGCTT TCCCGAGCCA 3480 

AGACCCAGAT CAACAGCCAA CTGCGGCCCA TGATGTCAGA GCTGGAAGAG AGGGCACGTC 3540 

AGCAGAGGGG CCACCTCCAT TTGCTGGAGA CAAGCATAGA TGGGATTCTG GCTGATGTGA 3 60 0 

AGAACT TGGA GAACATTAGG GACAACCTGC CCCCAGGCTG CTACAATACC CAGGCTCTTG 3660 

AG C AACAGTG AAGCTGCCAT AAATATTTCT CAACTGAGGT TCTTGGGATA CAGATCTCAG 3 72 0 

GGCTCGGGAG CCATGTCATG TGAGTGGGTG GGATGGGGAC ATTTGAACAT GTTTAATGGG 3780 

TATGCTCAGG TCAACTGACC TGACCCCATT CCTGATCCCA TGGCCAGGTG GTTGTCTTAT 3 84 0 

TGCACCATAC TCCTTGCTTC CTGATGCTGG GCAATGAGGC AGATAGCACT GGGTGTGAGA 3 900 

ATGATCAAGG ATCTGGACCC CAAAGAATAG ACTGGATGGA AAGACAAACT GCACAGGCAG 3 960 

ATGTTTGCCT CATAATAGTC GTAAGTGGAG TCCTGGAATT TGGACAAGTG CTGTTGGGAT 4020 

ATAGTCAACT TATTCTTTGA GTAATGTGAC TAAAGGAAAA AACTTTGACT TTGCCCAGGC 40 80 

ATGAAAT TCT TCCTAATGTC AGAACAGAGT GCAACCCAGT CACACTGTGG CCAGTAAAAT 4140 

ACTATTGCCT CATATTGTCC TCTGCAAGCT TCTTGCTGAT CAGAGTTCCT CCTACTTACA 4200 

ACCCAGGGTG TGAACATGTT CTCCATTTTC AAGCTGGAAG AAGTGAGCAG TGTTGGAGTG 42 60 

AGGACCTGTA AGGCAGGCCC ATTCAGAGCT ATGGTGCTTG CTGGTGCCTG CCACCTTCAA 4320 

GTTCTGGACC TGGGCATGAC ATCCTTTCTT TTAATGATGC CATGGCAACT TAGAGATTGC 43 80 

ATTTTTATTA AAGCATTTCC TACCAGCAAA GCAAATGTTG GGAAAGTATT TACTTTTTCG 4440 

GTTTCAAAGT GATAGAAAAG TGTGGCTTGG GCATTGAAAG AGGTAAAATT CTCTAGATTT 4500 

ATTAGTCCTA ATTCAATCCT ACTTTTCGAA CACCAAAAAT GATGCGCATC AATGTATTTT 4560 

ATCTTATTTT CTCAATCTCC TCTCTCTTTC CTCCACCCAT AATAAGAGAA TGTTCCTACT 4620 

CACACTT CAG CTGGGTCACA TCCATCCCTC CATTCATCCT TCCATCCATC TTTCCATCCA 4680 

TTACCTCCAT CCATCCTTCC AACATATATT TATTGAGTAC CTACTGTGTG CCAGGGGCTG 4740 

GTGGGACAGT GGTGACATAG TCTCTGCCCT CATAGAGTTG ATTGTCTAGT GAGGAAGACA 480 0 

AGCATTTTTA AAAAATAAAT TTAAACTTAC AAACTTTGTT TGTCACAAGT GGTGTTTATT 4860 

GCAATAACCG CTTGGTTTG C AACCTCTTTG CTCAACAGAA CATATGTTGC AAGACCCTCC 4 920 

CATGGGGGCA CTTGAGTTTT GGCAAGGCTG ACAGAGCTCT GGGTTGTGCA CATTTCTTTG 4980 

CATTCCAGCT GT CACTCTGT GCCTTTCTAC AACTGATTGC AACAGACTGT TGAGTTATGA 5040 

TAACACCAGT GGGAATTGCT GGAGGAACCA GAGGCACTTC CACCTTGGCT GGGAAGACTA 5100 

TGGTGCTGCC TTGCTTCTGT ATTTCCTTGG ATTTTCCTGA AAGTGTTTTT AAATAAAGAA 5160 
CAATTGTTAG ATGCC 

Seq ID NO: 220 Protein sequence: 
Protein Accession #:NP_005553 

1 11 21 31 41 51 



274 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 



WO 02/086443 

MPALWLGCC& cfslllpaar atsrrevcdc ngksrqcifd 

DGIHCEKCKN GFYRHRERDR CLPCNCNSKG SLSARCDNSG 
HMLTDAGCTQ DQRLLDSKCD CDPAGIAGPC DAGRCVCKPA 
PEGCTQCFCY GHSASCRSSA EYSVHKITST FHQDVDGWKA 
SSAQRLDPVY FVAPAKFLGN QQVSYGQSLrS FDYRVDRGGR 
MPLGKTLP CG LTKTYTFRLN EHPSNNWSPQ LSYFEYRRLL 
DNVTL I SARP VSGAPAPWVE QCICPVGYKG QFCQDCASGY 
GGACDPDTGD CYSGDENPDI ECADCPIGFY NDPHDPRSCK 
CNNCPPGVTG ARCELCADGY FGDPFGEHGP VRPCQPCQCN 
CIHNTAGIYC DQCKAGYFGD PLAPNPADKC RACNCNPMGS 
NCEHGAFSCP ACYNQVKIQM DQFMQQLQRM EALISKAQGG 
QDILRDAQIS EGASRSLGLQ LAKVRSQENS YQSRLDDLKM 
RLITQMQLSL AESEASLGNT NIPASDHYVG PNGFKSLAQE 
ETEDYSKQAIj SLVRKALHEG VGSGSGSPDG AWOGLVEKL 
ADRSYQHSLR LLDSVSRLQG VSDQSFQVEE AKRIKQKADS 
NWKEEAQQLIj QNGKSGREKS DQLLSRANLA KSRAQEALSM 
QVDNRKAEAE EAMKRLSYIS QKVSDASDKT QQAERALGSA 
IEQEIGSLNL EANVTADGAL AMEKGLASLK SEMREVEGEL 
AQKVDTRAKN AGVTIQDTLN TLDGLLHLMD QPLSVDEEGIi 
MMSELEERAR QQRGHLHLLE TSIDGILADV KNLENIRDNL 

Seq ID NO: 221 DNA sequence 
Nucleic Acid Accession #: NM_016529 
Coding sequence: 13-1854 



GTCAAGAAAA 
AAAGGGGCTG 
ACATTATGCC 
GCTGATCTCT 
ATATTGAAGG 
CTGCTACTTG 
GCAACACTGT 
GCGATTAATA 
AAGGAGGACT 
AATTTGCTGG 
GCGCTCTCCT 
ATATGCTGCA 
GTGAAGGCCA 
GCCCACGTGG 
TACGCCATCG 
TACAACCGGG 
ATTGAGCTTT 
TGCATCGGCC 
GAGAGGTCTT 
AATGGCGAAG 
TCCCTCATCC 
GGTCATGCTA 
GTTTGTCTGA 
TGGGGAAGCA 
ATTCCCATTG 
TGGTTGGGAT 
GCCAAGCACA 
CGAGTCCTGG 
CGCCTG AT C A 
CAGCAGGGCG 
GAAGAAGTCA 
AATTTTCCTG 
TTTGTCAGAG 
AGTTAAGCAG 
AGCTATCTTT 
ATGAAGCAT T 
AGAAAAAAAA 



11 
1 

GAATGTCTGT 
ATAATGTGAT 
ATCTGGAATA 
CTGAGAATGA 
ACAGAGCTCA 
GAGCCACAGC 
TGAAGGCAGA 
TAGGGTATTC 
CTTTGGATGC 
GCAAGGAAAA 
TCGAAGTCCG 
GAGTGTCTCC 
TCACCCTCGC 
GTGTGGGAAT 
CACAGTTTTC 
TGACCAAGTG 
GGTTCGCCTT 
TGTACAATGT 
GCACTCAGGA 
GCTTCAACAC 
TCTTCTGGTT 
CCGACTATTT 
AAGCTGGTTT 
TGCTGACCTG 
CTCCAGATAT 
TATTTCTGGT 
CCTGCAAAAA 
GAAAAGCGGT 
AGAGGCTGGG 
TCCCGCATGG 
TCCGTGCTTA 
ACTGATCTTA 
AAGACTGGCG 
TTTGTTAGTT 
GCCCTCCCAA 
CAACTGTGCT 
AAAAAAA 



21 
1 

AATTGTTCGA 
TTTTGAGAGA 
CTTTGCCACG 
GTATGAGGAG 
ACGGTTGGAA 
CATAGAAGAT 
AATTAAAATA 
CTGCCGATTG 
CACAAGGGCA 
TGACGTGGCC 
GAGGAGTTTC 
TCTG CAGAAG 
CATCGGAGAC 
CAGTGGGAAT 
CTACTTAGAG 
CATCTTGTAC 
TGTTAATGGA 
GATTTTCACC 
GAGCATGCTC 
AAAGGTTTTC 
TCCCATGAAA 
ATTTGTTGGA 
GGAGACCACA 
GCTGGTGTTT 
GAGAGGACAG 
TCCTACTGCC 
GACATTGCTG 
GCTGCGGGAT 
CCGGAAGACG 
GTATGCTTTT 
TGACACCACC 
GGAAAGAGAT 
TCCAAGGCCA 
ACATATTCCC 
CTCGTCTGCA 
CTGTGAGGTC 



31 
1 

ACTCCTTCAG 
CTTTCAAAAG 
GAAGGCTTGC 
TGGCTGAAAG 
GAGTGTTACG 
CGCCTTCAAG 
TGGGTGTTGA 
GTAT CGCAGA 
GCCATTACTC 
CTCATCATCG 
CTGGATTTGG 
TCTGAGATAG 
GGCGCCAACG 
GAAGGCATGC 
AAGCTTCTGT 
TGCTTCTATA 
TTTTCTGGGC 
GCTTTGCCGC 
AGGTTTCCCC 
TGGGGTCACT 
GCTCTGGAGC 
AATATTGTTT 
GCTTGGACTA 
TTTGGCATCT 
GCAACTATGG 
TGTTTGATTG 
GAGGAGGTGC 
AGCAATGGAA 
CCCCCGACGC 
TCT CAAGAAG 
AAAAAGAAAT 
TCAGTTTGTT 
AAACACCAGG 
TCGCAAACCT 
GTGCTTAGCC 
TCAAATTAAA 



RELHRQTGNG 
RCSCKPGVTG 
VTGERCDRCR 
VQRNGS PAKIj 
HP S AHDV I IjE 
RNLTALR I RA 
KRDSARLGPF 
PCPCHNGFSC 
NNVDPSASGN 
EPVGCRSDGT 
DGWPDTEbE 
TVERVRALGS 
ATRLAESHVE 
EKTKSIjAQQL> 
LSTLVTRHMD 
GNATFYEVES 
AADAQRAKNG 
ERKELEFDTN 
VLLEQKLSRA 
PPGCYNTQAL 



41 
1 

GACGACTTCG 
ACTCAAAATA 
GGACTCTCTG 
TCTATCAGGA 
AGAT CATTGA 
CAGGAGTTCC 
CAGGAGACAA 
ATATGGCCCT 
AGCACTGCAC 
ATGGCCACAC 
CACTCTCGTG 
TGGATGTGGT 
ATGTCGGGAT 
AGGCCACCAA 
TGGTTCATGG 
AGAACGTGGT 
AGATTTTATT 
CCTTCACTCT 
AGCTCTACAA 
GCATCAACGC 
ATGATACTGT 
ACACATATGT 
AATTCAGTCA 
ACTCGACCAT 
TCCTGAGCTC 
AAGATGTGGC 
AGGAGCTGGA 
AGAGGCTGAA 
TGTTCCGGGG 
AACACGGAGC 
CCAGGAAGAA 
GCACCCAGTG 
AAACACATTT 
GGAGTGCAGA 
TAACTTTTGT 
AACATTATGT 



FRCLNCNDNT 
ARCDRCLPGF 
SGYYNLDGGN 
QWSQRHQDVF 
GAGLRITAPL 
TYGEYSTGY I 
GTCIPCNCQG 
SVMPETEEW 
CDRLTGRCLK 
CVCKPGFGGP 
GRMQQAEQAL 
QYQNRVRDTH 
SASNMEQLTR 
TREATQAEIE 
EFKRTQKNLG 
ILKNLREFDL 
AGEALEISSE 
MDAVQMVITE 
KTQINSQLRP 
EQQ 



51 
I 

GCTTTACTGT 
TATGGAGGAA 
TGTGGCTTAT 
AGCCAGCACC 
GAAGAATTTG 
AGAAACCATC 
ACAAGAAACT 
TATCCTATTG 
TGACCTTGGG 
CCTGAAGTAC 
CAAAGCGGTC 
GAAGAAG CGG 
GATCCAGACA 
CAACT CGGAT 
AGCCTGGAGC 
CCTGTATATT 
TGAACGTTGG 
GGGAATCTTT 
AATCACCCAG 
CTTGGTCCAC 
GTTTGACAGT 
TGTTGTTACT 
TCTGGCTGTC 
CTGGCCCACC 
CGCACACTTC 
ATGGAGAGCA 
AACCAAGTCT 
CGAGCGCGAC 
CAGCTCCCTG 
TGTTAGTCAG 
ATAAGACATG 
TTAACACATC 
CTGTGGCCTT 
CCACAGGGGA 
TTATGTCGTT 
TTCACCAATA 



Seq ID NO; 222 Protein sequence: 
Protein Accession #: NP_057613 



MSVIVRTPSG 
ENEYEEWLKV 
KAEIKIWVLT 
KENDVALI ID 
TLAIGDGAND 
TKCILYCFYK 
TQESMLRFPQ 
DYLFVGNIVY 
PDMRGQATMV 
KAVLRDSNGK 
RAYDTTKKKS 



11 

! 

RLRLYCKGAD 
YQEASTILKD 
GDKQETAINI 
GHTLKYALSF 
VGMIQTAHVG 
NWLYIIELW 
LYKITQNGEG 
TYWVTVCLK 
LSSAHFWLGL 
RLNERDRLIK 
RKK 



21 

] 

NVIFERLSKD 
RAQRLEECYE 
GYSCRLVSQN 
EVRRSFLDLA 
VGISGNEGMQ 
FAFVNGFSGQ 
FNTKVFWGHC 
AGLETTAWTK 
FLVPTACLIE 
RLGRKTPPTL 



31 
I 

SKYMEETLCH 
IIEKNLLLLG 
MALILLKEDS 
LSCKAVICCR 
ATNNSDYAIA 
ILFERWCIGL 
INALVHSLIL 
FSHLAVWGSM 
DVAWRAAKHT 
FRGSSLQQGV 



41 

I 

LEYFATEGLR 
ATAIEDRLQA 
LDATRAAITQ 
VSPLQKSEIV 
QFSYLEKIiLL 
YNVIFTALPP 
FWFPMKALEH 
LTWLVFFGIY 
CKKTLLEEVQ 
PHGYAFSQEE 



51 

I 

TLCVAYADLS 
GVPET I ATLL 
HCTDLGNLLG 
DWKKRVKAI 
VHGAWSYNRV 
FTLGIFERSC 
DTVFDSGHAT 
STIWPTIPIA 
ELETKSRVLG 
HGAVSQEEVI 



60 
120 
180 
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300 
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420 
480 
540 
600 
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720 
780 
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900 
960 
1020 
1080 
1140 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



Seq ID NO: 223 DNA sequence 
Nucleic Acid Accession #: BC017001 
85 Coding sequence: 1-394 



11 



21 



31 



275 



WO 02/086443 

| j I I I I 

AACGCTGGGC AGGGCCGGCG CGGGTCGGGG GGCGCCCGAG GGGCCCGGGC CGAGCGGCGG 60 

CGCGCAGGGC GGCAGCATCC ACTCGGGCCG CATCGCCGCG GTGCACAACG TGCCGCTGAG 120 

CGTGCTCATC CGGCCGCTGC CGTCCGTGTT GGACCCCGCC AAGGTGCAGA GCCTCGTGGA 180 

CACGATCCGG GAGGACCCAG ACAGCGTGCC CCCCATCGAT GTCCTCTGGA TCAAAGGGGC 240 

CCAGGGAGGT GACTACTTCT ACTCCTTTGG GGGCTGCCAC CGCTACGCGG CCTACCAGCA 300 

ACTGCAGCGA GAGACCATCC CCGCCAAGCT TGTCCAGTCC ACTCTCTCAG ACCTAAGGGT 360 

GTACCTGGGA GCATCCACAC CAGACTTGCA GTAGCAGCCT CCTTGGCACC TGCTGCCACC 420 

TTCAAGAGCC CAGAAGACAC ACCTGGCCTC CAGCAGGCTG GGCCATGCAG AAGGGATAGC 480 

AGGGGTG CAT TCTCTTTGCA CCTGGCGAGA GGGTCTGACT CTGGGCACCC CTCTCACCGG 540 

CTACAAGGCC TTGGACTCAC TGTACAGTGT GGGAGCCCCA GTTCCCACCT CTGTGACAAT 600 

AGGATCATGG CCTTACCCTT GAAGCATTAC CGAGAAGGAG AACAGAGATG GGCTTGAAGA 660 

GCCACGTGCT GCCGGCTCCA AATTCCCAAG GACAAGGATC CCTCTGCATT TTTGTCTATG 72 0 

TAACCTCTTA TATGGACTAC ATTCAGCTGC AAGGAAAGGA AAACCTTGAT TGCAGTGGTT 780 

TAAACAAACA GAAGATTGTT TTTCCACATA GCATGGATTC TGGAGATGGG TGGCTAATGG 840 

TATTGGTTCA ACAACTCCAC GGAGGTAGGG GTCACGTCTT GGATCCTTTT GCCTTAATCT 90 0 

CAGTGCTCGT TACTTCATGG TCCCAAGATG GCTGCTGTAT CCCCAAGAAT CATGT CTGCG 960 

TTCAAGGAAG GAGGGGTGGA GGAAGAGGAA GGGCCAAACT AGCTGGACCC GTCACCTTCT 102 0 

AT CAGAAAGT AAAACCTCGT CAGAAGTCTG TTTCCTGCTC TCTCCCTCTG CATATCTTCA 1080 

CTTAGATGCC CTTGGCCCGA GCCAGCTACC ATTGCACCTC TAGCTGCAAA CAAAGCTAAG 1140 

ACAGCAGGGA ACAGAATTGT CATGGCTGAA TAGACCAATC GTGTTCCATC TACTGAGACT 12 0 0 

GGCACACTGC CTCCTGCAAT AAAACTGGGA TCCCATTACC AAGAGAGAAA TGCAGAATTG 1260 

TGTACCAGTT AGCTTTTGCT GTGTAACAAA CCATCCCCAA ACTTGGCAGC TAGAAACAAA 1320 

CCCTGTATTT TCCCACAATC CTATGGGTTG GCAATTTGGG CTGGGCTCAA CAGGGCAGTT 13 80 

CTGCTGCTCA CACCTGGGAT CCCTCATGGA GCTAAGGTCA GCTGTTACCT CAGCTGGGCC 1440 

TGGATGGTCT AGGATAGCCT TACTCACTTG CCTGGCAGGT GACAGGCTGT TGGCTGGAAT 150 0 

TGCTTGGTTC TCCTCCATGT GGCCTCTCCA GCAGGCTAGC TCAGGCTTAT TCACATGATG 1560 

GCTTCAGGAT TCCAAAGAGA GTGAGAGTAG AAGCTGAAAG ACTTCTTGAG TTCTTGGCCT 1620 

GGAACTGGGA CTAGGACAGT GTCACTTCTG CTAAGTTCTT TTGGTCAGAG CAAATCACAA 168 0 

GGCTTTACCC AGATTCAAGG GATGAGAAAC AGACTACATG TCTTGATGAG GGGAACCACA 1740 

AAGAGCTTGT GGCCATTTTT CACCTATCAC AAATAATTTT GGATGGGTAT T T ATTTGG AT 180 0 

AAAGGTATTT CCCTCTTCCC CCTTTCTCTC TGTCTCATGG GGCCTCACTC TGCCAAGTTG 1860 

GAAGGCACTA AGACATTGTC CTGGCCCTCA GGGT CTAGGG GAAGAGGTGT TGGGGCAGGA 192 0 

AGTGAGTCTC TCCATGGGCT GGACCCACTG TAGTAGGAGT GCCTCCTTGT CTGCACTGCT 1980 

GGTATGGGGT TAGGCCAGGT AGGACATT CC AGAGGGGCTT CTGAAAACCA AGAGTCCCTG 2040 

GGGAAAGGGA ACAGAGTAAG GCAGGCCTTG TTCTCACTGC CCTCTAAGGG AACTTGGTCA 210 0 

CTCGGCACTT TTAAGCCTCA GTTTCTCCAG TTCAATAATA AGGACAAGAG CTTTTCCCAT 2160 

GCATTCTCTT TCCCCGGGAA AGTTGACTGA GGTGACCAGT AATAGAATTG AAAAGGGAGA 222 0 

GTGTCTTCAG TGCAATGTGG CATCCTGGAT TGGGTCTTGG AACAAAAACA GGACATTAGT 22 80 

GGGAAAATTG GAAAT CTGAA AAAAGTCTGA ATTTTAGTTA ATATACCAAT TTCAGTCTCT. 2340 

TGGTTTTGAC AGATGTACCA TGGTGATGTA AGATGTTGAC CTTGGGGTAG GCTGGGTGAA 240 0 

GGGTATACAG GAACTCTTTG TACTATCTCT GCAACTTCTC TGTAAATCTA GTATCATTCC 2460 
AAAATAAAAG TTTATTTAAT TTAAAAAAAA AAAAAAAAAA AA 



Seq ID NO: 224 Protein sequence: 
Protein Accession #: AAH17001.1 

1 11 21 31 41 51 

| 1 | I I I 

TLGRAGAGRG APEGPGPSGG AQGGSIHSGR IAAVHNVPLS VLIRPLPSVL DPAKVQSLVD 60 
TIREDPDSVP PIDVLWIKGA QGGDYFYSFG GCHRYAAYQQ LQRETIPAKL VQSTLSDLRV 120 
YLGASTPDLQ 

Seq ID NO: 225 DNA sequence 
Nucleic Acid Accession #: NM_021048 
Coding sequence: 1..1110 



1 11 21 31 41 51 

| I I 1 1 

ATGCCTCGAG CTCCAAAGCG TCAGCGCTGC ATGCCTGAAG AAGATCTTCA AT CCCAAAGT 60 

GAGACACAGG GCCTCGAGGG TGCACAGGCT CCCCTGGCTG TGGAGGAGGA TGCTTCATCA 12 0 

TCCACTTCCA CCAGCTCCTC TTTTCCATCC TCTTTTCCCT CCTCCTCCTC TTCCTCCTCC 180 

TCCTCCTGCT ATCCTCTAAT ACCAAGCACC CCAGAGGAGG TTTCTGCTGA TGATGAGACA 240 

CCAAATCCTC CCCAGAGTGC TCAGATAGCC TGCTCCTCCC CCTCGGTCGT TGCTTCCCTT 300 

CCATTAGATC AAT CTGATGA GGGCTCCAGC AGCCAAAAGG AGGAGAGT CC AAGCACCCTA 360 

CAGGTCCTGC CAGACAGTGA GTCTTTACCC AGAAGTGAGA TAGATGAAAA GGTGACTGAT 420 

TTGGTGCAGT TTCTGCTCTT CAAGTATCAA ATGAAGGAGC CG AT CACAAA GGCAGAAATA 480 

CTGGAGAGTG TCATAAAAAA TTATGAAGAC CACTTCCCTT TGTTGTTTAG TGAAGCCTCC 540 

GAGTGCATGC TGCTGGTCTT TGGCATTGAT GTAAAGGAAG TGGATCCCAC TGGCCACTCC 600 

TTTGTCCTTG TCACCTCCCT GGGCCTCACC TATGATGGGA TGCTGAGTGA TGTCCAGAGC 660 

ATGCCCAAGA CTGGCATTCT CATACTTATC CTAAGCATAA TCTTCATAGA GGGCTACTGC 720 

ACCCCTGAGG AGGTCATCTG GGAAGCACTG AATATGATGG GGCTGTATGA TGGGATGGAG 780 

CACCTCATTT ATGGGGAGCC CAGGAAGCTG CTCACCCAAG ATTGGGTGCA GGAAAACTAC 840 

CTGGAGTACC GGCAGGTGCC TGGCAGTGAT CCTGCACGGT ATGAGTTTCT GTGGGGTCCA 900 

AGGGCTCATG CTGAAATTAG GAAGATGAGT CTCCTGAAAT TTTTGGCCAA GGTAAATGGG 960 

AGTGATCCAA GATCCTTCCC ACTGTGGTAT GAGGAGGCTT TGAAAGATGA GGAAGAGAGA 1020 

GCCCAGGACA GAATTGCCAC CACAGATGAT ACTACTGCCA TGGCCAGTGC AAGTTCTAGC 1080 
GCTACAGGTA GCTTCTCCTA CCCTGAATAA 

Seq ID NO: 22 6 Protein sequence: 
Protein Accession #: NP_0663 8 6 

1 11 21 31 41 51 

MPRAPKRQRC MPEEDLQSQS ETQGLEGAQA PLAVEEDASS STSTSSSFPS SFPSSSSSSS 60 



276 



WO 02/086443 

SSCYPLIPST PEEVSADDET PNPPQSAQIA CSSPSWASL PLDQSDEGSS SQKEESPSTL 
QVLPDSESLP RSEIDEKVTD LVQFLLFKYQ MKEPITKAEI LESVIKNYED HFPLLFSEAS 
ECMLLVFGID VKEVDPTGHS FVLVTSLGLT YDGMLSDVQS MPKTGILILI LSIIFIEGYC 
TPEEVIWEAL NMMGLYDGME HLIYGEPRKL LTQDWVQENY LEYRQVPGSD PARYEFLWGP 
RAHAE I RKMS LLKFLAKVNG SDPRSFPLWY EEALKDEEER AQDRIATTDD TTAMASASSS 
ATGSFSYPE 

Seq ID NO: 227 DNA sequence 

Nucleic Acid Accession #: NM_005025.1 

Coding sequence: 82-1314 

1 11 21 31 41 51 

I I 1 I I I 

GCGGAGCACA GTCCGCCGAG CACAAGCTCC AGCATCCCGT CAGGGGTTGC AGGTGTGTGG 
GAGGCTTGAA ACTGTTACAA TATGGCTTTC CTTGGACTCT TCTCTTTGCT GGTTCTGCAA 
AGTATGGCTA CAGGGGCCAC TTTCCCTGAG GAAGCCATTG CTGACTTGTC AGTGAATATG 
TATAATCGTC TTAGAGCCAC TGGTGAAGAT GAAAATATTC TCTTCTCTCC ATTGAGTATT 
GCTCTTGCAA TGGGAATGAT GGAACTTGGG GCCCAAGGAT CTACCCAGAA AGAAATCCGC 
CACTCAATGG GATATGACAG CCTAAAAAAT GGTGAAGAAT TTTCTTTCTT GAAGGAGTTT 
TCAAACATGG TAACTGCTAA AGAGAGCCAA TATGTGATGA AAATTGCCAA TTCCTTGTTT 
GTGCAAAATG GATTT CATGT CAATGAGGAG TTTTTGCAAA TGATGAAAAA ATATTTTAAT 
GCAGCAGTAA AT CATGTGGA CTTCAGTCAA AATGTAGCCG TGGCCAACTA CAT CAATAAG 
TGGGTGGAGA ATAACACAAA CAATCTGGTG AAAGATTTGG TATCCCCAAG GGATTTTGAT 
GCTGCCACTT ATCTGGCCCT CATTAATGCT GTCTATTTCA AGGGGAAC T G GAAGTCGCAG 
TTTAGGCCTG AAAATACTAG AACCTTTTCT TTCACTAAAG ATGATGAAAG TGAAGTCCAA 
AT TCCAATG A TGTAT CAGCA AGGAGAATTT TATTATGGGG AATTTAGTGA TGGCTCCAAT 
GAAGCTGGTG GT AT CTACCA AGTCCTAGAA ATACCATATG AAGGAGATGA AATAAGCATG 
ATGCTGGTGC TGT CCAGACA GGAAGTT CCT CTTGCTACTC TGGAGCCATT AGT CAAAGCA 
CAGCTGGTTG AAGAATGGGC AAACTCTGTG AAGAAGCAAA AAGTAGAAGT ATACCTGCCC 
AGGTTCACAG TGGAACAGGA AATTGATTTA AAAGATGTTT TGAAGGCTCT TGGAATAACT 
GAAATTTTCA TCAAAGATGC AAATTTGACA GGCCTCTCTG ATAATAAGGA GATTTTTCTT 
TCCAAAGCAA TTCACAAGTC CTTCCTAGAG GTTAATGAAG AAGGCTCAGA AGCTGCTGCT 
GT CTCAGGAA TGATTGCAAT TAGTAGGATG GCTGTGCTGT ATCCTCAAGT TATTGTCGAC 
CATCCATTTT TCTTTCTTAT CAGAAACAGG AGAACTGGTA CAATTCTATT CATGGGACGA 
GT CATGCAT C CTGAAACAAT GAACACAAGT GGACATGATT TCGAAGAACT TTAAGTTACT 
TTATTTGAAT AACAAGGAAA ACAGTAACTA AGCACATTAT GTTTG CAACT GGTATATATT 
TAGGATTTGT GTTTTACAGT ATATCTTAAG ATAATATTTA AAATAGTTCC AGATAAAAAC 
AATATATGTA AATTATAAGT AACTTGTCAA GGAATGTTAT CAGTATTAAG CTAATGGTCC 
TGTTATGTCA TTGTGTTTGT GTGCTGTTGT TTAAAATAAA AGTACCTATT GAACATGTG 

Seq ID NO: 22 8 Protein sequence: 
Protein Accession #: NP_005016.1 

1 11 21 31 41 51 

I I I I 1 I 

MAFLGLFSIiL VLQSMATGAT FPEEAIADLS VNMYNRLRAT GEDENILFSP LSI AL AMGMM 

ELGAQGSTQK EIRHSMGYDS LKNGEEFSFL KEFSNMVTAK ESQYVMKIAN SLFVQNGFHV 

NEEFLQMMKK YFNAAVNHVD FSQNVAVANY INKWVENNTN NLVKDLVSPR DFDAATYLAL 

INAVYFKGNW KSQFRPENTR TFSFTKDDES EVQIPMMYQQ GEFYYGEFSD GSNEAGGIYQ 

VLEIPYEGDE ISMMLVLSRQ EVPLATLEPL VKAQLVEEWA NSVKKQKVEV YLPRFTVEQE 

IDLKDVLKAL GITEIFIKDA NLTGLSDNKE IFLSKAIHKS FLEVNEEGSE AAAVSGMIAI 

SRMAVLYPQV IVDHPFFFLI RNRRTGT I LF MGRVMHPETM NTSGHDFEEL 

Seq ID NO: 229 DNA sequence 
Nucleic Acid Accession #: NM_003695 
Coding sequences 12-398 

1 11 21 31 41 51 

! I I I I I 

CGACATCAGA GATGAGGACA GCATTGCTGC TCCTTGCAGC CCTGGCTGTG GCTACAGGGC 

CAGCCCTTAC CCTGCGCTGC CACGTGTGCA CCAGCTCCAG CAACTGCAAG CATTCTGTGG 

TCTGCCCGGC CAGCTCTCGC TTCTGCAAGA CCACGAACAC AGTGGAGCCT CTGAGGGGGA 

AT CTGGTGAA GAAGGACTGT GCGGAGTCGT GCACACCCAG CTACACCCTG CAAGGCCAGG 

TCAGCAGCGG CACCAGCTCC ACCCAGTGCT GCCAGGAGGA CCTGTGCAAT GAGAAGCTGC 

ACAACGCTGC ACCCACCCGC ACCGCCCTCG CCCACAGTGC CCTCAGCCTG GGGCTGGCCC 

TGAGCCTCCT GGCCGTCATC TTAGCCCCCA GCCTGTGACC TTCCCCCCAG GGAAGGCCCC 

TCATGCCTTT CCTTCCCTTT CTCTGGGGAT TCCACACCTC TCTTCCCCAG CCGGCAACGG 

GGGTGCCAGG AGCCCCAGGC TGAGGGCTTC CCCGAAAGTC TGGGACCAGG TCCAGGTGGG 

CATGGAATGC TGATGACTTG GAGCAGGCCC CACAGACCCC ACAGAGGATG AAGCCACCCC 

ACAGAGGATG CAGCCCCCAG CTGCATGGAA GGTGGAGGAC AGAAGCCCTG TGGATCCCCG 

GATTTCACAC TCCTTCTGTT TTGTTGCCGT TTATTTTGTA CTCAAATCTC TACATGGAGA 
TAAATGATTT AAACC 

Seq ID NO: 230 Protein sequence: 
Protein Accession ft: NP_003686 

1 11 21 31 41 51 

i 1 1 I I 1 

MRTALLLLAA LAVATGPALT LRCHVCTSSS NCKHSWCPA SSRFCKTTNT VEPLRGNLVK 
KDCAESCTPS YTLQGQVSSG TSSTQCCQED LCNEKLHNAA PTRTALAHSA LSLGLALSLL 
AVILAPSL 



Seq ID NO: 231 DNA sequence 

Nucleic Acid Accession ft : Eos sequence 

Coding sequence: 126-752 



277 



WO 02/086443 

i 11 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



CCGGGCAGGT 
AGGGGCGCAG 
AGAAGATGAA 
GTGTGAGGGA 
GGAGAACTCG 
TCTCTCTTGA 
GAAAGTACCA 
ACCCAGTGGA 
CCCGTGTGGC 
ACGAGTCTTC 
AAGTTGGGCC 
TCATCCTGTC 
TTCAGGATGG 
CAAGAGTTCA 
TTCACTGCAC 
TGACTCAGTT 
ACCAGCAACC 
AGGGAGGGGA 
TTATTTCCTG 
TCTGGTATTA 
GGAATT CAGC 
AGCTCTGACT 
GGCTTCACAT 
GTGGTTTTTT 
CAGAAGGATG 
GGTAATTCCT 
TGGGGAGCTT 
TCCTAGAGAA 
TGCTGTCAGC 
TTCTACTGTG 
TGTATAACTC 
GTGTTTGTGA 



GGCTCATGCT 
GAATTCTGAT 
GGATAT CGAC 
GAGAACCAGC 
ACCGTTGGAA 
TGCCTCCATG 
TCATGGCTTG 
CAATGCTGGG 
CCACAAGAAG 
TGACGTGAAC 
AGACGCTGCT 
CATCGTGTGC 
CTGTATTCTG 
GCCTTCCTTT 
CGCCATCTTA 
CCACATTTTG 
CGGCTGTATA 
GAGGTGGAAT 
GGACTTGGCA 
ATTTAATCTC 
GTAGCTACGT 
TACAGCTGCA 
CAATTTTTTT 
GTTTTATTTT 
CCACCAGTTT 
CACAACTTAG 
TTCGTTTTGT 
TCTACTCCGT 
CTGATATACT 
TGTCGTTAAA 
GACTCTGTAT 
GATGCTCTTT 



21 

! 

CGGGAGCGTG 
GTGAAACTAA 
ATAGGAAAAG 
ACTTCTGGGA 
TGCCAAGATG 
CATTCTCAGC 
AGTGCTCTGA 
CTTTTTTCCT 
GGGGAGCTCT 
TGCAGAAGAC 
TCCCTGCGAA 
CTGATGATCA 
CGGTCAGAAT 
GGAGACTGCT 
CTGAGTTGCT 
GATTGCATAC 
CAGTGGTGAC 
ACAGTAGTTC 
AAAATCTGAT 
AGGAAAAACA 
CCAGACCGTG 
GTCACCTTTG 
CTTCCTTTAG 
TGTCAAGATT 
TTCCTTGAGG 
TGCACCAGTA 
TTTGTTTTTA 
TGCAGAATCA 
ACTTTGGACT 
CAACCGTCGG 
GTTTCAATGT 
GAAGATGGTA 



31 
I 

GTTGAGCGGC 
CAGTCTGTGA 
AGTATATCAT 
CGCACAGAGA 
CCTTGGAAAC 
TCAGAATCCT 
AGCCCATCCG 
GTATGACTTT 
CAATGGAAGA 
TAGAGAGACT 
GGGTTGTGTG 
CGCAGCTGGC 
GAGAGAGTCA 
CCATCAGTGC 
TCACGTGAGG 
TGGAAAAGAA 
CCAAGCAATG 
TTGGAATCTG 
TGGTGGGGAT 
AGAAATTAAC 
GTGTCTGGCC 
CTATAAGGCA 
GGTGGGGGAT 
GATTTTTAGA 
CCTAGGATTT 
GCACCAGCCA 
ATTCTCTTTC 
TTGCAACCTC 
CTGGAAACAG 
AGACCAGATG 
ATGTTACTGC 
CTTTTATATT 



41 
I 

TGGCGCGGTT 
GCCCTGGAAC 
CCCCAGTCCT 
CCGTGAAGAT 
AGCAGCCCGA 
GGATGAGGAG 
GACTACTTCC 
TTCGTGGCTT 
CGTGTGGTCT 
GTGGCAAGAA 
GATCTTCTGC 
TGGCTTCAGT 
AGCTGGGCAG 
CGAGGTGTGT 
AAAAGGGGGC 
GCCAATCTTC 
GATATAAACC 
AAGTCTCCTA 
CTCCTAGGAC 
CCAGAGAGAG 
TCCATTTTTG 
CCTGGGTAGA 
TGGTTTGGCT 
TGCAAGGACT 
TTTATTCTGT 
TTTTGAG CAG 
CTTAGCAGCA 
AGGAGCCCTC 
ATATGGGTTC 
ACCTGTTAGA 
AATGCTTCAC 
T 



51 
I 

GTCCTGGAGC 
CTCCACTCAG 
GGGTATAGAA 
TCCAAGTTCA 
GCCGAGGGCC 
CATCCCAAGG 
AAACACCAGC 
TCTTCTCTGG 
CTGTCCAAGC 
GAGCTGAATG 
CGCACCAGGC 
GGACCAAATT 
AATCTCTCGC 
GGGAACAGGC 
TTTGGCCCTG 
TTGCTAGTAA 
TAAAAATCTG 
TTTGATCAGG 
CTAGTGGACA 
TCTGGGTTTT 
TCTGTCATTC 
AGGGTGGATG 
TTGTTTTGTT 
TGAAAAGACC 
CCCGAGCAGA 
AGTACCTCTT 
AGGTCTTTTT 
ACTGATTGAG 
TATTCT CTAT 
TGGCTAGTCC 
CTGCTGTACA 



PCT/US02/12476 



Seq ID NO: 2 32 Protein sequence: 
Protein Accession # : Eos sequence 



31 



41 



51 



1 11 21 

I I 1 I I 

MKDIDIGKEY IIPSPGYRSV RERTSTSGTH RDREDSKFRR TRPLECQDAL ETAARAEGLS 

LDASMHSQLR ILDEEHPKGK YHHGLSALKP IRTTSKHQHP VDNAGliF S CM TFSWLSSliAR 

VAHKKGELSM EDVWSLSKHE SSDVNCRRLE RLWQEELNEV GPDAASLRRV VWIFCRTRLI 
LSIVCLMITQ IiAGFSGPMFQ DGCILRSE 

Seq ID NO: 233 DNA sequence 

Nucleic Acid Accession #: CAT cluster 



TTTTAATGGT 
CACAACATGA 
TGTTTCTTAG 
TAGTAAACTT 
TTTGTTTCCA 
TGTTTGACTG 
TTGTTATGGT 
AAGCTATTTT 
TATATTATGG 
ATTAGCCTAC 
AATAGGTTAA 
AATTAAAGTA 
TATGAAAGCT 
TATATATTTT 
TATCCTGTAT 
TAGAAGACCC 
TTTTTTTTAG 
GATTCTGTTA 
TCCAGATAAA 
CTTAGGCTCT 
CCATAGTATC 
ACTATAATAG 
TATCAATGTT 
TACATAAAAA 
AAGTCATTAA 
AGAATCATCT 
GGAAGATGGA 
GAGTTGAAGA 
CTTTTAAAAC 
TATTCTTCTA 
AGCCCTTGTG 
CTGTTCAGGT 
TGAGATGGAG 
ATCATATGGA 
TTTTAATTCT 
CAATCCCCAA 
TTTCTGGAAT 



11 

I 

GCTCATATAT 
ATCACATAAT 
AT CGTAGTCA 
CTCTTTCATC 
AAGTCACAAT 
AAAGCAAAAC 
ATATCTTTTT 
AATCAT CAAG 
ATTAACCAGA 
TATAGTATTA 
AAAGTAGTTA 
CATTTTAAAT 
CTGGCTATCA 
TTAGTTCCTT 
TTTTTTTAAG 
ACTCTTACTA 
GTAGTTTAAA 
AGCAT CCAAA 
AGATGGAGAA 
CTCCATGTAT 
AAGTGGAGGG 
AAGTTTGAGT 
ATCCAATGAT 
GTGCTCATGT 
TAATTTAATA 
GAGGACTTTT 
GCCATGCTTG 
CCACTGCTCT 
TAGTAATGTA 
ATAACTAGCA 
TAAGATTATT 
GAGAAAACAT 
GAGGTGGGCA 
AGTTTGGAAA 
CACTGTCTCA 
GGGGCAGTGT 
TGCACCTACA 



21 
I 

ACTGTATTTT 
CATGATTTTT 
TTGAGAAGTC 
TTTGTGTTAG 
TGAATTATTC 
AACGTGACAG 
ATTAAATATT 
TATGGAAAAC 
ATTGTATCAT 
ATAAATTTTT 
CAAATTAAAC 
GAGCTTTATA 
TCCTGGGATA 
TGAGATAACT 
AATTGTTTTA 
GGTTCCCTAA 
GCAAGCACTG 
AACAATGCCT 
TACCTCATGT 
CTTTCTTAAG 
TAGTTCAGAA 
AATATT TTAA 
TTTTATTAAA 
ATTTGAATTT 
ATTGTTTTAA 
AATATGGAAT 
TTTTTCCAAA 
AAATTAGTGC 
CCCAGTTAAG 
TTTATTACAT 
ATTTCTTCTC 
AATGGATTTT 
CATTTAAGGT 
AGAGAGCTTA 
AAAGAGAATC 
TACCTTACTC 
TGTTTTCTTA 



31 

I 

TTGTTGTTTA 
TTTTTTTACT 
CCAATAACTC 
CTCTGTAGTC 
TTAGATACCT 
TTTATTTTCA 
TATTTTGACT 
AAATTACTAT 
TTTTGGCCTA 
CAGTTGGTTT 
TTACTAATTT 
ATACCTTAAA 
GTAATTTCTA 
AATTTCTAAT 
TAAATAGGTC 
GGATCTGCCA 
ATACCAGTGG 
AATTTCAGTT 
ACTGTGACTT 
GAAAAGTTTC 
AAGTTAATAG 
TAAATTTATA 
AAATTACCTT 
TAAATAATTT 
AT CAGTGGTT 
CCACCTCATA 
AGCTCTTTGA 
AGGAAAATGC 
TTTTGATGGT 
GAAATTTAAG 
TATAACTTCA 

CAGTTCACTA 
TCACAGGTTT 
AGCTCTCCAG 
CTTCACTGCT 
TTAACATTCA 



41 

I 

GTTTTACTTA 
TTTACTCCCC 
TAAACTTTTG 
TTAACCTGGA 
TAAGCCACTG 
AACACTAACT 
AAGCTTTCAT 
TGCATTTTCC 
ATGTCTGGAT 
GGGCAAATTT 
ATACCTGATT 
AAGTTGGTTC 
ATTATATAGT 
TATATATGTT 
ATAAGATACA 
TAGATTTTTT 
GAGTTGGTCT 
CTTAGGTTAT 
GAAAATGAAT 
TGAGTGTGAT 
GAAATCTTTT 
TAATTCAAAT 
ATTATTAGAA 
ATTTAAATCA 
TTCAACCCTC 
ACAATTAAGT 
GTGATTCTAA 
TTTTATTTCT 
TTAAATTCCA 
AGTTTAAGTT 
AAATAGATAT 
CTCTGGAGCT 
ACCTATGGTT 
GTATGCTGGT 
CAGTTCTAGA 
TCTTAGAAGG 
GAATTGGGAA 



51 
I 

TTGAGAGTGT 
AAATTATTCA 
AGTTATAACG 
TTTTAATTTT 
AATTCAGTTC 
TCT TGATATT 
AAAATATTTG 
TATATATGCA 
ATAAAAGATA 
AAACCTGAAA 
TTTTTTCTTG 
TAATTTAAAA 
ATTTCAAAAC 
TCAAAAACCA 
AGGTCTGCAT 

TGATCTAGGA 
GGCTTGTGAC 
TCTTAAAATT 
CTCTCTTTTG 
GTGACAGCAG 
GATAAAAATG 
CTGTGCCTAT 
AGACCACCAT 
ACTTCATATT 
CTAAATTTCT 
TTTGTAGTCA 
CCCATGTTAA 
CTAAAGAACA 
CCATCAAACT 
TTCATTCAAA 
GCCTGTTCAG 
CAGAGTTCTG 
GAATGGATAG 
AAAGCTTTGA 
TAGAATTAAG 
TATTAATTTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
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TCCAGTGAGT AGTTTTCTGA AATTGGTAAC TTGGAGAGTA AAATAACGTA TTTTGCTTTT 22 80 

CAATTTTGTG TTTGTTTACT TTTATGTAAA AATTTGATAT GTGAATTACA CAGTTCTAAT 2340 

AAAACCTCAT GCCTTTTCAT TACATCTAAT TTGAACTCTC AACTTCAGTG CCAGAAGTGC 240 0 

TTTAAAGATG CTTTAATGAA AAGTATTAAG AAAATATATA GATTTGTATG TCAGTTTATA 2460 

CTT CAGAAAT CCATATATTT GT CATATTT A TTTTTTTAGA AACCTCCTAA TTGGATAACT 252 0 

AGATGGTATT TAAAATGAAT GCCCAAAAAT ATCTTGTACC TTTGTCCAAA AGTTTATCTG 2580 

TTGGAAGCCG CCAGCCATTC ATGTAGAGAG TTTATAAGAA AATAATTTAA AATTGTATGC 2 640 

ATTTTATATT ACTATGGTAT CTGTGTACCA TATTTCTAAG TATTCATTAT TAAATTGGTA 2700 

CTTCTTAAAA CCATAACCTG GCTTGCCTTT TAGTGTTAAA CACAAAATCC AACATTGTAT 2 760 

ATAGAGATTC TT CTTTTATG AAGAAGAGCT GACGTAATTT ATTACCAGTG CATCTGCACA 2 82 0 

AAGACATTAA CATAAGTCTC TGAGCAGTGA TACATTTTCA AACATGAAGA GTGACAACCA 2 880 

CCACATTAAA CAACCACGGC AACACTCAGA CTTGGCACTT TCCTACGAAT CCATCCTATA 2 940 

TGTGCCTGGT ATCGCCTCTG GCATAACTTA CACGAATCGT CCTCCCTACT TGTCTACGCT 3 000 

CCTTCATCAA GCACTTGCCA ACACATTCAC CTCTAACTTG TACAACCTTA CCAACTCACC 3 060 

ACAACATCTG CAACTCTACC CTATCAACTG CCAACCTAAA GACCCCCAAC ACAACACAAC 312 0 

CCCCAAACAC AAAACCACTA AAT CAT AAC C ACCACACACG CCACACACCA CACACCCACC 3180 

CACACAACCA ACACACCACG ACCAAACACC CCACCACAAA CAAGCTAACA ACCACAAACA . 3240 
GACAACACAT CACATACACT CACTACCCCC CCATACTCCC ACCCACCA 

Seq ID NO: 234 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 27-281 

1 11 21 31 41 51 

111111 

AGCAGGAGGA GAGCTGGCGG GAAGACATGC ACCCCTTGAA GACCCAGAGA GAGGCCGTCT 60 

GTCTACCGCG TAG C AGT T AC AT C AGACTGA GACACTTCCT GTTTACAGGA GACTATAAAA 12 0 

TTCCTGCCCC GTGCTCATTT GGGGCTGACG CCATTTTAGG CCTCAGCCCA TCTGCACCCA 180 

GGCGCTCACT GAAACAGTGT GTTGCTCCAC ACCGCCTTGT TTTGCTTGTT GGCGCGCTCT 240 

CAGGGTTCCG ACCAATCCAA GAGCCTTGCA GAAAGCATTA ACGTGCTTTT CTCTTTGGCA 300 

GAGTTTTTCT TTGCTCTGAT CTTGGAGACA TCCCTCTGCC TAGTGGAAAC ATAAGGAATA 3 60 

CAGAAAGAAT GCAAGGAGAT AGACCAACGT GAGATTCTCC TTCATGCACT CAAGAGAAAG 42 0 

ATGTTGCAGG AAGAGCTAGT CTTTCAGGCT GGGCTGGTGA CCTGAGAAAG AATGT CCAGC 480 

TTTTCTTCTC CACTTGGCAT ATCAAGAGCC AGGCGTGGAA GACTAAAACA GGAAATGTTT 540 

ATAAAAACTG TTCAGCGGTT CGCCAACAAG AAGTGGTAAA GTAGCAAAAA TGGGGATGGA 600 

GATGCCAGGA GGAAAGATGC CAGGGGTAAA GTGGGAAAAT GGGAACCTGA AGCCAGGAGG 660 

T CAAGCC AAG CCAACAGGTG TTCTGTTTTT CAT CACAGAA CTAATAAGTG GTGCTGAGGA 720 

CTCAAACCCG GGGAAGCCCA CTCTAGAACC CATGCTGGTC ATCCATATCC CCAAGGCCCT 780 

GGTCAGAACA CAGCTAAGCA GATGGCTTGG GTCATCAGGA CGTCCATTAC ATCCAAAGGA 840 

AG ACAG CCTG TGACGTTTCA AAAGCAAAAG TCCCCTACCA GCCAGTGAAG CTACCTGATT 900 

TCTCAGTATC TTACGCCCAG TGACACGATC TACCCTCAAA ACTTAAAAAA AAAAGGGAAA 960 

CATAAACACA TAACAGCAGC AGCAATAATT AAAGATGAGA TGAGAACAAT TAAGAAAAAA 1020 

GGAAAGGTCT CCTGTGACTG TTTTATTTTT AGGGAAACAG AGAGGAAGAA GAATGATTTT 1080 

TCTTTTGATG ACT CTATATC CAACTCTGAG GTTTGATTAA AGAAATGACC TTGAACCACA 1140 

GCAAAGAAAA ATAAAAGACA ATTTCCAGTA AGTATGCCAG TTCGAATTAA TGATTTACTT 1200 

TTTATTTTTA AACTGAATTC AGCAGAGATT TACATGCATT ACGATGATTA ACAT CTG AAA 12 60 

TTTGACCTTG AAATAATCTT TACATTGTAA ATTCTTAATG ATCAAAACAA GGTTCTCAGT 132 0 

GATTAAAACA TATTAGTAAT TAATTATTAA AGGAGAATAA TTGCAAATAC AACATTCCTA 1380 

AAATCTCAAG GCTTTTAAAG CATTTGTACA AATGACTGGA CATTTTTTAA ATTTGAAAAA 1440 

AAAAAAAAGC CCTCCATCTG ATTCTCATTT TCATTGTCAG TGCAACAACA AAAAAGGTAT 1500 

GCACTTCTCT TCTCATTTTC CACTGTCTCG CAAGCTAGAA ATT CTCACGA CTACCTTTGA 1560 

TCCCATCAAA GCCAAAGAAA GAAAAGAAAA TTGTTCTGTA CAGATATATG ACATTAAAAA 1620 
ATAATCCC 



Seq ID NO; 235 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MHPLKTQREA VCLPRSSYIR LRHFLFTGDY KIPAPCSFGA DAILGLSPSA PRRSLKQCVA 60 
PHRLVLLVGA LSGFRPIQEP CRKH 

Seq ID NO: 236 DNA sequence 

Nucleic Acid Accession #: NM_002075 

Coding sequence: 406.. 1428 

1 11 21 31 41 51 

I I I I 1 I 

CCACAATAGG GGCAGACCTG TCCATCCTTC TCTGTGGGTC CCCTGTACCT TTCTCCCCCA 60 

AC AGGAT CAG ACCCAGAGGC AGCTGGTTGG GGTTTGTCGA GAAGAAGGAT TATCCAGATC 120 

AGTCCTTTCT AATCTCAGCT CCTGCCTGTA CCCTCCCATA CTCACCAAAC CCTCTTCCCC 180 

ACCACCCTGA GCTGAGGAGC ACAGTTTGAG GCCCCCCCAA CCCCCCGCCG GTCGGGGCCA 240 

GGCCAGGCCA GGCCAGCTCC TCTGGCAGCA GAGCCTGGGC AGGTGACGGG CGGGCGCGGG 300 

CGTCGCAGCT GAGGGAGTAA GGAGGCTCCC AGGAACCGGA GCTGGAAACC CGGCCGAGGT 360 

CC AG C CAG AG CCCAAGAGCC AGAGTGACCC CTCGACCTGT CAGCCATGGG GGAGATGGAG 420 

CAACTGCGTC AGGAAGCGGA GCAGCTCAAG AAGCAGATTG CAGATGCCAG GAAAGCCTGT 480 

GCTGACGTTA CTCTGGCAGA GCTGGTGTCT GGCCTAGAGG TGGTGGGACG AGTCCAGATG 540 

CGGACGCGGC GGACGTTAAG GGGACACCTG GCCAAGATTT ACGCCATGCA CTGGGCCACT 600 

GATTCTAAGC TGCTGGTAAG TGCCTCGCAA GATGGGAAGC TGATCGTGTG GGACAGCTAC 660 

ACCACCAACA AGGTGCACGC CATCCCACTG CGCTCCTCCT GGGTCATGAC CTGTGCCTAT 720 

GCCCCATCAG GGAACTTTGT GGCATGTGGG GGGCTGGACA ACATGTGTTC CATCTACAAC 780 

CTCAAATCCC GTGAGGGCAA TGTCAAGGTC AGCCGGGAGC TTTCTGCTCA CACAGGTTAT 840 

CTCTCCTGCT GCCGCTTCCT GGATGACAAC AATATTGTGA CCAGCTCGGG GGACACCACG 900 

TGTGCCTTGT GGGACATTGA GACTGGGCAG CAGAAGACTG TATTTGTGGG ACACACGGGT 960 

GACTGCATGA GCCTGGCTGT GTCTCCTGAC TTCAATCTCT TCATTTCGGG GGCCTGTGAT 1020 

GCCAGTGCCA AGCT CTGGGA TGTGCGAGAG GGGACCTGCC GTCAGACTTT CACTGGCCAC 1080 



279 
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GAGTCGGACA TCAACGCCAT CTGTTTCTTC CCCAATGGAG AGGCCATCTG CACGGGCTCG 1140 

GATGACGCTT CCTGCCGCTT GTTTGACCTG CGGGCAGACC AGGAGCTGAT CTGCTTCTCC 120 0 

CACGAGAGCA TCATCTGCGG CATCACGTCC GTGGCCTTCT CCCTCAGTGG CCGCCTACTA 1260 

TTCGCTGGCT ACGACGACTT CAACTGCAAT GTCTGGGACT CCATGAAGTC TGAGCGTGTG 132 0 

GGCAT CCTCT CTGGCCACGA TAACAGGGTG AGCTGCCTGG GAGTCACAGC TGACGGGATG 1380 

GCTGTGGCCA CAGGTTCCTG GGACAGCTTC CTCAAAATCT GGAACTGAGG AGGCTGGAGA 1440 

AAGGGAAGTG GAAGGCAGTG AACACACTCA GCAGCCCCCT GCCCGACCCC ATCTCATTCA 1500 

GGTGTTCTCT TCTATATTCC GGGTGCCATT CCCACTAAGC TTTCTCCTTT GAGGGCAGTG 1560 

GGGAGCATGG GACTGTGCCT TTGGGAGGCA GCATCAGGGA CACAGGGGCA AAGAACTGCC 162 0 

CCATCTCCTC CCATGGCCTT CCCTCCCCAC AGTCCTCACA GCCTCTCCCT TAATGAGCAA 1680 

GGACAACCTG CCCCTCCCCA GCCCTTTGCA GGCCCAGCAG ACTTGAGTCT GAGGCCCCAG 1740 

GCCCTAGGAT TCCTCCCCCA GAGCCACTAC CTTTGTCCAG GCCTGGGTGG TATAGGGCGT 1800 

TTGGCCCTGT GACTATGGCT CTGGCACCAC TAGGGTCCTG GCCCTCTTCT TATTCATGCT 1860 

TTCTCCTTTT TCTACCTTTT TTTCTCTCCT AAGACACCTG CAATAAAGTG TAGCACCCTG 1920 
GT 



Seq ID NO: 237 Protein sequence: 
Protein Accession #: NP_002066 



1 11 21 31 41 51 

| | | I I I 

MGEMEQLRQE AEQLKKQIAD ARKACADVTIi AELVSGLEW GRVQMRTRRT LRGHLAKIYA 60 
MHWATDSKLL VSASQDGKLI VWDSYTTNKV HAIPLRSSWV MTCAYAP SGN FVACGGLDNM 120 
CSIYNLKSRE GNVKVSRELS AHTGYL S CCR FLDDNNIVTS SGDTTCALWD IETGQQKTVF 180 
VGHTGDCMSL AVSPDFNLFI SGACDASAKL WDVREGTCRQ TFTGHESDIN AICFFPNGEA 240 
ICTGSDDASC RLFDLRADQE LICFSHESII CGITSVAFSL SGRLLFAGYD DFNCNVWDSM 300 
KSERVGILSG HDNRVSCLGV . TADGMAVATG SWDSFLKIWN 



Seq ID NO: 23 8 DNA sequence 

Nucleic Acid Accession #: CAT cluster 



1 11 21 

I I I 

TCCCAATGTG TNGAACCTAC CATAAATTCT 
TACCATTTGC TTTTAAGGCA GATAATCCTC 
ACTGATTCTG TGAATTATGA AATCTGAAAA 
TGCATTGACC AGTGTGAAGC ACAGTGGAAT 
TAAGTGACTG GAAAGCTGAA GAATCACCGG 
TTTTGACGAG TAT CGGGTG A CTTTGAGGTG 
CAAAAAGGGG AAAAAAAAGA GCAACCAAAG 
AAGAAAGAAA AATAAAATAC ACAATATGGA 
GGATCAAGAA GTTTGTGTAC ACATAATCTC 
CAGAAGTGAA T CAAAAT ATT TCAAAATGCT 
TAGAAAAGTT TTTCTGTAAA AGTCAGATAG 
CAACTACTCA ACTTTCCTAC TGTAGCACAA 
CTTGTGTTCC AATAAAGCTT CATTTACAAA 
ACTGTTGTTT GCCAAGTCCT AATATAGTTG 
GACATGAAAG TTCATTGGGT TGCTAAAAAG 
TAGGCTAAGT TATAATACAC TGTTTTAACA 
AAAATCCGAA ATAAAA 



31 41 51 

I I I 

TTTCTTACNG GACAATCTTA TNCTAANCAA 60 

CAAGTTTTCT AATGATATCT GAAACTAT T A 12 0 

GGAATTGGAA GTTGCTAAAA ATCTATCATT 180 

GAGAATGCGT GCCCTGACAC CAAAGAAAAA 240 

CTTCAGTGAC ATGGAACCCA GTGATTTGAT 3 00 

GT CAAGAAAC CACACTTTAA GAACAATGTC 3 60 

AAAAAAAATC CATAAAATTG CACAGAAGAA 42 0 

CGATGGAGAA AAACAGTTAC ATTTCTTTAT 480 

ATTTTGAGAT ATATAACTAT TTTTGTCTTT 540 

GT CTTATGAA ACTACAATAT TCTCACAGAT 600 

TAAATATTTT AGGTTTTGCA GTGTCTTTTG 660 

GAGTAGCTGT GGTACTGTGC AAATAAATTG 72 0 

AACATGCCAT GGGCCATATT TGGCCTGTAC 7 80 

CTTAGCAAGT ATTGTGAGCT ATTTGAGGAA 840 

TATGTAGAAA TTCAAAGGAA AATTAAAATT 900 

ATTGTAAAAT GTAAGAGAAA TTTACAAATA 960 



Seq ID NO: 239 DNA sequence 

Nucleic Acid Accession #: NM_001786.1 

Coding sequence: 130-1023 

1 11 21 31 41 • 51 

| | I I I I 

GGGGGGGGGG GGCACTTGGC TTCAAAGCTG GCTCTTGGAA ATTGAGCGGA GAGCGACGCG 60 

GTTGTTGTAG CTGCCGCTGC GGCCGCCGCG GAATAATAAG CCGGGATCTA CCATACCCAT 120 

TGACTAACTA TGGAAGATTA TACCAAAATA GAGAAAATTG GAGAAGGTAC CTATGGAGTT 180 

GTGTATAAGG GTAGACACAA AACTACAGGT CAAGTGGTAG CCATGAAAAA AATCAGACTA 240 

GAAAGTGAAG AGGAAGGGGT TCCTAGTACT GCAATTCGGG AAATTTCTCT ATTAAAGGAA 300 

CTTCGTCATC CAAATATAGT CAGTCTTCAG GATGTGCTTA TGCAGGATTC CAGGT TAT AT 360 

CTCATCTTTG AGTTTCTTTC CATGGAT CTG AAGAAATACT TGGATTCTAT CCCTCCTGGT 420 

CAGTACATGG ATTCTTCACT TGTTAAGAGT TATTTATACC AAATCCTACA GGGGATTGTG 480 

TTTTGTCACT CTAGAAGAGT TCTTCACAGA GACTTAAAAC CTCAAAATCT CTTGATTGAT 540 

GACAAAGGAA CAATTAAACT GGCTGATTTT GGCCTTGCCA GAGCTTTTGG AATACCTATC 600 

AGAGTATATA CACATGAGGT AGTAACACTC TGGTACAGAT CTCCAGAAGT ATTGCTGGGG 660 

TCAGCTCGTT ACTCAACTCC AGTTGACATT TGGAGTATAG GCACCATATT TGCTGAACTA 720 

GCAACTAAGA AACCACT TTT CCATGGGGAT TCAGAAATTG ATCAACTCTT CAGGATTTTC 780 

AGAGCTTTGG GCACTCCCAA TAATGAAGTG TGGCCAGAAG TGGAATCTTT ACAGGACTAT 840 

AAGAATACAT TTCCCAAATG GAAACCAGGA AGCCTAGCAT CCCATGT CAA AAACTTGGAT 900 

GAAAATGGCT TGGATTTGCT CTCGAAAATG TTAATCTATG ATCCAGCCAA ACGAATTTCT 960 

GGCAAAATGG CACTGAATCA TCCATATTTT AATGATTTGG ACAATCAGAT TAAGAAGATG 1020 

TAGCTTTCTG ACAAAAAGTT TCCATATGTT ATGTCAACAG ATAGTTGTGT TTTTATTGTT 10 80 

AACTCTTGTC TATTTTTGTC T TAT AT AT AT TTCTTTGTTA TCAAACTTCA GCTGTACTTC 1140 

GTCTTCTAAT TTCAAAAATA TAACTTAAAA ATGTAAATAT TCTATATGAA TTTAAATATA 12 00 
ATTCTGTAAA TGTGAAAAAA AAAAAAAAAA AAAAA 



Seq ID NO: 240 Protein sequence: 
Protein Accession #: NP_001777.1 

1 11 21 31 41 51 

I | I I I I 

MEDYTKI EKI GEGTYGWYK GRHKTTGQW AMKKIRLESE EEGVP STAIR EISLLKELRH 60 
PNIVSLQDVL MQDSRLYLIF EFLSMDLKKY LDSIPPGQYM DSSLVKSYLY QILQGIVFCH 120 



280 



WO 02/086443 

SRRVLHRDLK PQNLLIDDKG TIKLADFGLA RAFGIPIRVY THEWTLWYR SPEVLLGSAR 
YSTPVDIWSI GT I FAELATK KPLFHGDSEI DQLFRIFRAL GTPNNEVWPE VESLQDYKNT 
FPKWKPGSLA SHVKNLDENG LDLLSKMLIY DPAKRISGKM ALNHPYFNDL DNQIKKM 



180 
240 
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Seq ID NO: 241 DNA sequence 

Nucleic Acid Accession #: NM_033379.1 

Coding sequence: 132-854 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



CGCCCGCGCG 
GCTTTGCAGA 
ATTGACTAAC 
TTGTGTATAA 
TAGAAAGTGA 
AACTTCGTCA 
ATCTCATCTT 
GTCAGTACAT 
TATTGCTGGG 
TTGCTGAACT 
TCAGGATTTT 
TACAGGACTA 
AAAACTTGGA 
AACGAATTTC 
TTAAGAAGAT 
TTTTTATTGT 
AGCTGTACTT 
ATTTAAATAT 



11 
I 

CGGGCTCAAC 
GAGCGCCCTC 
TATGGAAGAT 
GGGTAGACAC 
AGAGGAAGGG 
TCCAAATATA 
TGAGTTTCTT 
GGATTCTTCA 
GTCAGCTCGT 
AGCAACTAAG 
CAGAGCTTTG 
TAAGAATACA 
TGAAAATGGC 
TGGCAAAATG 
GTAGCTTTCT 
TAACTCTTGT 
CGTCTTCTAA 
AATTCTGTAA 



21 

1 

TTTGTAGAGC 
CAGGGACTAT 
TATACCAAAA 
AAAACTACAG 
GTTCCTAGTA 
GTCAGTCTTC 
TCCATGGATC 
CTTGTTAAGG 
TACT CAACTC 
AAACCACTTT 
GGCACTCCCA 
TTTCCCAAAT 
TTGGATTTGC 
GCACTGAATC 
GACAAAAAGT 
CTATTTTTGT 
TTT CAAAAAT 
ATGTGAAAAA 



31 
I 

GAGGGGCCAA 
GCGTGCGGGG 
TAGAGAAAAT 
GT CAAGTGGT 
CTGCAATTCG 
AGGATGTGCT 
TGAAGAAATA 
TAGTAACACT 
CAGTTGACAT 
TCCATGGGGA 
ATAATGAAGT 
GGAAACCAGG 
TCT CGAAAAT 
ATCCATATTT 
TTCCATATGT 
CTTATATATA 
ATAACTTAAA 
AAAAAAAAAA 



41 
I 

CTTGGCAGAG 
ACACGGGATC 
TGGAGAAGGT 
AGCCATGAAA 
GGAAATTTCT 
TATGCAGGAT 
CTTGGATTCT 
CTGGTACAGA 
TTGGAGTATA 
TTCAGAAATT 
GTGGCCAGAA 
AAGCCTAGCA 
GTTAATCTAT 
TAATGATTTG 
TATGTCAACA 

AATGTAAATA 
AAAAAA 



51 
I 

CGCGCGGCCA 
TACCCATACC 
ACCTATGGAG 
AAAATCAGAC 
CTATTAAAGG 
TCCAGGTTAT 
ATCCCTCCTG 
TCTCCAGAAG 
GGCACCATAT 
GATCAACTCT 
GTGGAATCTT 
TCCCATGTCA 
GAT CCAG CCA 
GACAAT CAGA 
GATAGTTGTG 
ATCAAACTTC 
TTCTATATGA 



Seq ID NO: 242 Protein sequence: 
Protein Accession #: NP_203698.1 

1 11 21 31 41 51 

I I I I I I 

MEDYTKIEKI GEGTYGWYK GRHKTTGQW AMKKI RLESE EEGVPSTAIR EISLLKELRH 

PNIVSLQDVL MQDSRLYLIF EFLSMDLKKY LDSIPPGQYM DSSLVKWTL WYRSPEVLLG 

SARYSTPVDI WSIGTIFAEL ATKKPLFHGD SEIDQLFRIF RALGTPNNEV WPEVESLQDY 

KNTFPKWKPG SLASHVKNLD ENGLDLLSKM LIYDPAKRIS GKMALNHPYF NDLDNQIKKM 

Seq ID NO: 243 DNA sequence 

Nucleic Acid Accession #: AF101051.1 

Coding sequence: 221-856 



GAGCAACCTC 
CGACCCAGAG 
GCGGGGCCCA 
ACCTGCCACC 
GCTGTTGGGC 
GCCCCAGTGG 
CGAGGGGCTG 
TGACTCCTTG 
CATCCTCCTG 
CTTGGAAGAC 
TCTTGCAGGT 
ATT CTATGAC 
TGGCTGGGCT 
CCGAAAAACA 
GAAAGACTAC 
GGACATTGAG 
GT ATGGT AT T 
AAACATGGCT 
TTGTATTACT 
TATATATAGA 
CTCATTATGT 
CCATATTGAT 
CAGTCAAATA 
CTAATTTACC 
TTATTTTTTA 
TTTCATTGGT 
AGCCAAGAAG 
GTGATAAATT 
TTTGCTTTGA 
CACAACTTTA 
ACCTTTTTGT 
TATATCTTCC 
GATAATCTGG 
TCTTTTTTCT 
AATATTAATT 
TTTATTTGCT 
CTTCATGTGA 
ACACATACCT 
AAACCTACGC 
ATTCTTTCAG 
TTTCCAGTCT 



11 
I 

AGCTTCTAGT 
CTTCTCCAGC 
GCCACCTTCG 
CCTGAGCCAG 
TTCATTCTCG 
AGGATTTACT 
TGGATGTCCT 
CTGAATCTGA 
GGAGTGATAG 
GATGAGGTGC 
CTGGCTATTT 
CCTATGACCC 
GCTGCTTCTC 
ACCTCTTACC 
GTGTGACACA 
ATACTATCAT 
ACAAAACAAA 
TAATCTTATT 
GCTTCCCATT 
TATGTATATA 
TGATACTAGC 
GAAGATGTTT 
TCATTTACTC 
AAGGATGAAT 
CCATAATCTT 
CTCTATCTCC 
AATTTATTAC 
CCTGTTGACC 
AAATATTTGT 
TTGATTGAAT 
TCCCCATTCC 
TAATAAGGTG 
TGACAAATAT 
ATCTGCCAAA 
AGTTTATATT 
CAGCTGGCTG 
TTCACTGCCT 
TCATGTGGTT 
ACATACCTTC 
CTGTGTCTGA 
GTACAGAATG 



21 
I 

ATCCAGACTC 
GGCGGCGCAG 
GGAGTCCGGG 
CGCGGGCGCC 
CCTTCCTGGG 
CCTATGCCGG 
GCGTGTCGCA 
GCAGCACATT 
CAATCTTTGT 
AGAAGATGAG 
TAGTTGCCAC 
CAGTCAATGC 
TCTGCCTTCT 
CAACACCAAG 
GAGGCAAAAG 
TAACATTAGG 
CAAACAAACA 
TTATCTTCTT 
GAGTAATCAT 
TACATGTTTT 
ATACTTAAAA 
ATTGGTATAT 
TTCTTCATTA 
TCTTTCAATT 
ATAGCACTTG 
TGAATCTAAC 
AAATCAGAAC 
TTCCCACACA 
CCAATTGAGT 
TTTTAAGCTA 
TTAATTGTAT 
TGGTCTGTTT 
TCTCTCTGTA 
TTGAGATAAT 
ACTCTCATTC 
AGACACTGAA 
TCCTCTCTCT 
CAGTGCCTTC 
ATGTGGCTCA 
CATGTTTGTG 
CTATTTCACT 



31 

I 

CAGCGCCGCC 
CGAGCAGGGC 
TTGCCCACCT 
CGAGCGAGTC 
ATGGATCGGC 
CGACAACATC 
GAGCACCGGG 
GCAAGCAACC 
GGCCACCGTT 
GATGGCTGTC 
AGCATGGTAT 
CAGGTACGAA 
GGGAGGTGCC 
GCCCTATCCA 
GAGAAAATCA 
ACCTTAGAAT 
AAAAACCCAT 
TCCTCAATAT 
ACTCAAATGG 
TCTATTAAAA 
TATCTCTAAA 
TTTCTTTTTC 
GCTTTGGGTG 
CTTCATGCGT 
CATCGTTATT 
ACATTTCATA 
TTTGGAGGCA 
ATCCCTGTAC 
AGCTGCATGC 
CTTATTCATA 
TGTTTTCCCA 
GTCTGAACAA 
GCTGTAAGCA 
GATACTTAAC 
TTTGAACATG 
GAAGTCACTG 
ACCAGTCTAT 
CTCTCTCTAC 
GTGCCTTCCT 
CTCTGTTCCA 
TG AG CAAG AT 



41 
I 

CCGGGCGCGG 
TCCCCGCCTT 
GCAAACTCTC 
ATGGCCAACG 
GCCATCGTCA 
GTGACCGCCC 
CAGATCCAGT 
CGTGCCTTGA 
GGCATGAAGT 
ATTGGGGGTG 
GGCAATAGAA 
TTTGGTCAGG 
CTACTTTGCT 
AAACCTGCAC 
TGTTGAAACA 
TTTGGGTATT 
GTGTTAAAAT 
AGGAGGGAAG 
GGGAAGGGGT 
ATAGACAGTA 
ATAGGTAAAT 
GTCCTTATAT 
CCTTTGCCAC 
GCCCTTTTCA 
AAGCCCTTAT 
GCCTACATTT 
AATCTTTCTG 
TCTGACCCAT 
TGTTCCCCCA 
GTTTTATATC 
AGTGTAATTA 
AGTGCTAGAC 
AGTCACTTAA 
CAGTTAGAAG 
AACTATGCCT 
AACAAAACCT 
TTCCACTGAA 
CAGTCTATTT 
CTCTCTACCA 
TTTTAACAAC 
GATGTATGGA 



51 
I 

ACCCCAACCC 
AACTTCCTCC 
CGCCTTCTGC 
CGGGGCTGCA 
GCACTGCCCT 
AGGCCATGTA 
GCAAAGTCTT 
TGGTGGTTGG 
GTATGAAGTG 
CGATATTTCT 
TCGTTCAAGA 
CTCTCTTCAC 
GTTCCTGTCC 
CTTCCAGCGG 
AACCGAAAAT 
GTAATCTGAA 
ACTCAGTGCT 
ATTTTACCAT 
GCTCCTTAAA 
AAATACTATT 
GTATTTAATT 
ACATATGTAA 
AAGACCTAGC 
TATACTTATT 
TTGTTTTGTG 
TAGTTTCTAA 
CATGACCAAA 
AGCACTCTTG 
GGTGTTGTAA 
CCCCTAAACT 
TCATGCGTTT 
TTTCTGGAGT 
TCTTTCTACC 
AGGTAGTGTG 
ATGTAGTGTC 
ACACACGTAC 
CAAAACCTAC 
CCACTGAACA 
GTCTATTTCC 
TGCTCTTACT 
AAGGGTGTTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
72 0 
780 
840 
900 
960 
1020 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



281 
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GCACTGGTGT CTGGAGACCT GGATTTGAGT CTTGGTGCTA TCAATCACCG TCTGTGTTTG 252 0 

AGCAAGGCAT TTGGCTGCTG TAAGCTTATT GCTTCATCTG TAAGCGGTGG TTTGTAATTC 2580 

CTGATCTTCC CACCTCACAG TGATGTTGTG GGGAT CCAGT GAGATAGAAT ACATGTAAGT 2640 

GTGGTTTTGT AATTTGAAAA GTGCTATACT AAGGGAAAGA ATTGAGGAAT TAACTGCATA 2700 

5 CGTTTTGGTG TTGCTTTTCA AATGTTTGAA AATAAAAAAA TGTTAAGAAA TGGGTTTCTT 2760 

GCCTTAACCA GTCTCTCAAG TGATGAGACA GTGAAGTAAA ATTGAGTGCA CTAAACGAAT 2820 

AAGATTCTGA GGAAGTCTTA TCTTCTGCAG TGAGTATGGC CCAATGCTTT CTGTGGCTAA 2 880 

ACAGATGTAA TGGGAAGAAA TAAAAGCCTA CGTGTTGGTA AATCCAACAG CAAGGGAGAT 2 940 

TTTTGAATCA TAATAACTCA TAAGGTGCTA TCTGTTCAGT GATGCCCTCA GAGCTCTTGC 30 00 

10 TGTTAGCTGG CAGCTGACGC TGCTAGGATA GTTAGTTTGG AAATGGTACT TCATAATAAA 3060 

CTACACAAGG AAAGTCAGCC ACCGTGTCTT ATGAGGAATT GGACCTAATA AATTTTAGTG 3120 

TGCCTTCCAA ACCTGAGAAT ATATGCTTTT GGAAGTTAAA ATTTAAATGG CTTTTGCCAC 3180 

ATACATAGAT CTTCATGATG TGTGAGTGTA ATTCCATGTG GATATCAGTT ACCAAACATT 3240 

ACAAAAAAAT TTTATGGCCC AAAATGACCA ACGAAATTGT TACAATAGAA TTTATCCAAT 3300 

15 TTTGATCTTT TTATATTCTT CTACCACACC TGGAAACAGA CCAATAGACA TTTTGGGGTT 33 60 

TTATAATGGG AATTTGTATA AAGCATTACT CTTTTTCAAT AAATTGTTTT TTAATTTAAA 3420 

AAAAGGAAAA AAAAAAAAAA AAA 

20 Seq ID NO: 244 Protein sequence: 
Protein Accession #: AAD16433.1 

1 11 21 31 41 51 

04T I 1 1 1 1 1 

ZJ MANAGLQIjLG FILAFLGWIG AIVSTALPQW RIYSYAGDNI VTAQAMYEGL WMSCVSQSTG 60 

QIQCKVFDSL LNLSSTLQAT RALMWG I I>L GVIAIFVATV GMKCMKCLED DEVQKMRMAV 12 0 

IGGAIFLLAG LAILVATAWY GNRIVQEFYD PMTPVNARYE FGQALFTGWA AASLCLLGGA 180 
LLCCSCPRKT TSYPTPRPYP KPAPSSGKDY V 

30 Seq ID NO: 245 DNA sequence 

Nucleic Acid Accession #: CAT cluster 

1 11 21 31 41 51 

I 1 I i I 1 

TTTTTTTTTT TTTTTTTTTT TTTTTCAAGG AGAGCACAAG GAACTTTATT AATGACTTTC 60 

TTAATGGTTA AATGCTGTTT ACCAAGTGAC CCAGAGGCAG CGTGGTTTAG TGGTTTCAAC 120 

AGCATGGTCC CGAGAGTCTG ACAAACCTCA GTTCAAATCC TTCTTTTGTC TTCACTTAGT 180 

TTTTCTTCCT GAGATTTAGT TTCTTCATCG TTAACAATGA GGATATTAAT ATGTTTCACA 240 

CAGTTGTTAT GAAGAATG C A TATATTAGAA TGCCTGTAGT CTCAGCTACT CAGGAGGCTA 3 00 

40 AGGTGGGGAG GTCGCTCAAG CCCAGGAATT CAAAGCTGCA ATGCATTATG ATTACAGCTG 360 

TTAATAGCCA CTGCACTTCA GCCTGGGCAA TGTAGTAAGA TCCCATCTCT GGCTCGGAGG 42 0 
GTCCTACGCC CACGGAGTCT CGCTGATTGC TAGCACAGCA GTCTGAGATC AAACTGCA 

Seq ID NO: 246 DNA sequence 
45 Nucleic Acid Accession #: XM_058553.2 
Coding sequence: 897-1400 

1 11 21 31 41 51 

, n 1 I I I I ! 

DU AATTTTCAGA AGTTTCGTAT GGGGATGGTT TTATATAAAT TCAGGTTTTT CCCACAATAA 60 

TAAATGTATT TAGTCTCAGT GCTCAATAGA AGAGATTTCT AATAGAAAAG GATTCAAACT 12 0 

GTGAAACCAT TTCTCTTTTA ATGTTTCACA TTCCTGTTAC AGATTTGTTC TCTTGTGACT 180 

CTGTTAT CCA TAATATGGAC AGTTCTTGAG TCCTAACATT GAGAGGTTTT CCCTTAGTGC 240 

ATAGAGGGAA TGAGTATTAA TTGGAGAAGC TTAAAGTATT GCCACTTTAG CACTGAAGAT 300 

55 TGGGATGAGA GGAGGTGAAA CCTCACTAGA AAAAGGGACA ATGTTAGTGT GGCCCTTCCT 360 

GATCATGTTT AAGAAAAGTC ATGAAAATGG TGAACTAGTG TTTCCAAGCA TATTGGAAGG 420 

GTTGAGTGTA TACTGTCTGT CAAAGACTTC CAGCATTTCC AGGTCCTAGA GAGGAACAAG 480 

ACTGGTAACC TGCCTATCTG TATTTTTAAG AACCCAGGAG GAAAGCTTTA TAATAGAACA 540 

TTATTTCTGT GTTTATGTAT AAGGGGTTTT TTGTTTTTTT AAAGACAGGA TCTCACTCCA 600 

60 TTGTCCAGGC CAAGTGCAAT GGCACGAACC TCATAGCTCC TGGACTTAAG TGATCTGCCT 660 

GCCTTTGCCT CCTGAGTAGC TGGGACTACA GGCATGAGCC CCCATGCCTG GCTAAGTTTG 720 

TTTTTTTGTT TGTTTGTTTG TTTGTTTTTG GGGGGGGTTG TTTTGTTTTT TGTAGAGACG 780 

TAGTCTTGCT TTGTTGCCAG GCTAGTCTCA AACTCCTGGC TTCAAGTGAT CCTCCTGCCT 840 

CAGCCTCCCA GAGTGCTAGG ATTACAGCAC TTGGATTCAG CTTCTTCATT TCCAACATGG 90 0 

65 AAGAAACTTA CACCGACTCC CTGGACCCTG AGAAGCTATT GCAATGCCCC TATGACAAAA 960 

ACCATCAAAT CAGGGCTTGC AGGTTTCCTT ATCATCTTAT CAAGTGCAGA AAGAATCATC 102 0 

CTGATGTTGC AAGCAAATTG GCTACTTGTC CCTTCAATGC TCGCCACCAG GTTCCTCGAG 10 8 0 

CTGAAATTAG T CATCATAT C TCAAGCTGTG ATGACAGAAG TTGTATTGAG CAAGATGTTG 1140 

TCAACCAAAC CAGGAGCCTT AGACAAGAGA CTCTGGCTGA GAGCACTTGG CAGTGCCCTC 1200 

70 CTTGCGATGA AGACTGGGAT AAAGATTTGT GGGAGCAGAC CAGCACCCCA TTTGTCTGGG 1260 

GCACAACTCA CTACTCTGAC AACAACAGCC CTGCGAGCAA CATAGTTACA GAACATAAGA 1320 

ATAACCTGGC TTCAGGCATG CGAGTTCCCA AATCTCTGCC GTATGTTCTG CCATGGAAAA 13 80 

ACAATGGAAA TGCACAGTAA CTGAATACCT ATCTCATCAA ATGCCAGACC CTAGAAGACT 1440 

' GTTGCTTCTT CTTCTACCAG TGGGTTCTCA TTTTCCTCCT AATCTAATTA TAGAATGGTA 1500 

75 AACTCCCTGT GACTTTCCAA ACTGACAAGC ACACTTTTTT CCTCCCCCCT TGAATCCTCA 1560 

TTTAATGCAA GAACCCTCAT ACTCAGAAGC TTCCAAATAA ACCTTTGATA CAGATTG 



Seq ID NO: 247 Protein sequence: 
80 Protein Accession #: XP_058553.1 

1 11 21 31 41 51 

i I I i i i 

MEETYTDSLD PEKLLQCPYD KNHQIRACRF PYHLIKCRKN HPDVASKLAT CPFNARHQVP 60 

85 RAEISHHISS CDDRSCIEQD WNQTRSLRQ ETLAESTWQC PPCDEDWDKD LWEQTSTPFV 120 

WGTTHYSDNN SPASNIVTEH KNNLASGMRV PKSLPYVLPW KNNGNAQ 



282 



WO 02/086443 

Seq ID NO: 248 DNA sequence 
Nucleic Acid Accession #: NM_003392 
Coding sequence: 758..18S5 



PCT/US02/12476 
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TTAAGGAAAT 
AACTGATTAT 
CGCGCTGGTC 
CATCCTCCAC 
TCCACTCGCC 
GGCCCAGGTT 
GAGGGACTGA 
GGAAGGAGGC 
TCGCTGGCGT 
TGGGTGGATT 
GAGAAGCGCA 
TGTATGCTTG 
GTCCATTGGA 
CAAGTTCTTC 
CAATTCTTGG 
AGGAGCACAG 
CCACTTGTAT 
ATGCCAGTAT 
TTTTGGCAGG 
AGCAGGGGTG 
CTGCAGCCGC 
CGACAACATC 
GCGCATCCAC 
CGAGGCCGGC 
GTCCGGCTCA 
TGATGCCCTG 
GTTGGTACAG 
CCCCAGCCCT 
CCTGTGCAAC 
GTACGACCAG 
CTACGTCAAG 
GCCACCCAGC 
TGGTTTTTGG 
TTTTTTCCTG 
GGCAATAATG 
TACAAGACTT 
CTGTGTGGGA 
TGCCATCATA 
TTCAGCTTCT 
AAAACAAAAC 
CATTTTCAAA 
GGT AT AT C A C 
AATAGCTCAT 
CTCTTATGTC 
AAGACCCCCA 
ATGAAAT AT C 
TACATGAATC 
GCACTGCACC 
ACACTGAGCC 
GCAGCTCCAC 
TGGAAAACAG 
TAGTTTTCTA 
TATCACTGTT 
GTGTACCTTA 
GGTTTAATGG 
ATATATAAAT 
CTCTGGGGTT 
ATT CCAAAAG 
GCACGACGAA 
TGTTGGGTTG 
TTCTGTTCAC 
ATTCAAAACT 
AACCCATGCC 
CACTACATAG 
CAGG C CAT CA 
ACATCTTTTC 
CTCTTAATTT 
ATAATGATAT 
AAAGCACTAA 
TACTTTTTTT 
GTTGAGTTTA 
CATTCAGATA 
GTCTTGCGTG 
AATGGAAGAT 



CCGGGCTGCT 
GAAACATACG 
CCCGGGGCCT 
CCCCCGCGCT 
TCCCGTGCTC 
GGGGAGAGGA 
GCACAGCACC 
AGCGCCTGGC 
GCCCCGCGCA 
AATTTGGAAA 
GTCAATCAAC 
AAAATTATCT 
ATATTAAGCC 
CTAGTGGCTT 
TGGTCGCTAG 
CCTCTCTGCA 
CAGG AC C AC A 
CAATTCCGAC 
GTGATGCAGA 
GTGAACGCCA 
GCCGCGCGCC 
GACTATGGCT 
GCCAAGGGCT 
CGCAGGACGG 
TGTAGCCTGA 
AAGGAGAAGT 
GTCAACAGCC 
GACTACTGCG 
AAGACGTCGG 
TTCAAGACCG 
TGCAAGAAGT 
ACTCAGCCCC 
TTTTTAGAAA 
TTACCATCTA 
GGGGTGGGAA 
CTTTTGGATA 
CATGGTACAC 
TGGGATGGGT 
ATGACCAAAA 
AAAACAAAAC 
ATGATAATTT 
ATGTCT CAT T 
GAAATTTGGG 
AAGATGTTGA 
ATGATTCTGG 
CTGTATTTTC 
CCATTCACAG 
AGAGCAGACA 
CTCTCTGATT 
TGGGTCCCCT 
TTCACTACTT 
TGTTTTAATG 
ATGATCCTGT 
AAACTGTTCC 
TGCCTGATAT 
ATAAATATAA 
ATCTCT CTGT 
TTTTTTGAGT 
GCAACCTCGT 
AAGATATCTT 
TTTGTGGAGA 
CAGAAGCATC 
TATTAGAAAT 
ATAGCTTTTT 
AATATGTTCT 
ATACCCCCCC 
ATTGCATAAT 
TCACATCCCC 
TTAGTTTAAA 
ATTTGCTAAA 
ACAATCCTAG 
T T ATGT AT AT 
ATTTGTATAT 
AGAATATAAA 



21 

I 

CTTCCCCATC 
ATGTTAATTC 
CGCCCCCCAC 
GGCCACCCCG 
CTCTCGCCCA 
CGGAGGGTGG 
AACTAGAGAG 
ACCAGGGCTT 
CAGGATCCCA 
AAGAAACTGC 
AGTAAACTTA 
GAGAGGGAAT 
CAGGAGTTGC 
TGGCCATATT 
GTATGAATAA 
GCCAACTGGC 
TGCAGTACAT 
ATCGACGGTG 
TAGGCAGCCG 
TGAGCCGGGC 
CCAAGGACCT 
ACCGCTTTGC 
CCTACGAGAG 
TGTACAACCT 
AGACATGCTG 
ACGACAGCGC 
GCTTCAACTC 
TGCGCAATGA 
AGGGCATGGA 
TGCAGACGGA 
GCACGGAGAT 
GCTCCCAGGA 
TATTTTTTAT 
AGAACTCTGT 
CCACGAAAAA 
GTATAGAATG 
ATCCAGAAGG 
AGGTT CCAGT 
TGAGTTGTAA 
AAACCTCCCT 
ACAATGGAAG 
CTCCTCAAAT 
CAGCAGGGAG 
TTTGAAGCTG 
ACACTAGATT 
TTAGGGATAC 
GTTTCTCAGC 
ACCTATTTGA 
CCTCCGTGTT 
TTGGT TGTAG 
AGGGATTTTT 
ACAGAACTTG 
GTTTAGATTA 
CAGTGTACTT 
CTCAAAGTCT 
ATATATCTCA 
CTAGAGCATT 
CTTGAGCTTG 
TTCTGAGGAA 
TTTTTCTTTT 
GGGCATTACT 
AGCAATGTTT 
GACAGT AC T T 

TATCTCAGAC 
TTAGGAGGTT 
GATATCCACA 
TCAGTTGCAG 
ATGTCACTTT 
TCAGATTGTT 
CTTTTAAAAG 
CTTCTAGCCT 
TTCACTGGTT 
ATAAAACGTT 



31 



TGGAAGTGGC 
GGAGCTGCAT 
CCCCTGCCCT 
CCTCCTTGGC 
TGGAATTAAT 
CCGCAGCGGG 
GGGTCAGGGG 
TGACT CAAC A 
GCGAAAAT CA 
CTATATCTTG 
AGAGACCCCC 
AAACATCTTT 
TTTGGGGATG 
TTTCTCCTTC 
CCCTGTTCAG 
AGGACTTTCT 
CGGAGAAGGC 
GAACTGCAGC 
CGAGACGGCC 
GTGCCGCGAG 
GCCGCGGGAC 
CAAGGAGTTC 
TGCTCGCATC 
GGCTGATGTG 
GCTGCAGCTG 
GGCGGCCATG 
GCCCACCACA 
GAGCACCGGC 
TGGCTGCGAG 
GCGCTGCCAC 
CGTGGACCAG 
CCCGCT TATT 
TTTTCCCCAA 
GGTTTATTAT 
TATTTATTTT 
AAGGGGGAAA 
TAAAGAAATA 
TGAAAGAGGG 
ATTCTCTGGT 
TCCCCAGCAG 
GACAAGAATG 
ATTCCATTTG 
GAAAGTCCCC 
TTATAAGAAT 
TTTTGTTTGG 
TTGGTTAGTA 
CCAAGCAACA 
GGAAAAACAG 
GTGATGTGAT 
GACAGGAAAT 
GTTTCCTAAA 
GCTAATGGAA 
TCCACTCATG 
GAACAGTTGC 
TTTGTACATA 
TTGCAGCCAG 
GTTGTCCTTC 
GGCTGTGGCC 
GAAGCTTGAG 
CTGCCTCACC 
TGTTCGTTAT 
CTCTTTTCTT 
ATTAATTGAG 
TTTTTTTTAA 
TTACGTTGTT 
GGGCTTTCAT 
TCAGCCAACT 
TGAATTGTGA 
TTTGGTTTTT 
CCTTTTTAGT 
AAACTATTTA 
TTATTCTGTA 
TAAAAAACAA 
ACTTGTAAAA 



41 
! 

TTTCCCCACA 
TTCCCAGCTG 
TCCCTCCCGC 
AGCCTCTGGC 
TCTGGCTCCA 
TTCCTGAGTG 
GTGCGGGACT 
GAATTGAGAC 
GATTTCCTGG 
CCATCAAAAA 
GATGCTCCCC 
TCCTTCTTCC 
GCTGGAAGTG 
GCCCAGGTTG 
ATGTCAGAAG 
CAAGGACAGA 
GCGAAGACAG 
ACTGTGGATA 
TTCACATACG 
GGCGAGCTGT 
TGGCTCTGGG 
GTGGACGCCC 
CTCATGAACC 
GCCTGCAAGT 
GCAGACTTCC 
CGGCTCAACA 
CAAGACCTGG 
TCGCTGGGCA 
CTCATGTGCT 
TGCAAGTTCC 
TTTGTGTGCA 
TATAGAAAGT 
GAATTGCAAC 
TAATATTATA 
GTGGATCTTT 
TAACACATAC 
CATTTTCTTT 
TGGTAGAAAT 
GCAAGATAAA 
GGCTGCTAGC 
TCATATTCTC 
CAGACAGACC 
AG AAAT T AAA 
TGGGATTCCA 
GGAGGTTGGC 
AATTATAATA 
AGGTAATTGC 
TGAAATCCAC 
GCTGGCCACG 
GAAACATTAG 
ACTTTTATTT 
TTCACAGAGG 
CTTCTCCTAT 
ATTTATAAGG 
ACATATATAT 
TGATTTAGAT 
ACTGCAGTCC 
CCGCTGTGAT 
TTCTGACTCA 
CCTTTGTCTC 
AGACATGGAC 
AGTTCATTCT 
TCCCTAAGGA 
TAAGGACACC 
TTAAAAGTTT 
AT CACCTCAG 
GTGGCTCTTT 
GCAAAAGATC 
ATTATACAAA 
GACTCATGTT 
ATGTAAAATA 
CTTTTAATGT 
ACATCGAAAG 
AAAAAAAA 



51 



Seq ID NOt 249 Protein sequence: 
Protein Accession #: NP_003383 

1 11 21 

I I I 



41 
I 



TCGGCTCGTA 
GGCACTCTCG 
GTCCTGCCCC 
GGCAGCGCGC 
CTTGTTGCTC 
AATTACCCAG 
CGAGCGAGCA 
ACGTTTGTAA 
TGAGGTTGCG 
ACT CACGGAG 
TGGTTTAACT 
CTCT CCAGAA 
CAATGTCTTC 
TAATTGAAGC 
TATATATTAT 
AGAAACTGTG 
GCATCAAAGA 
ACACCTCTGT 
CCGTGAGCGC 
CCACCTGCGG 
GCGGCTGCGG 
GCGAGCGGGA 
TGCACAACAA 
GCCATGGGGT 
GCAAGGTGGG 
GCCGGGGCAA 
TCTACATCGA 
CGCAGGGCCG 
GCGGCCGTGG 
ACTGGTGCTG 
AGTAGTGGGT 
ACAGTGATTC 
CGGAACCATT 
ATTATTATTT 
GAAAAGGTAA 
CCTAACTTAG 
TTCTCAAATA 
CTATTCACAA 
AGGTCTTGGG 
TTGCTTTCTG 
AAGGAAAAAA 
GTCATATTCT 
AAATTTAAAA 
GATTTGTAAA 
TTGAACATAA 
GTAGAAATAA 
GTGCCATTCA 
CTTCCTCTTC 
TTTCCAAACG 
GAGCTCTGCT 
TGAGGAGCAG 
TGTTGCAGCG 
TGTACTGCAG 
GGGGAAATGT 
ATATATACAT 
TTACAGCTTA 
AGTTGGGATT 
CATACCCTGA 
CTGAAATGCG 
CAACCTCCAT 
GTTAAGAGAT 
GCAGAATGGA 
ATATTCAGCC 
TCTTTCCAAA 
GGAAAGATAC 
CCAACTGTGG 
AATTTATTGC 
TTGAAAGCAA 
AACCATGAAG 
TATGAAGAGA 
TTCTACATGT 
ACATATTTCT 
GCTTATTCCA 



51 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
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10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

MAGSAMSSKF FLVAIAIFFS FAQWIEANS WWSLGMNNPV QMSEVYIIGA QPLCSQLAGL 60 
SQGQKKLCHL YQDHMQ Y I GE GAKTGIKECQ YQFRHRRWNC STVDNTSVFG RVMQIGSRET 120 
AFTYAVSAAG WNAMSRACR EGELSTCGCS RAARPKDLPR DWLWGGCGDN IDYGYRFAKE 180 
FVDARERERI HAKGSYESAR ILMNLHNNEA GRRTVYNLAD VACKCHGVSG SCSLKTCWLQ 240 
IjADFRKVGDA LKEKYDSAAA MRLNSRGKLV QVNSRFNSPT TQDLVYIDPS PDYCVRNEST 300 
GSLGTQGRLC NKTSEGMDGC ELMCCGRGYD QFKTVQTERC HCKFHWCCYV KCKKCTEIVD 360 
QFVCK 

Seq ID NO: 250 DNA sequence 
Nucleic Acid Accession ft: NM_014058 
Coding sequence: 56.. 1324 



PCT/US02/12476 



i 
I 

TGACTTGGAT 
TCGGCCAGAT 
CGTCATCTTC 
GAGATATAAT 
ACTATATGCT 
TGAATCAATG 
TCAGGTTATC 
TAGATTTCAC 
TGAAAAGCTG 
AAAAATCAAC 
TAAAACTCTA 
GCCCTGGCAG 
TGCCACATGG 
GACTGCTTCC 
AATTGTCCAT 
TTCTAGCCCT 
TGAGTTTCAA 
TTACAGTCAA 
TGAACCTCAA 
AGGAAAAACA 
AGATATCTGG 
GCCTGGTGTT 
CTAAGAGAGA 
CCATTTTTAG 
AATAAACTGT 



11 

I 

GTAGACCTCG 
GTGGTGAGGG 
ATATCCCTGA 
CAAAAGAAGA 
GAGTTTGGCA 
GTGAAAAATG 
AAGTTCAGTC 
TCTACTGAGG 
CAAGATGCTG 
AAGACAGAAA 
GGTCAGAGTC 
GCTAGCCTGC 
CTTGTGAGTG 
TTTGGAGTAA 
GAAAAATACA 
GTTCCCTACA 
CCAGGTGATG 
AATCATCTTC 
GCTTACAATG 
GATGCATGCC 
TACCTTGCTG 
TATACTAGAG 
AAAGCCTCAT 
AGATACAGAA 
TTGCTTGATG 



21 
I 

ACCTTCACAG 
CTAGGAAAAG 
TTGTCCTGGC 
CCTACAATTA 
GAGAGGCTTC 
CATTTTATAA 
AACAGAAGCA 
ATCCTGAAAC 
TAGGACCCCC 
CAGACAGCTA 
TCAGGATCGT 
AGTGGGATGG 
CTGCTCACTG 
CAATAAAACC 
AACACCCATC 
CAAATGCAGT 
TGATGTTTGT 
GACAAGCACA 
ACGCCATAAC 
AGGGTGACTC 
GAATAGTGAG 
TTACGGCCTT 
GGAACAGATA 
TTGGAGAAGA 
CAAAAAAAAA 



31 
I 

GACTCTTCAT 
AGTTTGTTGG 
AGTGTGCATT 
CTATAGCACA 
TAACAATTTT 
ATCTCCATTA 
TGGAGTGTTG 
TGTAGATAAA 
TAAAGTAGAT 
TCTAAACCAT 
TGGTGGGACA 
GAGTCATCGC 
TTTTACAACA 
TTCGAAAATG 
ACATGACTAT 
ACATAGAGTT 
GACAGGATTT 
GGTGACTCTC 
TCCTAGAATG 
TGGAGGACCA 
CTGGGGAGAT 
GCGGGACTGG 
ACATTTTTTT 
CTTGCAAAAC 
A 



41 
I 

TGCTGGTTGG 
GAACCCTGGG 
GGACTCACTG 
TTGTCATTTA 
ACAGAAATGA 
AGGGAAGAAT 
GCTCATATGC 
ATTGTTCAAC 
CCTCACTCAG 
TGCTGCGGAA 
GAAGTAGAAG 
TGTGGAGCAA 
TATAAGAACC 
AAACGGGGTC 
GATATTTCTC 
TGTCTCCCTG 
GGAGCACTGA 
ATAGACGCTA 
TTATGTGCTG 
CTGGTTAGTT 
GAATGTGCGA 
ATTACTTCAA 
TTGTTTTTTG 
AGCTAGATTT 



51 
I 

CAATGATGTA 
TTATCGGCCT 
TTCATTATGT 
CAACTGACAA 
GCCAGAGACT 
TTGTCAAGTC 
TGTTGATTTG 
TTGTTTTACA 
TTAAAATTAA 
CACGAAGAAG 
AGGGTGAATG 
CCTTAATTAA 
CTGCCAGATG 
TCCGGAGAAT 
TTGCAGAGCT 
ATGCATCCTA 
AAAATGATGG 
CAACTTGCAA 
GCTCCTTAGA 
CAGATGCTAG 
AACCCAACAA 
AAACTGGTAT 
GGTGTGGAGG 
GACTGATCTC 



Seq ID NO: 251 Protein sequence: 
Protein Accession #: NP 054777 



11 



21 



31 



41 



51 



MYRPDWRAR 
DKLYAEFGRE 
ICRFHSTEDP 
RSKTLGQSLR 
RWTASFGVTI 
SYEFQPGDVM 
LEGKTDACQG 
GI 



KRVCWEPWVI 
ASNNFTEMSQ 
ETVDKIVQLV 
IVGGTEVEEG 
KPSKMKRGLR 
FVTGFGALKN 
DSGGPLVSSD 



GLVIFISLIV 
RLESMVKNAF 
LHEKLQDAVG 
EWPWQASLQW 
RI I VHEKYKH 
DGYSQNHLRQ 
ARDIWYLAGI 



LAVCIGLTVH 
YKSPLREEFV 
PPKVDPHSVK 
DGSHRCGATIi 
PSHDYDISLA 
AQVTLIDATT 
VSWGDECAKP 



Seq ID NO: 252 DNA sequence 

Nucleic Acid Accession #: NM_003504.2 

Coding sequence: 71-1771 



GGCACGAGGC 
CGCCGTGGCT 
GAGGGTCCTT 
GGCCTTGTTC 
ACTTGAAACT 
TGGAGCTAAT 
GTGTGACACC 
ACTCATTAAA 
AGAGGAGGAT 
CACACGGTTA 
GGAGGCCCGG 
GTCAGCCATG 
GTGGTGGGCC 
ATACGTGACT 
GGATGAGGAG 
CCTGGTGCTC 
AGCCAGGTTC 
CATGGGTCTT 
GGAGAATTTG 
CGTGCAGACT 
CTTTGCCACC 
CATCCAGGCT 
ACTCGCCAAG 
CCTCGTCATC 
CATGCTGTTC 
TGTGTGTTCG 



11 
I 

CTCGTGCCGC 
ATGTTCGTGT 
CTCTTCGTGG 
CAGTGTGACC 
GCATTTCTTG 
GTAGACCTAT 
CATAGGCCAG 
CAAGATGATG 
GAAGAGCATT 
GAAGAGGAGA 
AGAAGAGACA 
GTGATGTTTG 
ATCGTTGGAC 
GATGT TGGTG 
AACACACTCT 
TACCAGCACT 
AAGCTGTGGT 
CCCCTGAAGC 
CGGGAAATGA 
TTCAGCATTC 
ATGTCTTTGA 
CTGGACAGCC 
AAGCAGCTGC 
TCCCAGGGGC 
TCTAGGCCGG 
ACAAAGAACC 



21 

I 

CGGGCTCTTG 
CCGATTTCCG 
CCTCGGACGT 
ACGTGCAATA 
AGCATAAAGA 
TGGATATTCT 
TCAATGTCGT 
ACCTTGAAGT 
CAGGAAATGA 
TAGTGGAGCA 
TCCTCTTTGA 
AGCTGGCTTG 
TAACAGACCA 
TCCTGCAGCG 
CCGTGGACTG 
GGTCCCTCCA 
CTGTGCATGG 
AGGTGAAGCA 
TTGAAGAGTC 
ATTTTGGGTT 
TGGAGAGCCC 
TCTCCAGGAG 
GAGCCACCCA 
CTTTCCTGTA 
CATCCCTAAG 
GGCGCTGCAA 



31 
I 

GTACCTCAGC 
CAAAGAGTTC 
GGATGCTCTG 
TACGCTGGTT 
ACAGTTTCAT 
TCAACCTGAT 
CAATGTATAC 
TCCCGCCTAT 
CAGTGATGGG 
AACCATGCGG 
CTACGAGCAG 
GATGCTGTCC 
GTGGGTGCAA 
CCACGTTTCC 
CACACGGATC 
TGACAGCCTG 
ACAGAAGCGG 
GAAGTTCCAG 
TGCAAATAAA 
CAAGCACAAG 
CGAGAAGGAT 
TAACCTGGAC 
GCAGACCATT 
CTGCTCTCTC 
CCTGCTCAGC 
ACTGCTGCCC 



YVRYNQKKTY 
KSQVIKFSQQ 
IKKINKTETD 
INATWLVSAA 
ELSSPVPYTN 
CNE PQ AYNDA 
NKPGVYTRVT 



41 
I 

GCGAGCGCCA 
TACGAGGTGG 
TGTGCGTGCA 
CCAGTTTCTG 
TATTTTATTC 
GAAGACACTA 
AACGATAGCC 
GAAGACATCT 
TCAGAGCCTT 
AGGAGGCAGC 
TATGAATATC 
AAGGACCTGA 
GACAAGATCA 
CGCCACAACC 
TCCTTTGAGT 
TGCAACACCA 
CTCCAGGAGT 
GCCATGGACA 
TTTGGGATGA 
TTTCTGGCCA 
GGCTCAGGGA 
AAGCTGTACC 
GCCAGCTGCC 
ATGGAGGGCA 
AAACACCTGC 
C TGGTG ATGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



NYYSTLSFTT 
KHGVLAHMIiIi 
SYLNHCCGTR 
HCFTTYKNPA 
AVHRVCLPDA 
ITPRMLCAGS 
ALRDWITSKT 



51 
I 

GGCGTCCGGC 
TCCAGAGCCA 
AGATCCTTCA 
GGTGGCAAGA 
TCATAAACTG 
TATTCTTTGT 
AGATCAAATT 
TCAGGGATGA 
CTGAGAAGCG 
GGCGAGAGTG 
ATGGGACATC 
ATGACATGCT 
CTCAAATGAA 
ACCGGAACGA 
ATGACCTCCG 
GCTATACCGC 
TCCTTGCAGA 
TCTCCTTGAA 
AGGACATGCG 
GCGACGTGGT 
CAGATCACTT 
ATGGCCTGGA 
TTTGCACCAA 
CTCCAGATGT 
TCAAGTCCTT 
CTGCCCCCCT 



60 
120 
180 
240 
300 
360 
420 



60 
12 0 
180 
240 
300 
360 
420 
480 
540 
600 
660 
72 0 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



284 



WO 02/086443 

GAGCATGGAG CATGGCACAG TGACCGTGGT GGGCATCCCC CCAGAGACCG ACAGCTCGGA 
CAGGAAGAAC TTTTTTGGGA GGGCGTTTGA GAAGGCAGCG GAAAGCACCA GCTCCCGGAT 
GCTGCACAAC CATTTTGACC TCTCAGTAAT TGAGCTGAAA GCTGAGGATC GGAGCAAGTT 
TCTGGACGCA CTTATTTCCC TCCTGTCCTA GGAATTTGAT TCTTCCAGAA TGACCTT CTT 
5 ATTTATGTAA CTGGCTTTCA TTTAGATTGT AAGTTATGGA CATGATT TGA GATGTAGAAG 

CCATTTTTTA TTAAATAAAA TGCTTATTTT AGGCTCCGTC CCCAAAAAAA AAAAAAAAAA 
AAAAAAAAAA AA 

Seq ID NO: 253 Protein sequence: 
10 Protein Accession #: NP_003495.1 

1 11 21 31 41 51 

I I I -I I I 

MFVSDFRKEF YEWQSQRVli LFVASDVDAL CACKILQALF QCDHVQYTLV PVSGWQELET 60 

15 AFLEHKEQFH YFILINCGAN VDLLDILQPD EDTIFFVCDT HRPVNWNVY NDTQIKLLIK 120 

QDDDLEVPAY ED I FRDEEED EEHSGNDSDG SEPSEKRTRL EEEIVEQTMR RRQRREWEAR 180 

RRDILFDYEQ YE YHGT S SAM VMFELAWMLS KDLNDMLWWA IVGLTDQWVQ DKITQMKYVT 240 

DVGVLQRHVS RHNHRNEDEE NTLS VDCTR I SFEYDLRLVL YQHWSLHDSL CNTSYTAARF 3 00 

KLWSVHGQKR LQEFLADMGL PLKQVKQKFQ AMDISLKENL REMIEESANK FGMKDMRVQT 3 60 

20 FSIHFGFKHK FLASDWFAT MSLMESPEKD GSGTDHFIQA LDSLSRSNLD KLYHGLELAK 420 

KQLRATQQTI ASCLCTNLVI SQGPFLYCSL MEGTPDVMLF SRPASLSLLS KHLIiKSFVCS 480 

TKNRRCKbliP LVMAAPIjSME HGTVTWG I P PETDSSDRKN FFGRAFEKAA ESTSSRMLHN 540 
HFDLSVIELK AEDRSKFLDA LISLLS 

25 Seq ID NO: 254 DNA sequence 

Nucleic Acid Accession #: NM_022337 
Coding sequence: 48.. 683 

30 1 11 21 31 41 51 

I I I I I I 

GGCTGCGCTT CCCTGGTCAG GCACGGCACG TCTGGCCGGC CGCCAGGATG CAGGCCCCGC 60 

ACAAGGAGCA CCTGTACAAG TTGCTGGTGA TTGGCGACCT GGGCGTGGGG AAGACCAGTA 120 

TCATCAAGCG CTACGTGCAC CAGAACTTCT CCTCGCACTA CCGGGCCACA AT CGGCGTGG 180 

35 ACTTCGCGCT CAAGGTGCTC CACTGGGACC CGGAGACTGT GGTGCGCCTG CAGCTCTGGG 240 

ATATCGCAGG TCAAGAAAGA TTTGGAAACA TGACGAGGGT CTATTACCGA GAAGCTATGG 3 00 

GTGCATTTAT TGTCTTCGAT GT CACCAGGC CAGCCACATT TGAAGCAGTG GCAAAGTGGA 360 

AAAATGATTT GGACTCCAAG TTAAGTCTCC CTAATGGCAA ACCGGTTTCA GTGGTTTTGT 42 0 

TGGCCAACAA ATGTGACCAG GGGAAGGATG TGCT CATGAA CAATGGCCTC AAGATGGACC 480 

40 AGTTCTGCAA GGAGCACGGT TTCGTAGGAT GGTTTGAAAC ATCAGCAAAG GAAAATATAA 540 

ACATTGATGA AGCCT CCAGA TGCCTGGTGA AACACATACT TGCAAATGAG TGTGACCTAA 600 

TGGAGTCTAT TGAGCCGGAC GT CGTGAAGC CCCATCTCAC ATCAACCAAG GTTGCCAGCT 660 

GCTCTGGCTG TGCCAAATCC TAGTAGGCAC CTTTGCTGGT GTCTGGTAGG AATGACCTCA 720 

TTGTTCCACA AATTGTGCCT CTATTTTTAC CATTTTGGGT AAACGTCAGG ATAGATATAC 780 

45 CACATGTGGC AAG CCAAAG A TCTATGCCTC TGTTTTTTCA ATGAGAGAGA AATAGCAAAT 840 

GTTCTTTCTA TGCTTTCCTC ACCATCATCA CAGTGTTTAC AAACTTTTGA AAATATTTAG 900 

TCTGTTACAA ACTTCTGTCA TGTAGCTGAC CAAAATCCTG CAGGGCCACA GTCGGCACTG 960 

TTATTTGCTT CTTTTAATCA GCAAAGGCCT CAAGTCTTAA AATAAAAGGG GAGAAGAACA 102 0 

AACTAGCTGT CAAGTCAAGG ACTGGCTTTC ACCTTGCCCT GGTGTCTTTT TCCAGATTTC 1080 

50 AATATATTCT CTGATGGCCT GACAGGCCTA TTAAGTAGAT GTGATATTTT CTTCCAAGAT 1140 

GACCTCCATT CTCGGCAGAC CTAAGAGTTG CCTCTGAGTT AGCTCTTTGG AATCGTGAAC 1200 

ACAGGTGTGC TATATTGTCC TTGTCCTAAC TGTCACTTGC CATGGCCTGA ATGTTGGCTT 1260 

AACTGAATAT TGTATGAAAA GACATGCCTC CATATGTGCC TTTCTGTTAG CTCTCTTTGA 132 0 

CTCAAGCTGT GGGGCTCCTC TATACATGCT ATACATGTAA TATATATTAT ATATATTTTT 1380 

55 GCAAGTGAAC AATAAAACAT TAAAAGATAA AA 

Seq ID NO: 255 Protein sequence: 
Protein Accession #: NP_071732 

60 

1 11 21 31 41 51 

I I I I I 1 

MQAPHKEHLY KLLVI GDLGV GKTSIIKRYV HQNFSSHYRA TIGVDFALKV LHWDPETWR 60 
LQLWDIAGQE RFGNMTRVYY REAMGAFIVF DVTRPATFEA VAKWKNDLDS KLSLPNGKPV 12 0 
65 SWLLANKCD QGKDVLMNNG LKMDQFCKEH GFVGWFETSA KENINIDEAS RCLVKHILAN 180 

ECDLMESIEP DWKPHLTST KVASCSGCAK S 

Seq ID NO: 256 DNA sequence 
Nucleic Acid Accession #: NM_016321 
/U Coding sequence: 25.. 1464 

1 11 21 31 41 51 

| I I I I 1 

GGAACCGCCC GCTGCCAGCC CGGCCAGGCA CCCCTGCAGC ATGGCCTGGA ACACCAACCT 60 

75 CCGCTGGCGG CTGCCGCTCA CCTGCCTGCT CCTGCAGGTG ATTATGGTGA TTCTCTTCGG 12 0 

GGTGTTCGTG CGCTACGACT TCGAGGCCGA CGCCCACTGG TGGTCAGAGA GGACGCACAA 180 

GAACTTGAGC GACATGGAGA ACGAATTCTA CT AT CGCTAC CCAAGCTTCC AGGACGTGCA 240 

CGTGATGGTC TTCGTGGGCT TCGGCTTCCT CATGACTTTC CTGCAGCGCT ACGGCTTCAG 3 00 

CGCCGTGGGC TTCAACTTCC TGT TGGCAGC CTTCGGCATC CAGTGGGCGC TGCTCATGCA 3 60 

80 GGGCTGGTTC CACTTCTTAC AAGACCGCTA CATCGTCGTG GGCGTGGAGA ACCT CATGAA 42 0 

CGCTGACTTC TGCGTGGCCT CTGTCTGCGT GGCCTTTGGG GCAGTTCTGG GTAAAGTCAG 480 

CCCCATTCAG CTGCTCATCA TGACTTTCTT CCAAGTGACC CTCTTCGCTG TGAATGAGTT 540 

CATTCTCCTT AACCTGCTAA AGGTGAAGGA TGCAGGAGGC TCCATGACCA TCCACACATT 600 

TGGCGCCTAC TTTGGGCTCA CAGTGACCCG GATCCTCTAC CGACGCAACC TAGAGCAGAG 660 

85 CAAGGAGAGA CAGAATT CTG TGTACCAGTC GGACCTCTTT GCCATGATTG GCACCCTCTT 720 

CCTGTGGATG TACTGGCCCA GCTTCAACTC AGCCATATCC TACCATGGGG ACAGCCAGCA 780 

CCGAGCCGCC ATCAACACCT ACTGCTCCTT GGCAGCCTGC GTGCTTACCT CGGTGGCAAT 840 
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1620 
1680 
1740 
1800 
1860 
1920 
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WO 02/086443 

ATCCAGTGCC CTGCACAAGA AGGGCAAGCT GGACATGGTG CACATCCAGA ATGCCACGCT 900 

CGCAGGAGGG GTGG CCGTGG GTACCGCTGC TGAGATGATG CTCATGCCTT ACGGTGCCCT 960 

CATCAT CGGC TTCGTCTGCG GCATCATCTC CACCCTGGGT TTTGTATACC TGACCCCATT 102 0 

CCTGGAGTCC CGGCTGCACA TCCAGGACAC ATGTGGCATT AACAATCTGC ATGGCATTCC 1080 

TGGCAT CAT A GGCGGCATCG TGGGTGCTGT GACAGCGGCC TCCGCCAGCC TTGAAGTCTA 1140 

TGGAAAAGAA GGGCTTGTCC ATTCCTTTGA CTTTCAAGGT TTCAACGGGG ACTGGACCGC 12 00 

AAGAACACAG GGAAAGTTCC AGATTTATGG TCTCTTGGTG ACCCTGGCCA TGGCCCTGAT 12 60 

GGGTGGCATC ATTGTGGGGC TCATTTTGAG ATTACCATTC TGGGGACAAC CTTCAGATGA 1320 

GAACTGCTTT GAGGATGCGG TCTACTGGGA GATGCCTGAA GGGAACAGCA CTGT CTACAT 1380 

CCCTGAGGAC CCCACCTTCA AGCCCTCAGG ACCCTCAGTA CCCTCAGTAC CCATGGTGTC 1440 

CCCACTACCC ATGGCTTCCT CGGTACCCTT GGTACCCTAG GCTCCCAGGG CAGGTGAGGA 1500 

GCAGGCTCCA CAGACTSTCC TGGGGCCCAG AGGAGCTGGT GCTGACCTAG CTAGGGATGC 1560 

AAGAGTGAGC AAGCAGCACC CCCACCTGCT GGCTTGGCCT CAAGGTGCCT CCACCCCTGC 1620 

CCTCCCCTTC AT CCCAGGGG GTCTGMCTGA GAATGGAGAA GGAGAAGCTA CAAAGTGGGC 1680 

ATCCAAGCCG GGTTCTGGCT GCAGAAGTTC TGCCTCTGCC TGGGGTCTTG GCCACATTGG 1740 

AGAAAAACAG GCTCAAAGTG GGGCTGGGAC CTGGTGGGTG AACCTGAGCT CTCCCAGGAG 1800 

ACAACTTAGC TGCCAGTCAC CACCTATGAG GCTCTTCTAC CCCGTGCCTG CACCTCGGCC 1860 

AGCATCTCCT ATGCTCCCTG GGTCCCCCAG ACCTCTCTGT GTTGTGTGCG TGGCAGCCTC 1920 
CAGGAATAAA CATTCTTGTT GTCCTTTGTA AAAAAAAAAA AAAAAAAA 



Seq ID NO: 2 57 Protein sequence i 
Protein Accession #: NP_057405 



60 
120 



1 11 21 31 41 51 

I I I I I I 

MAWNTNLRWR LtPLTCLLLQV IMVILFGVFV RYDFEADAHW WSERTHKNLS DMENEFYYRY 
PSFQDVHVMV FVGFGFLMTF LQRYGFSAVG FNFLLAAFGI QWALLMQGWF HFIiQDRYIW 
GVENL INADF CVASVCVAFG AVLGKVSPIQ LLIMTFFQVT LFAVNEFILL NLLKVKDAGG 180 
SMTIHTFGAY FGLTVTRILY RRNLEQSKER QNSVYQSDLF AM I GTLFLWM YWPSFNSAIS 240 
YHGD SQHRAA INTYCSLAAC VLTSVAISSA LHKKGKLDMV H I QNATLAGG VAVGTAAEMM 300 
LMPYGALIIG FVCGIISTLG FVYLTPFLES RLHIQDTCGI NNLHGIPGII GGIVGAVTAA 360 
SASLEVYGKE GLVHSFDFQG FNGDWTARTQ GKFQIYGLLV TLAMALMGGI IVGLILRLPF 420 
WGQPSDENCF EDAVYWEMPE GNSTVYIPED PTFKPSGPSV PSVPMVSPLP MASSVPLVP 



Seq ID NO: 258 DNA sequence 

Nucleic Acid Accession #: NM_002358.2 

Coding sequence: 75.. 692 



1 11 21 31 41 51 

I I I I I I 

GGGAAGTGCT GTTGGAGCCG CTGTGGTTGC TGTCCGCGGA GTGGAAGCGC GTGCTTTTGT 60 

TTGTGTCCCT GGCCATGGCG CTGCAGCTCT CCCGGGAGCA GGGAATCACC CTGCGCGGGA 12 0 

GCGCCGAAAT CGTGGCCGAG TTCTTCTCAT TCGGCATCAA CAGCATTTTA TATCAGCGTG 180 

GCATATATCC ATCTGAAACC TTTACTCGAG TGCAGAAATA CGGACTCACC TTGCTTGTAA 240 

CTACTGATCT TGAGCTCATA AAATACCTAA ATAATGTGGT GGAACAACTG AAAGATTGGT 30 0 

TATACAAGTG TTCAGTTCAG AAACTGGTTG TAGTTATCTC AAATATTGAA AGTGGTGAGG 360 

TCCTGGAAAG ATGGCAGTTT GATATTGAGT GTGACAAGAC TGCAAAAGAT GACAGTGCAC 420 

CCAGAGAAAA GT CTCAG AAA GCTATCCAGG ATGAAATCCG TTCAGTGATC AGACAGATCA 480 

CAGCTACGGT GACATTTCTG CCACTGTTGG AAGTTTCTTG TTCATTTGAT CTGCTGATTT 540 

ATACAGACAA AGATTTGGTT GTACCTGAAA AATGGGAAGA GTCGGGACCA CAGTTTATTA 600 

CCAATTCTGA GGAAGTCCGC CTTCGTTCAT TTACTACTAC AATCCACAAA GTAAATAGCA 660 

TGGTGGCCTA CAAAATTCCT GTCAATGACT GAGGATGACA TGAGGAAAAT AATGTAATTG 720 

TAATTTTGAA ATGTGGTTTT CCTGAAATCA GGT CAT CTAT AGTTGATATG TTTTATTTCA 780 

TTGGTTAATT TTTACATGGA GAAAACCAAA ATGATACTTA CTGAACTGTG TGTAATTGTT ' 840 

CCTTTATTTT TTTGGTACCT ATTTGACTTA CCATGGAGTT AACATCATGA ATTTATTGCA 90 0 

CATTGTTCAA AAGGAACCAG GAGGTTTTTT TGTCAACATT GTGATGTATA TTCCTTTGAA 960 

GATAGTAACT GTAGATGGAA AAACTTGTGC TATAAAGCTA GATGCTTTCC TAAATCAGAT 102 0 

GTTTTGGTCA AGTAGTTTGA CTCAGTATAG GTAGGGAGAT ATTTAAGTAT AAAATACAAC 1080 

AAAGGAAGTC TAAATATTCA GAATCTTTGT TAAGGTCCTG AAAGTAACTC ATAATCTATA 114 0 

AACAATGAAA TATTGCTGTA TAGCTCCTTT TGACCTTCAT TTCATGTATA GTTTTCCCTA 120 0 

TTGAATCAGT TTCCAATTAT TTGACTTTAA TTTATGTAAC TTGAACCTAT GAAGCAATGG 1260 

ATATTTGTAC TGTTTAATGT TCTGTGATAC AGAACTCTTA AAAATGTTTT TTCATGTGTT 132 0 

TTATAAAATC AAGTTTTAAG TGAAAGTGAG GAAATAAAGT TAAGTTTGTT TTAAAAAAAA 13 80 
AAAAAAAAAA 



Seq ID NO: 25 9 Protein sequence: 
Protein Accession #: NP_002349.1 

1 ll 21 31 41 51 

I I I i I I 

MALQLSREQG ITLRGSAEIV AEFFSFGINS ILYQRGIYPS ETFTRVQKYG LTLLVTTDLE 60 
LIKYLNNWE QLKDWLYKCS VQKLWVISN IESGEVLERW QFD I ECDKTA KDDSAPREKS 120 
QKAIQDEIRS VIRQITATVT FLPLLEVSCS FDLLIYTDKD LWPEKWEES GPQFITNSEE 180 
VRLRSFTTTI HKVNSMVAYK IPVND 



Seq ID NO: 2 60 DNA sequence 
Nucleic Acid Accession #: NM_001211 
Coding sequence: 43.. 3195 



1 11 21 31 41 51 

I I I I I 1 

AAAGGCCTGC AGCAGGACGA GGACCTGAGC CAGGAATGCA GGATGGCGGC GGTGAAGAAG 60 

GAAGGGGGTG CTCTGAGTGA AGCCATGTCC CTGGAGGGAG ATGAATGGGA ACTGAGTAAA 120 

GAAAATGTAC AACCTTTAAG GCAAGGGCGG ATCATGT CCA CGCTTCAGGG AGCACTGGCA 180 

CAAGAAT CTG CCTGTAACAA TACTCTTCAG CAGCAGAAAC GGGCATTTGA ATATGAAATT 240 



286 



WO 02/086443 

CGATTTTACA CTGGAAATGA CCCTCTGGAT GTTTGGGATA GGTATAT CAG CTGGACAGAG 300 

CAGAACTATC CTCAAGGTGG GAAAGAGAGT AATATGTCAA CGTTATTAGA AAGAGCTGTA 3 60 

GAAGCACTAC AAGGAGAAAA ACGATATTAT AGTGATCCTC GATTTCTCAA TCTCTGGCTT 420 

AAATTAGGGC GTTTATGCAA TGAGCCTTTG GATATGTACA GTTACTTGCA CAACCAAGGG 480 

5 ATTGGTGTTT CACTTGCTCA GTTCTATATC TCATGGGCAG AAGAATATGA AGCTAGAGAA 540 

AACTTTAGGA AAGCAGATGC GATATTTCAG GAAGGGATTC AACAGAAGGC TGAACCACTA 600 

GAAAGACTAC AGTCCCAGCA CCGACAATTC CAAGCTCGAG TGTCTCGGCA AACTCTGTTG 660 

GCACTTGAGA AAGAAGAAGA GGAGGAAGTT TTTGAGTCTT CTGTACCACA ACGAAGCACA 720 

CTAGCTGAAC TAAAGAGCAA AGGGAAAAAG ACAGCAAGAG CTCCAATCAT CCGTGTAGGA 780 

10 GGTGCTCTCA AGGCTCCAAG CCAGAACAGA GGACTCCAAA ATCCATTTCC TCAACAGATG 840 

CAAAATAATA GTAGAATTAC TGTTTTTGAT GAAAATGCTG ATGAGGCTTC TACAGCAGAG 900 

TTGTCTAAGC CTACAGTCCA GCCATGGATA GCACCCCCCA TGCCCAGGGC CAAAGAGAAT 960 

GAGCTGCAAG CAGGCCCTTG GAACACAGGC AGGTCCTTGG AACACAGGCC TCGTGGCAAT 1020 

ACAGCTTCAC TGATAGCTGT ACCCGCTGTG CTTCCCAGTT TCACTCCATA TGTGGAAGAG 1080 

15 ACTGCACAAC AGCCAGTTAT GACACCATGT AAAATTGAAC CTAGTATAAA CCACATCCTA 1140 

AGCACCAGAA AGCCTGGAAA GGAAGAAGGA GATCCTCTAC AAAGGGTTCA GAGCCAT CAG 1200 

CAAGCGTCTG AGGAGAAGAA AGAGAAGATG ATGTATTGTA AGGAGAAGAT TTATGCAGGA 1260 

GTAGGGGAAT TCTCCTTTGA AGAAATTCGG GCTGAAGTTT TCCGGAAGAA ATTAAAAGAG 132 0 

CAAAGGGAAG CCGAGCTATT GACCAGTGCA GAGAAGAGAG CAGAAATGCA GAAACAGATT 13 80 

20 GAAGAGATGG AGAAGAAGCT AAAAGAAATC CAAACTACTC AGCAAGAAAG AACAGGTGAT 1440 

CAG CAAGAAG AGACGATGCC TACAAAGGAG ACAACTAAAC TGCAAATTGC TTCCGAGTCT 1500 

CAGAAAATAC CAGGAATGAC TCTATCCAGT TCTGTTTGTC AAGTAAACTG TTGTGCCAGA 15 60 

GAAACTTCAC TTGCGGAGAA CATTTGGCAG GAACAACCTC ATTCTAAAGG TCCCAGTGTA 1620 

CCTTTCTCCA TTTTTGATGA GTTTCTTCTT T C AG AAAAG A AGAATAAAAG TCCTCCTGCA 1680 

25 GATCCCCCAC GAGTTTTAGC TCAACGAAGA CCCCTTGCAG TTCTCAAAAC CTCAGAAAGC 1740 

ATCACCTCAA ATGAAGATGT GTCTCCAGAT GTTTGTGATG AATTTACAGG AATTGAACCC 1800 

TTGAGCGAGG ATGCC AT TAT CACAGGCTTC AGAAATGTAA CAATTTGTCC TAACCCAGAA 1860 

GACACTTGTG ACTTTGCCAG AGCAGCTCGT TTTGTATCCA CTCCTTTTCA TGAGATAATG 1920 

TCCTTGAAGG ATCTCCCTTC TGATCCTGAG AGACTGTTAC CGGAAGAAGA TCTAGATGTA 1980 

30 AAGACCTCTG AGGACCAGCA GACAGCTTGT GGCACTATCT ACAGTCAGAC TCTCAGCATC 2 04 0 

AAGAAGCTGA GCCCAATTAT TGAAGACAGT CGTGAAGCCA CACACTCCTC TGGCTTCTCT 2100 

GGTTCTTCTG CCTCGGTTGC AAGCACCTCC TCCATCAAAT GTCTTCAAAT TCCTGAGAAA 2160 

CTAGAACTTA CTAATGAGAC TTCAGAAAAC CCTACTCAGT CACCATGGTG TT CACAGT AT 222 0 

CGCAGACAGC TACTGAAGTC CCTACCAGAG TTAAGTGCCT CTGCAGAGTT GTGTATAGAA 2280 

35 GACAGACCAA TGCCTAAGTT GGAAATTGAG AAGGAAATTG AATTAGGTAA TGAGGATTAC 2340 

TGCATTAAAC GAGAATACCT AATATGTGAA GATTACAAGT TATTCTGGGT GGCGCCAAGA 2400 

AACTCTGCAG AATTAACAGT AATAAAGGTA TCTTCTCAAC CTGTCCCATG GGACTTTTAT 2460 

ATCAACCTCA AGTTAAAGGA ACGTTTAAAT GAAGATTTTG ATCATTTTTG CAGCTGTTAT 252 0 

CAATATCAAG ATGGCTGTAT TGTTTGGCAC CAATATATAA ACTGCTTCAC CCTTCAGGAT 2580 

40 CTTCTCCAAC ACAGTGAATA TATTACCCAT GAAATAACAG TGTTGATTAT TTATAACCTT 2640 

TTGACAATAG TGGAGATGCT ACACAAAGCA GAAATAGTCC ATGGTGACTT GAGTCCAAGG 27 00 

TGTCTGATTC TCAGAAACAG AATCCACGAT CCCTATGATT GTAACAAGAA CAATCAAGCT 2760 

TTGAAGATAG TGGACTTTTC CTACAGTGTT GACCTTAGGG TGCAGCTGGA TGTTTTTACC 2 820 

CTCAGCGGCT TTCGGACTGT ACAGATCCTG GAAGGACAAA AGATCCTGGC TAACTGTTCT 2 880 

45 TCTCCCTACC AGGTAGACCT GTTTGGTATA GCAGATTTAG CACATTTACT ATTGTTCAAG 2940 

GAACACCTAC AGGTCTTCTG GGATGGGTCC TTCTGGAAAC TTAGCCAAAA TATTTCTGAG 3 000 

CTAAAAGATG GTGAATTGTG GAATAAATTC TTTGTGCGGA TTCTGAATGC CAATGATGAG 3 060 

GCCACAGTGT CTGTTCTTGG GGAGCTTGCA G CAG AAATG A ATGGGGTTTT TGACACTACA 3120 

TTCCAAAGTC ACCTGAACAA AGCCTTATGG AAGGTAGGGA AGTTAACTAG TCCTGGGGCT 3180 

50 TTGCTCTTTC AGTGAGCTAG GCAAT CAAGT CTCACAGATT GCTGCCTCAG AGCAATGGTT 3240 

GTATTGTGGA ACACTGAAAC TGTATGTGCT GTAATTTAAT TTAGGACACA TTTAGATGCA 3300 

CTACCATTGC TGTTCTACTT TTTGGTACAG GTATATTTTG ACGTCACTGA TATTTTTTAT 3 3 60 

ACAGTGATAT ACTTACTCAT GGCCTTGTCT AACTTTTGTG AAGAACTATT TTATTCTAAA 3420 

CAGACTCATT ACAAATGGTT ACCTTGTTAT TTAACCCATT TGTCTCTACT TTTCCCTGTA 34 80 

55 CTTTTCCCAT TTGTAATTTG TAAAATGTT C TCTTATGATC ACCATGTATT TTGTAAATAA 3540 
TAAAATAGTA TCTGTTAAAA AAAAAAAAAA AAAAAAAAAA AAA 
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60 



65 



70 



75 



80 



Seq ID NO: 261 Protein sequence: 
Protein Accession #: NP_001202 



MAAVKKEGGA 
AFEYEIRFYT 
FLNLWLKLGR 
QKAEPLERLQ 
PIIRVGGALK 
PRAKENELQA 
SINHILSTRK 
RKKLKEQREA 
QIASESQKIP 
NKSPPADPPR 
ICPNPEDTCD 
SQTLSIKKLS 
PWCSQYRRQL 
FWVAPRNSAE 
CFTLQDLLQH 
NKNNQALKIV 
HLLLFKEHLQ 
GVFDTTFQSH 



11 
I 

LSEAMSLEGD 
GNDPLDVWDR 
LCNEPLDMYS 
SQHRQFQARV 
APSQNRGLQN 
GPWNTGRSLE 
PGKEEGDPLQ 
ELLTSAEKRA 
GMTLSSSVCQ 
VLAQRRPIiAV 
FARAARFVST 
PIIEDSREAT 
LKSLPELSAS 
LTVIKVSSQP 
SEYITHEITV 
DFSYSVDLRV 
VFWDGSFWKL 
LNKALWKVGK 



21 
i 

EWELSKENVQ 
YISWTEQNYP 
YLHNQGIGVS 
SRQTLLALEK 
PFPQQMQNNS 
HRPRGNTASL 
RVQSHQQASE 
EMQKQIEEME 
VNCCARETSL 
LKTSESITSN 
PFHEIMSLKD 
HSSGFSGSSA 
AELC I EDRPM 
VPWDFYINLK 
LIIYNLLTIV 
QLDVFTLSGF 
SQNISELKDG 
LTSPGALLFQ 



31 
I 

PLRQGRIMST 
QGGKESNMST 
LAQFYISWAE 
EEEEEVFESS 
RITVFDENAD 
IAVPAVLPSF 
EKKEKMMYCK 
KKLKEIQTTQ 
AENIWQEQPH 
EDVSPDVCDE 
LPSDPERIjLP 
SVASTSSIKC 
PKLEIEKEIE 
LKERLNEDFD 
EMLHKAEIVH 
RTVQILEGQK 
ELWNKFFVRI 



41 
I 

LQGALAQESA 
LLERAVEALQ 
EYEARENFRK 
VPQRSTLAEL 
EASTAELSKP 
TPYVEETAQQ 
EKIYAGVGEF 
QERTGDQQEE 
SKGPSVPFSI 
FTGIEPLSED 
EEDLDVKTSE 
LQIPEKLELT 
LGNEDYCIKR 
HFCSCYQYQD 
GDLSPRCLIL 
ILANCSSPYQ 
LNANDEATVS 



51 

I 

CNNTLQQQKR 
GEKRYYSDPR 
ADAIFQEGIQ 
KSKGKKTARA 
TVQPWIAPPM 
PVMTPCKIEP 
SFEEIRAEVF 
TMPTKETTKL 
FDEFLIiSEKK 
AIITGFRNVT 
DQQTACGTIY 
NETSENPTQS 
EYLICEDYKL 
GCIVWHQYIN 
RNRIHDPYDC 
VDLFGIADLA 
VLGELAAEMN 



60 
12 0 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



85 



Seq ID NO: 262 DNA sequence 
Nucleic Acid Accession #: NM_003784 
Coding sequence: 365.. 1507 



21 



287 




288 



WO 02/086443 



1 11 21 31 

III! 

MAAAAATKIL LCLPLLDLLS GWSRAGRADP HSLCYDITVI 
FLHYDCGNKT VTPVSPLGKK LNVTTAWKAQ NPVLRE WD I 
LQARMSCEQK AEGHSSGSWQ FSFDGQIFLL FDSEKRMWTT 
SFHYFSMGDC IGWLEDFLMG MDSTLEPSAG APLAMSSGTT 
FILPGI 

Seq ID NO: 2 66 DNA sequence 
Nucleic Acid Accession #: XM_084853.1 
Coding sequence: 127-444 

1 11 21 31 41 51 

i I I I i I 

ATTGATGATA TATTTAACGA AATCAAATTT GGTGAATATG TGGACACTGG AAAGCTAATC 
GACAAGATCA ACTTACCAGA TTTCCTAAAA GTGTACCTTA ACCACAAGCC ACCTTTTGGT 
AACACCATGA GTGGCATCCA CAAGAGCTTT GAGGTGCTCG GTTATACCAA CTCCAAAGGG 
AAAAAGG CCA TTCGAAGAGA GGACTTCCTG AGACTGCTCG TTACTAAAGG TGAG CATATG 
ACGGAGGAGG AGATGTTGGA TTGCTTTGCT TCACTGTTTG GCCTGAATCC CGAGGGATGG 
AAATCCGAGC CTGCAACCTG CTCCGTCAAA GGTTCAGAAA TTTGCCTTGA AGAAGAACTT 
CCAGACGAAA T CACTGCAG A AATATTCGCG ACTGAAATTC TTGGCTTAAC CATTTCAGAA 
GATTCCGGCC AGGATGGTCA GTGAAGTTAC CAGGAATGTT TAAAGCACAA AGGACTTTGG 
GTGTGTGTGC ATGCACATGT GTGTGTTTTC CATGAGGCAC TGCTTTTTAT GCATTTCCCT 
CCCCCCTCTC ATCTTTAGAA CATTTAGACA TTAAAG CAAG TTTCTGGTGA GCAATG 



41 51 

I I 
PKFRPGPRWC AVQGQVDEKT 
LTEQLRDIQL ENYTPKEPLT 
VHPGARKMKE KWENDKWAM 
QLRATATTL I LCCLLIILPC 



Seq ID NO: 267 Protein sequence: 
Protein Accession #: XP_084853.1 

1 11 21 31 41 51 

I I I I I I 

MSGIHKSFEV LGYTNSKGKK AIRREDFLRL LVTKGEHMTE EEMLDCFASL FGLNPEGWKS 
EPATCSVKGS EICLEEELPD EITAEIFATE ILGLTISEDS GQDGQ 

Seq ID NO: 268 DNA sequence 
Nucleic Acid Accession ft: NM_001898 
Coding sequence: 57-482 

1 11 21 31 41 51 

I I I I I 1 

GGCTCTCACC CTCCTCTCCT GCAGCTCCAG CTTTGTGCTC TGCCTCTGAG GAGACCATGG 
CCCAGTATCT GAGTACCCTG CTGCTCCTGC TGGCCACCCT AGCTGTGGCC CTGGCCTGGA 
GCCCCAAGGA GGAGGATAGG ATAAT CCCGG GTGGCATCTA TAACGCAGAC CTCAATGATG 
AGTGGGTACA GCGTGCCCTT CACTTCGCCA TCAGCGAGTA TAACAAGGCC ACCAAAGATG 
ACTACTACAG ACGTCCGCTG CGGGTACTAA GAGCCAGGCA ACAGACCGTT GGGGGGGTGA 
ATTACTTCTT CGACGTAGAG GTGGGCCGCA CCATATGTAC CAAGTCCCAG CCCAACTTGG 
ACACCTGTGC CTTCCATGAA CAGCCAGAAC TGCAGAAGAA ACAGTTGTGC TCTTTCGAGA 
TCTACGAAGT TCCCTGGGAG AACAGAAGGT CCCTGGTGAA ATCCAGGTGT CAAGAATCCT 
AGGGATCTGT GCCAGGCCAT TCGCACCAGC CACCACCCAC TCCCACCCCC TGTAGTGCTC 
CCACCCCTGG ACTGGTGGCC CCCACCCTGC GGGAGGCCTC CCCATGTGCC TGCGCCAAGA 
GACAGACAGA GAAGGCTGCA GGAGT CCTTT GTTGCTCAGC AGGGCGCTCT GCCCTCCCTC 
CTTCCTTCTT GCTTCTAATA GCCCTGGTAC ATGGTACACA CCCCCCCACC TCCTGCAATT 
AAACAGTAGC ATCGCC 

Seq ID NO: 269 Protein sequence: 
Protein Accession # :NP_001889 . 1 

1 11 21 

1 I I 

MAQYLSTLLL LLATLAVALiA WSPKEEDRII 
DDYYRRPLRV LRARQQTVGG VNYFFDVEVG 
EIYEVPWENR RSLVKSRCQE S 

Seq ID NO: 270 DNA sequence 
Nucleic Acid Accession #: XM_093210 
Coding sequence: 13-1854 

1 11 21 31 41 51 

I I I I I I 

ATGGCAAGCG CCGGAATCTC CTCAGCTGCC GTTTCACAAA AGAGGTACCA GGTCCGCACC 
AAACGAGCAC ACAAGCAGCA CCAGGAGCTG CAGAAGAAGG AGGCGGCAGC GATGGACCAG 
GGCAGAGGGA ATGGGGAGGG GGCATCCTAC CCCATATCTG AGGTGCGACT GCGGGACGTA 
GAGCGGACTG GGCCTTTCCC GTTGGCGCGT GGCCTCAATC AGGACTTCTT GCCCACGTGC 
GCCTTCAAAA CGGTAAGAGC TGCAACTGAA CGTGTGAGAC ATGGTGCAGA TAGGCTGAGA 
GGCGGCGGGA GAGATGCCCA TGAACTCAAG TACCCGGACA CGCCCTCCAC TTCTACCACC 
ACGAGTAACA CCGCCCCCAC GGGACCGCTC TCGAGGTCCC CCAAGCCAAG GACGCAAGGA 
GGAACGCCCC GGCGCGCGGC CAGCAGCGGC GGGCACCGGC CCAATGGCCA CGGAACTCAG 
CACTGGCAGT CGGCCCTCCT CACACCGCAG GCGTGCAGTG TGGCCGACGG AGCCTCCCGG 
GCCGAGGACC CAGCTAGGCC GTCACCCCGG TTGCTCCCAC GGGAAGGGGC ACCAGGCAAA 
CTGCCCAAGG CCCCGAGCCC AGGCTCCCTG GCGGAGGCCT CCGCTGGTCC CGCCCAGATC 
ATGGCCGCCA CCAGGCTCCC GAGCCATGGC TTCCTGTCCG GGAACGGCCC GGCGTCCTGG 
CTGTCCAGCT AG 

Seq ID NO: 271 Protein sequence: 
Protein Accession #: XP_093210 

1 11 21 31 41 51 



31 41 51 

1 I 1 

PGGIYNADLN DEWVQRALHF AISEYNKATK 

RTICTKSQPN LDTCAFHEQP ELQKKQLCSF 



289 



10 
15 

20 
25 
30 
35 
40 
45 
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I I 

MLRHGEQKRK RARKKWDFLP 
TTTSNTAPTG PLSRSPKPRT 
SRAEDPARPS PRLLPREGAP 
PNSSVGRKEE RPGAGQQRRA 
LPREGAPGKL PKAPSPGSLA 



1 I I I 

TCAFKTVRAA TERVRHGADR LRGGGRDAHE LKYPDTPSTS 60 

QGGTPRRRPA AAGTRANGHG TQHWQSALLT PQACSVADGA 120 

GKLPKAPSPG SLAEASAGLL AHVRLQNADA QRVSISQALP 180 

PAPMATELST GSRPSSHRRR AVWPTEPPGP RTQLEPSPRL 240 
EASAGPAQIM AATRLPSRGF LSGNGPASWL SS 



PCT/US02/12476 



Seq ID NO: 272 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..732 



GGATACTGTG 
TGAAAAAGCT 
TAATGTGGAG 
ATACCCACTT 
ATGATTTTGT 
TAAATTATTT 
TTAGTATCAC 
AAAATTG C AG 
TTAAGCC 



11 

I 

TCACTCAAAG 
TTTTTTCCCA 
GAAATTATTC 
GAAGCCTCTG 
CTTGTTTCTG 
TTATTTATCT 
AATTTATGGG 
AAGTCATAGG 



21 

1 

TAATGGGAGG 
CTTTTAACTT 
TTTCTCATTG 
TAGAAATGTC 
CAGTGAGAAA 
TTCATATAGT 
AGAGGGTTTT 
ACTGTCATGT 



31 
I 

GAGAGAGAAC 
GCTTTAGCGT 
GAGATTACAG 
TCGTCCTCCG 
TTACATCCAT 
TCTTACAATT 
TTGTATTTTT 
ATTGCAGCTC 



41 
I 

AGGGAGGGTA 
TAAGAGTACT 
AATATATCTA 
GTTGTATTTC 
AGCAAAGACA 
TCTAAAAAAT 
AAGCATATGT 
TGAGAACCAA 



51 
I 

GGGATGCTTT 
TACCAGCTAA 
TTCATCTTGA 
TAAAACCTAC 
AAAGTCTTTT 
TAACACTCAT 
GG CTTATATA 
TGCCTGAAAC 



Seq ID NO: 273 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 

1111 
MGGRENREGR DAFEKAFFPT FNLL 

Seq ID NO: 274 DNA sequence 

Nucleic Acid Accession ft: NM_003 976.2 

Coding sequence: 2 99-961 



41 
I 



51 



CTCTGAGCTT 
CATGGAGTTG 
CTACTTCTGC 
GGGTGGCAGG 
CAGGAGGGTG 
GGAACTTGGA 
TGCCCTGTGG 
GGGCTCCGCG 
CGCCGGCCAC 
GCCGCCGCAG 
CGGGGGCCGC 
GGGCTGCCGC 
CGACGAGCTG 
CGACCTCAGC 
GCCCGTCAGC 
CAACAGCACC 
AGGGCTCGCT 
CCTCCCGCAG 
AGGCCCCTAC 
CAGCCCCAGA 
GGAGCCCTTC 
CCCTCCTCTG 
ACAGCATTTG 
CCTGTACTCA 



11 
I 

CTCTGAGCCT 
TGAAAGAATA 
TGGGTTGAGT 
CCGGTCCCCC 
GGGGAACAGC 
CTTGGAGGCC 
CCCACCCTGG 
CCCCGCAGCC 
CTGCCGGGGG 
CCTTCTCGGC 
GCGGCGCGGG 
CTGCGCTCGC 
GTGCGTTTCC 
CTGGCCAGCC 
CAGCCCTGCT 
TGGAGAACCG 
CCAGGGCTTT 
AGTCCCACTA 
CGGTGGGTGA 
GCCCTCACCC 
GGACCCACTT 
ATGAACACTA 
AAGGACACAT 
CTCATGGGAG 



21 

I 

TGTTTGCTCA 

GCTGCAAAGC 
CTAGCTGTGT 
ACAAAAGATA 
TCAACAATGG 
TCTCCACGCT 
CCGCTCTGGC 
CTGCCCCCCG 
GACGCACGGC 
CCGCGCCCCC 
CTGGGGGCCC 
AGCTGGTGCC 
GCTTCTGCAG 
TACTGGGCGC 
GCCGACCCAC 
TGGACCGCCT 
GCAGACTGGA 
GCCAGCGGCC 
TGGATATCAT 
TGCGGATCCC 
CTCACAGACT 
CAGTGGCTGA 
ATTGCAGTTG 
CTGGCCCC 



31 

I 

TCTGGAAAAA 
ACCTAACACA 
AGGCCCCTTG 
ACTCATCTCT 
CTGATGGGCG 
GTCCCACTGC 
TCTGCTGAGC 
CGAAGGCCCC 
CCGCTGGTGC 
GCCGCCTGCA 
GGGCAGCCGC 
GGTGCGCGCG 
CGGCTCCTGC 
CGGGGCCCTG 
GCGCTACGAA 
CTCCGCCACC 
CCCTTACCGG 
TCAGCCAGGG 
CCCCGAACAG 
AGCCTAAAAG 
CTGGCACTGG 
GGCATCAGCC 
CTTGGTTGAA 



41 
I 

GGGGATTAAA 
TAGTAAGGTT 
TTCCTCACCT 
TAATTTGCAA 
CTCCTGGTGT 
CCCTGGCCTA 
AGCGTCGCAG 
CCGCCTGTCC 
AGTGGAAGAG 
CCCCCATCTG 
GCTCGGGCAG 
CTCGGCCTGG 
CGCCGCGCGC 
CGACCGCCCC 
GCGGTCTCCT 
GCCTGCGGCT 
TGGCTCTTCC 
ACGAAGGCCT 
GTGAAGGGAC 
ACACCAGAGA 
CCAGGCCTCG 
CCCGCCCAGG 
AGTGCCTGTG 



51 
I 

CCATTTACCT 
CCCAGTGCAG 
GGAGAAACTG 
GCTGCCTCAA 
TGATAGAGAT 
GGCGGCAGCC 
AGGCCTCCCT 
TGGCGTCCCC 
CCCGGCGGCC 
CTCTTCCCCG 
CGGGGGCGCG 
GCCACCGCTC 
GCTCTCCACA 
CGGGCTCCCG 
TCATGGACGT 
GCCTGGGCTG 
TGCCTGGGAC 
CAAAGCTGAG 
AACTGACTAG 
CCTCAGCTAT 
AACCTGGGAC 
CCCTGTAGGG 
CTGGAACTGG 



60 
120 
180 
240 
300 
360 
420 
480 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



65 
70 



80 
85 



Seq ID NO: 275 Protein sequence: 
Protein Accession #: NP_003967.1 

1 11 21 31 41 51 

I I I I I I 

MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS 

PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 

RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR L S AT ACGCLG 



Seq ID NO: 276 DNA sequence 
Nucleic Acid Accession #: NMJD57091.1 
75 Coding sequence: 783-1445 



ACTGGCCGCT 
GGACCCCCAA 
TCGCTCCCCG 
CGCGTGTCTA 
CTCCATATCC 
CAAGCTAGGG 
CGGGGCAGGG 
CACCGGACGG 
CAGACAAGGC 



11 

I 

GAGAGAAGAA 
ATCTGCACGT 
CCCTCACTCA 
CAAACTCAAC 
GAGGGGCCCC 
GGGACTGGAT 
GCGCTCCCAG 
CTGCGGCGGC 
CCGGGGGCTC 



21 
I 

TCGGGTGGAG 
ACCAGCAGTC 
CTTTCTCCCG 
TCCCGGTTTC 
TCCCAGCATC 
CCGACGGGTG 
CCCCACCCCG 
GGGCAGGAGG 
CGCCAGCAGC 



31 
! 

CAGAGAGCAG 
AGCCGCCCCA 
CCCTCGGCCC 
CGTGCCTCTC 
TACCCCCCTC 
GAGCAGCCAG 
GGATCTGGTG 
CTGCTGAGGG 
AGGTCCCTCG 



41 
I 

CTGCTGCAGG 
CGCAGGGACC 
GGCCTCCCAG 
CACCGCTCGA 
CCAACCTCGG 
GTGAGCCCCG 
ACGCTGGGGC 
ATGGAGTTGG 
GGCCCCAGCC 



51 
I 

GCAGACAGCC 
GGCTTACCCC 
CTCTCTACTT 
GTTCTCTACT 
GGGACCTAGC 
AAAGGTGGGG 
TGGAATTTGA 
GCCCGGCCCC 
CTCGCTGCCA 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 



290 
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CCCGGGCCTG GAGCCCCACA CCCGAGGGTG CAGACTGGCT GCCAAGGCCA CACTTTTGGC 600 

TAAAAGAGGC ACTGCCAGGT GTACAGTCCT GGGCATGCGC TGTTTGAGCT TCGGGGGAGA 660 

GCCCAGCACT GGT CCCCGG A AAGGTGCCTA GAAGAACAAG GTGCAGGACC CCGTGCTGCC 72 0 

TCAACAGGAG GGTGGGGGAA CAGCTCAACA ATGGCTGATG GGCGCTCCTG GTGTTGATAG 780 

AGATGGAACT TGGACTTGGA GGCCTCTCCA CGCTGTCCCA CTGCCCCTGG CCTAGGCGGC 840 

AGCCTGCCCT GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT 90 0 

CCCTGGGCTC CGCGCCCCGC AGCCCTGCCC CCCGCGAAGG CCCCCCGCCT GTCCTGGCGT 960 

CCCCCGCCGG CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC 102 0 

GGCCGCCGCC GCAGCCTTCT CGGCCCGCGC CCCCGCCGCC TGCACCCCCA TCTGCTCTTC 1080 

CCCGCGGGGG CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG 1140 

CGCGGGGCTG CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGCGCTCGGC CTGGGCCACC 1200 

GCTCCGACGA GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC 1260 

CACACGACCT CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCCGGGCT 1320 

CCCGGCCCGT CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGCGGTC TCCTTCATGG 13 80 

ACGT CAACAG CACCTGGAGA ACCGTGGACC GCCTCTCCGC CACCGCCTGC GGCTGCCTGG 1440 

GCTGAGGGCT CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC TTCCTGCCTG 150 0 

GGACCCTCCC GCAGAGTCCC ACTAGCCAGC GGCCTCAGCC AGGGACGAAG GCCTCAAAGC 1560 

TGAGAGGCCC CTACCGGTGG GTGATGGATA TCATCCCCGA ACAGGTGAAG GGACAACTGA 162 0 

CTAGCAGCCC CAGAGCCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAG 1680 

CTATGGAGCC CTTCGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG 1740 

GGACCCCTCC TCTGATGAAC ACTACAGTGG CTGAGG CAT C AGCCCCCGCC CAGGCCCTGT 1800 

AGGGACAGCA TTTGAAGGAC ACATATTGCA GTTGCTTGGT TGAAAGTGCC TGTGCTGGAA 1860 
CTGGCCTGTA CTCACTCATG GGAGCTGGCC CC 

Seq ID NO: 277 Protein sequence; 
Protein Accession #: NP_003 967.1 

1 11 21 31 41 51 

I | I I I I 

MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS 60 

PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 120 

RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 180 
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR LSATACGCLG 

Seq ID NO: 278 DNA sequence 

Nucleic Acid Accession #: NMJJ57160.1 

Coding sequence: 1-714 

1 11 21 31 41 51 

I I I I 1 I 

ATGCCCGGCC TGATCTCAGC CCGAGGACAG CCCCTCCTTG AGGTCCTTCC TCCCCAAGCC 60 

CACCTGGGTG CCCTCTTTCT CCCTGAGGCT CCACTTGGTC TCTCCGCGCA GCCTGCCCTG 120 

TGGCCCACCC TGGCCGCTCT GGCTCTGCTG AGCAG CGTCG CAGAGGCCTC CCTGGGCTCC 180 

GCGCCCCGCA GCCCTGCCCC CCGCGAAGGC CCCCCGCCTG TCCTGGCGTC CCCCGCCGGC 240 

CACCTGCCGG GGGGACGCAC GGCCCGCTGG TGCAGTGGAA GAGCCCGGCG GCCGCCGCCG 3 00 

CAGCCTTCTC GGCCCGCGCC CCCGCCGCCT GCACCCCCAT CTGCTCTTCC CCGCGGGGGC 3 60 

CGCGCGGCGC GGGCTGGGGG CCCGGGCAGC CGCGCTCGGG CAGCGGGGGC GCGGGGCTGC 42 0 

CGCCTGCGCT CGCAGCTGGT GCCGGTGCGC GCGCTCGGCC TGGGCCACCG CTCCGACGAG 4 80 

CTGGTGCGTT TCCGCTTCTG CAGCGGCTCC TGCCGCCGCG CGCGCTCTCC ACACGACCTC 540 

AGCCTGGCCA GCCTACTGGG CGCCGGGGCC CTGCGACCGC CCCCGGGCTC CCGGCCCGTC 600 

AGCCAGCCCT GCTGCCGACC CACGCGCTAC GAAGCGGTCT CCTTCATGGA CGTCAACAGC 660 

ACCTGGAGAA CCGTGGACCG CCTCTCCGCC ACCGCCTGCG GCTGCCTGGG CTGAGGGCTC 72 0 

GCTCCAGGGC TTTGCAGACT GGACCCTTAC CGGTGGCTCT TCCTGCCTGG GACCCTCCCG 780 

CAGAGTCCCA CTAGCCAGCG GCCTCAGCCA GGGACGAAGG CCTCAAAGCT GAGAGGCCCC 840 

TACCGGTGGG TGATGGATAT CATCCCCGAA CAGGTGAAGG GACAACTGAC TAGCAGCCCC 900 

AGAGCCCTCA CCCTGCGGAT CCCAGCCTAA AAGACACCAG AGACCTCAGC TATGGAGCCC 960 

TTCGGACCCA CTTCTCACAG ACTCTGGCAC TGGCCAGGCC TCGAACCTGG GACCCCTCCT 102 0 

CTGATGAACA CTACAGTGGC TGAGGCATCA GCCCCCGCCC AGGCCCTGTA GGGACAGCAT 1080 

TTGAAGGACA CATATTGCAG TTGCTTGGTT GAAAGTGCCT GTGCTGGAAC TGGCCTGTAC 1140 
TCACTCATGG GAGCTGGCCC C 

Seq ID NO: 2 79 Protein sequence: 
Protein Accession #: NP_476501.1 

1 11 21 31 41 51 

1 I I - I I I 

MPGLISARGQ PLLEVLPPOA HLGALFLPEA PLGLSAQPAL WPTLAALALL SSVAEASLGS 60 

APRSPAPREG PPPVLASPAG HLPGGRTARW CSGRARRPPP QPSRPAPPPP APPSALPRGG 12 0 

RAARAGGPGS RARAAGARGC RLRSQLVPVR ALGLGHRSDE LVRFRFCSGS CRRARSPHDL 180 
SLASLLGAGA LRPPPGSRPV SQPCCRPTRY EAVSFMDVNS TWRTVDRLSA TACGCLG 

Seq ID NO: 280 DNA sequence 

Nucleic Acid Accession #: NM_057090.1 

Coding sequence: 29-715 

1 11 21 31 41 51 

I I I I I 1 

CTGATGGGCG CTCCTGGTGT TGATAGAGAT GGAACTTGGA CTTGGAGGCC TCTCCACGCT 60 

GTCCCACTGC CCCTGGCCTA GGCGGCAGGC TCCACTTGGT CTCTCCGCGC AGCCTGCCCT 120 

GTGGCCCACC CTGGCCGCTC TGGCTCTGCT GAGCAGCGTC GCAGAGGCCT CCCTGGGCTC 180 

CGCGCCCCGC AGCCCTGCCC CCCGCGAAGG CCCCCCGCCT GTCCTGGCGT CCCCCGCCGG 240 

CCACCTGCCG GGGGGACGCA CGGCCCGCTG GTGCAGTGGA AGAGCCCGGC GGCCGCCGCC 3 00 

GCAGCCTTCT CGGCCCGCGC CCCCGCCGCC TGCACCCCCA TCTGCTCTTC CCCGCGGGGG 3 60 

CCGCGCGGCG CGGGCTGGGG GCCCGGGCAG CCGCGCTCGG GCAGCGGGGG CGCGGGGCTG 420 

CCGCCTGCGC TCGCAGCTGG TGCCGGTGCG CGCGCTCGGC CTGGGCCACC GCTCCGACGA 480 

GCTGGTGCGT TTCCGCTTCT GCAGCGGCTC CTGCCGCCGC GCGCGCTCTC CACACGACCT 540 

CAGCCTGGCC AGCCTACTGG GCGCCGGGGC CCTGCGACCG CCCCCGGGCT CCCGGCCCGT 600 

CAGCCAGCCC TGCTGCCGAC CCACGCGCTA CGAAGCGGTC TCCTTCATGG ACGTCAACAG 660 
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CACCTGGAGA ACCGTGGACC GCCTCTCCGC CACCGCCTGC GGCTGCCTGG GCTGAGGGCT 72 0 

CGCTCCAGGG CTTTGCAGAC TGGACCCTTA CCGGTGGCTC TTCCTGCCTG GGACCCTCCC 780 

GCAGAGTCCC ACTAG CCAGC GGCCTCAGCC AGGGACGAAG GCCTCAAAGC TGAGAGGCCC 840 

CTACCGGTGG GTGATGGATA TCATCCCCGA ACAGGTGAAG GGACAACTGA CTAGCAGCCC 900 

CAGAGCCCTC ACCCTGCGGA TCCCAGCCTA AAAGACACCA GAGACCTCAG CTATGGAGCC 960 

CTTCGGACCC ACTTCTCACA GACTCTGGCA CTGGCCAGGC CTCGAACCTG GGACCCCTCC 1020 

TCTGATGAAC ACTACAGTGG CTGAGGCATC AGCCCCCGCC CAGGCCCTGT AGGGACAGCA 1080 

TTTGAAGGAC ACATATTGCA GTTGCTTGGT TGAAAGTGCC TGTGCTGGAA CTGGCCTGTA 1140 
CTCACTCATG GGAGCTGGCC CC 

Seq ID NO: 281 Protein sequence: 
Protein Accession #: NP_476431.1 • 

1 11 21 31 41 51 

111)11 
MELGLGGLST L.SHCPVIPRRQ APLGIjSAQPA I>WPTLAALAIj IiSSVAEASLG SAPRSPAPRE 
GPPPVLASPA GHLPGGRTAR WCSGRARRPP PQPSRPAPPP PAPPSALPRG GRAARAGGPG 
SRARAAGARG CRLRSQfcVPV RALGLGHRSD ELVRFRFCSG SCRRARSPHD LSLASLLGAG 
ADRPPPGSRP VSQPCCRPTR YEAVS FMDVN STWRTVDRLS ATACGCLG 

Seq ID NO: 282 DNA sequence 
Nucleic Acid Accession Eos sequence 



PCT/US02/12476 



CTACTGCACC 
ATAGCAATTT 
CCGTGAACCT 
AGGTAACGTT 
GATCTAATCA 
GCTAAAAATG 
TATTCTTCAT 
CACTCTTAAG 
GAATCTCAGG 



11 
I 

TGCCCTCTGT 
CCTTTGGTTT 
TATGAATGCT 
TTCTTTTGCT 
ATGATCTAGT 
AACTTGAAAC 
GCTGGCTGTA 
GATTTTATGT 
ATACCTACAT 



21 
I 

TTCCTTTGGA 
TTAAGACTTC 
GCTTAAAAAT 
CTCATCTTGG 
CAGAAGCTAC 
ACGGAAGTAG 
ATAATAGCCA 
GTATTATTCA 
TTATCACTTT 



31 
I 

AATCTCTTAC 
TACATTGCTT 
AATGTCAAAA 
TTTCCATATA 
TTCACTGGCT 
TGGTTGGTCC 
TTATTTGTTA 
AATTGCTATT 
TTCAATATAT 



41 
I 

CTTTCATTAG 
TTTCTTTTAT 
TATGTTTTAG 
CTATTTTTGG 
AACAGTGATC 
AGT TTGAAAG 
TGCCTTTGTT 
ACTGT TCTTT 
ATGTATTTCT 



51 
I 

GGTTTCTTTC 
TATCTGTGCT 
CTGGCTACTC 
TTTTTTGTGA 
ATGTTCATGT 
CTCTTATTAG 
ATGT AG CAG A 
TTATAGTTGA 
TATT 



Seq ID NO: 283 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 564-1481 



GAGACTTTTA 
CAAAGAGGTC 
GAAGTGCTAC 
ACTTGGATCA 
CGACCGATGC 
CTTTCCCTGA 
GCAAACAAGG 
GGGGAGACAC 
AAGGGCTGGC 
CCTCCCGGCC 
GCCTGGCGCG 
ACTTGGCGGG 
CCGGTTTCTC 
CTATGACGGG 
CGCTGAGCTG 
AGTTTGACGA 
TGGATGAAGA 
AGGGGTGCGC 
GCTTACCAGA 
AACGTGACCG 
AAGAAATGGA 
TTCAGAAAAA 
AGGAAAAAGC 
ATCAAGAATG 
AAAACAACAG 
ATGGTTGGAA 
TGGAAAACTT 
AATTCCGTGG 
GAAGAGTAAA 
AGCCAGGAGC 
AATAACATGC 
CAGTCAATAA 
TGGAGTCCTT 
GTTGATCTTG 
ATTTAGCTTG 



11 

I 

ATCATCTATC 
AAAAATTCAC 
CAAAACACGC 
AGTTCCCTCC 
TCACAATTCT 
TTCAGTGGCA 
GACAGGCCCT 
CGTACTAAAC 
TAAGGGAGGC 
TCCACGCGTC 
CGCAGGGCCC 
CCAGATGTGG 
TGCTGACTGC 
CGCACGTGGG 
CAGCAATTCC 
GGACGACGGT 
TGCCCATGAT 
CAG CACTCAG 
AAGCCGCCTG 
GCTGCAACTG 
AGAACGTGAA 
GAATGAGCAA 
AG CAAAGGAA 
GTTAAAGAAA 
CAAGCTGAAA 
AATGCGAAAC 
ACAGGTTTTT 
AAACCAATTC 
AGACCTGTGA 
AATCTTTGCC 
TTTTATCTGG 
CTCAACACTT 
AGAGTTGAGG 
GCATGTTGTT 
TATTAAAAGT 



21 

i 

CCTTGTGCTT 
ATGTGTAGAC 
AATGACTGTC 
CCTCTCCTCA 
GACCTCGTAA 
GGTAACATAT 
CAAAGTTGTC 
AAGCTTGCAA 
CGGGGCGGAG 
GCCATGG CAA 
GCTGGGAAAG 
GCGGCGGGGC 
AGACCCAGGT 
CAGGGGCTGG 
ACCAGGTCGC 
GACGGGGAGG 
TCAGAGGCCA 
GTTGAATCAG 
ACACCATGGG 
AAAGCTCTAG 
AAAAGAAAGA 
AAAAGAAAAG 
CTGGAGAAAG 
AAAAATGCTG 
TACAGGAGAA 
ATAAACCTCG 
ACAGTGGAAA 
ATATGCCACC 
TAAGTCAGCC 
TTGGAACTCT 
AGCTATTTAA 
AATGTGATTA 
AAGATATTTT 
TTGCAGAATA 
ACACTGTAAT 



31 
I 

TACGCAGACC 
AAATTAGGTC 
CTAAAAGTGC 
AAATATATCG 
TTATATAGGG 
TTCATGTACA 
GGTAGGGAGC 
ACAGCAGGCA 
GAAGCCAAGC 
CGCGGGGTCT 
CGCGTCCCCG 
GCTGGGGGCC 
CTCGGCCCTC 
AGGTGGTGCG 
TGTTGTCTCC 
ATGAGGAAGA 
AAGTGGCGAG 
AAAATAACCA 
AGGTGTGGTT 
AGGAATTAAA 
TAATTGCTGA 
AAAGAGAACA 
AATACTTGCA 
AAGAATGTGA 
AAAGGAAATA 
TCCAGCTGCA 
TTCCTATCCA 
TCCCAAAGAA 
ACACAAGTCA 
GTGCAGAATA 
TTTAAAAATC 
TTGACAAATA 
CTGGATTTTG 
AGTGGCTGAA 
ACCAATAAAA 



41 
i 

CTACAATACA 
CCTTAAGATG 
GTTCTGGGAT 
ACTTGTGCTG 
GGTGGTTTTG 
AAATGAACTG 
CAGGACCCCG 
CCTTCCTGCC 
TCTGCAGGCC 
GTGCTGGCCG 
CCGCGGCTCC 
TACTTTTCCC 
CTCGGACTCC 
CTCGCCGTCG 
CCTTGGCCAC 
CGTGGATGAT 
CCTGAGAGGA 
AGAAGAACAG 
TATTGGCAAA 
TCAACAACTA 
AGAAAAGCAC 
AAAAATTAAT 
AGAAAAAGCA 
GAGGAAGAAG 
GCAGAAAAAA 
AAGAGCTATG 
GAACCAGCCT 
GCTAAGGATC 
TCATCTCTGG 
CAAAGATAGC 
AGAAATTGTT 
GCAATTTTTG 
GTTTTTATAA 
TATGTAAGAA 
CTAACAATTT 



51 

i 

CTAGAGGCTT 
CCAGGCAAAC 
ACACCTGTAA 
AAAGAAATCA 
GTTTCTGCGT 
CAACACCACG 
CCAGTGGCGT 
ACTGAGGAGG 
CTGACAAAGT 
GGATTGGCCG 
GCCAGTTTGA 
TCTTCCTACG 
TGCTCAGTCC 
CCGCCGCTGC 
CAGAGCTTCC 
GAGGAAGACG 
ATGGAGTTAC 
AAACAGGTGC 
GAAAAAGAAG 
GAAAAAAGAA 
AAGGAATGGG 
AAAGAAATGG 
AAAGAAAAAT 
AAAGAAAAGA 
AGTTTCAAGA 
GTTATGCCAA 
TTTATAATCC 
TATCAGGAAG 
TAATTCATAA 
GTATGTGGAA 
TTTTACTGCT 
CATTTGTATA 
ACTTTTTAAG 
TTGTGTTTGT 
TTCTTG 



Seq ID NO: 2 84 Protein sequence: 
Protein Accession #: Eos sequence 



1 
I 

MATRGLCWPG 
PRSRPSSDSC 
GEDEEDVDDE 
PWEVWFIGKE 
RKEREQKINK 



11 



21 



31 



51 
I 



60 
120 
180 



60 
12 0 
180 
240 
300 
360 
420 
480 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 



LAGLARAGPA GKARPRRGSA SLNLAGQMWA AGRWGPTFPS SYAGFSADCR 
SVPMTGARGQ GLEWRSPSP PLPLSCSNST RSLLSPLGHQ SFQFDEDDGD 
EDVDEDAHDS EAKVASLRGM ELQGCASTQV ESENNQEEQK QVRLPESRLT 
KEERDRLQDK ALEELNQQLE KRKEMEEREK RKI I AEEKHK EWVQKKNEQK 
EMEEKAAKEL EKEYLQEKAK EKYQEWLKKK NAEECERKKK EKKNNSKLKY 
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Seq ID NO: 2 85 DNA sequence 
Nucleic Acid Accession #: Eos sequence 
5 Coding sequence: 1-1746 

1 11 21 31 41 51 

I I I I I I 

ATGCCACTGA AGCATT AT CT CCTTTTGCTG GTGGGCTGCC AAGCCTGGGG TGCAGGGTTG 60 

10 GCCTACCATG GCTGCCCTAG CGAGTGTACC TGCTCCAGGG CCTCCCAGGT GGAGTGCACC 12 0 

GGGGCACGCA TTGTGGCGGT GCCCACCCCT CTGCCCTGGA ACGCCATGAG CCTGCAGATC 180 

CTCAACACGC ACATCACTGA ACTCAATGAG TCCCCGTTCC TCAATATCTC AGCCCTCATC 240 

GCCCTGAGGA TTGAGAAGAA TGAGCTGTCG CGCATCACGC CTGGGGCCTT CCGAAACCTG 3 00 

GGCTCGCTGC GCTATCTCAG CCTCGCCAAC AACAAGCTGC AGGTTCTGCC CATCGGCCTC 3 60 

15 TTCCAGGGCC TGGACAGCCT TGAGTCTCTC CTTCTGTCCA GTAACCAGCT GTTGCAGATC 42 0 

CAGCCGGCCC ACTTCTCCCA GTGCAGCAAC CTCAAGGAGC TGCAGTTGCA CGGCAACCAC 480 

CTGGAATACA TCCCTGACGG AGCCTTCGAC CACCTGGTAG GACT CACGAA GCTCAATCTG 540 

GGCAAGAATA GCCTCACCCA CATCTCACCC AGGGTCTTCC AGCACCTGGG CAATCTCCAG 600 

GTCCTCCGGC TGTATGAGAA CAGGCTCACG GATATCCCCA TGGGCACTTT TGATGGGCTT 660 

20 GTTAACCTGC AGGAACTGGC TCTACAGCAG AACCAGATTG GACTGCTCTC CCCTGGTCTC 720 

TTCCACAACA ACCACAACCT CCAGAGACTC TACCTGTCCA ACAACCACAT CTCCCAGCTG 780 

CCACCCAGCA TCTTCATGCA GCTGCCCCAG CTCAACCGTC TTACTCTCTT TGGGAATTCC 840 

CTGAAGGAGC TCTCTCTGGG GATCTTCGGG CCCATGCCCA ACCTGCGGGA GCTTTGGCTC 900 

TATGACAACC ACATCTCTTC TCTACCCGAC AATGTCTTCA GCAACCTCCG CCAGTTG CAG 960 

25 GTCCTGATTC TTAGCCGCAA TCAGATCAGC TTCATCTCCC CGGGTGCCTT CAACGGGCTA 102 0 

ACGGAGCTTC GGGAGCTGTC CCTCCACACC AACGCACTGC AGGACCTGGA CGGGAATGTC 1080 

TTCCGCATGT TGGCCAACCT GCAGAACATC TCCCTGCAGA ACAATCGCCT CAGACAGCTC 1140 

CCAGGGAATA TCTTCGCCAA CGTCAATGGC CTCATGGCCA TCCAGCTGCA GAACAACCAG 1200 

CTGGAGAACT TGCCCCTCGG CATCTTCGAT CACCTGGGGA AACTGTGTGA GCTGCGGCTG 1260 

30 TATGACAATC CCTGGAGGTG TG ACT CAG AC ATCCTTCCGC TCCGCAACTG GCTCCTGCTC 1320 

AACCAGCCTA GGTTAGGGAC GGACACTGTA CCTGTGTGTT TCAGCCCAGC CAATGTCCGA 1380 

GGCCAGTCCC TCATTATCAT CAATGTCAAC GTTGCTGTTC CAAGCGTCCA TGTCCCTGAG 1440 

GTGCCTAGTT ACCCAGAAAC ACCATGGTAC CCAGACACAC CCAGTTACCC TGACACCACA 150 0 

TCCGTCTCTT CTACCACTGA GCTAACCAGC CCTGTGGAAG ACTACACTGA TCTGACTACC 1560 

35 ATTCAGGTCA CTGATGACCG CAGCGTTTGG GGCATGACCC AGGCCCAGAG CGGGCTGGCC 162 0 

ATTGCCGCCA TTGTAATTGG CATTGTCGCC CTGGCCTGCT CCCTGGCTGC CTGCGTCGGC 168 0 

TGTTGCTGCT GCAAGAAGAG GAGCCAAGCT GTCCTGATGC AGATGAAGGC ACCCAATGAG 1740 

TGTTAAAGAG GCAGGCTGGA GCAGGGCTGG GGAATGATGG GACTGGAGGA CCTGGGAATT 180 0 

TCATCTTTCT GCCTCCACCC CTGGGTCCAT GGAGCTTTCC CGTGATTGCT CTTTCTGGCC 18 6 0 

40 CTAGATAAAG GTGTGCCTAC CTCTTCCTGA CTTGCCTGAT TCTCCCGTAG AGAAGCAGGT 192 0 

CGTGCCGGAC CTTCCTACAA TCAGGAAGAT AGATCCAACT GGCCATGGCA AAAGCCCTGG 198 0 

GGATTTCCGA TTCATACCCC TGGGCTTCCT TCGAGAGGGC TCTTCCTCCA AATCCTCCCC 2 04 0 

ACCTGTCCTC CAAGAACAGC CTTCCCTGCG CCCAGGCCCC CTCCGGGCCT CTGTAGACTC 210 0 

AGTTAGTCCA CAGCCTGCTC ACTT CGTGGG AATAGTT CT C CGCTGAGATA GCCCCTCTCG 2160 

45 CCTAAGTATT ATGTAAGTTG ATTTCCCTTC TTTTGTTTCT CTTGTTTGTG CTATGGCTTG 222 0 

ACCCAGCATG TCCCCTCAAA TGAAAGTTCT CCCCTTGATT TTCTGCTCCT GAAGGCAGGG 22 80 

TGAGTTCTCT CCTCAAAGAA GACTTCAAAC CATTTAACTG GTTTCTTAAG AGCCGTCAAT 2340 

CAGCCTGGTT TTGGGGATGC TATGAAAGAG AGAAGGAAAA TCATGCCGCT CAGTTCCTGG 240 0 

AGACAGAAGA GCCGTCATCA GTGT CTCACT TGTGATTTTT ATCTGGAAAA GGAAGAAACA 2460 

50 CCCCAGCACA GCAAGCTCAG CCTTTTAGAG AAGGATATTT CCAAACTGCA AACTTTGCTT 2520 

TGAAAAGTTT AGCCCTTTAA GGAATGAAAT CATGTAGAAT TTTGGACTTC TAAAAACATT 25 80 

AAAATCAGCT TATTAATACG GGATAGAGAA AGAAATCTGG TGCCTGGGGG TCCCTGTGTT 2 640 

CACCCCTAGA GTTTGTTTTA AAATTTTTAA TTGAAGCATG TGAAGTGTAC STGCAGAAAA 2700 

GTGGGAACAT GATAGTGTAT GGCTTGGTGG ATTTTCACAA ACTGAACATA CCTGTGTAAT 27 60 

55 CAGCATCTAG ACCCAGACCC AGAGCATCAC AAATATCCCC CATCCTGGGC TTTTCCCAGA 2 820 

GGAGATGGGG GCTTCTGAAG ATGGACTTAC CTGGGACCTG CCCCCCATGA GCCAGGACGG 2 880 

TCCCCCCACA GTCAGCCTGT GCAAAGGCCC CGTGGCCAGG GGTGGAGGAG AATATGTGGG 2 940 

TGTGGACAGG ATGGGAGACT GTGGCCTGAA CAGGAGATTT TATTATATCT GGAGACCCTG 3 000 

AGAGACCCTG AGACCTGGGG CACCATGGCT GGCCAGGTCA GAAGCATCCT GACTGCAGAG 3 060 

60 GTCCGTGCAG CCACACCCTC TTCCCTGCCA GCAAGTTGTC TGCGGCTCAT CGGAGGCCCC 312 0 

TCCGCCTGGA GCCTTCTATG GACGTGATAT GCCTGTATCT GTTTTTAATT TTCATTCTTC 3180 

ACTTAGGGGA AGTGAAATCG CTCAGAGATG AGATCCTTTA ATTGAAAACG AAGTGTAACG 3240 

GAATCTAGTG TCTTTCTAAT GTGGTAAAAT TCTCCATCAA CATCACAGTC AGCTGGCAGC 33 00 

TGAACTTCAG AATCTCACTT ACAGCAGGCG ACACGGGGGT ACACCGATGG GTCACACTGG 3360 

65 GTCTGGGGGC TCCCTGGAGC TCCTCCTGCG TGTGGTCTGG TTAGGAGTTG AGTTGTTTGC 3420 

TCCAGGGTTA TTCTCCTCCT CGAGTCACAG TCACACGAAT ACCTGCCTTC TCTGGCTTTC 3 480 

CTGCTATACA CATATTCACA TGGCGCTCAA GAAGTTAGGC TCATGGCAAC GTGTGTCTTT 3540 

CTCTGGACAA CTGGCCCAGT TTACAGTGAA ATGGAGAATT TCAGGTCTCC ACGTCTGCCC 3 600 

AGGAAAGAAC TTCAGCTGAC TCCACGGGGA TCTGGAAATC CACGACCAAT CCCGATCGGC 3 660 

70 TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGCTTTGG AAATCCACCA CCAATCCCGA 3720 

TCGGCTCTTA TTAGCTCCCC GCTCCACAAG ACACCTGTGA TCTGGAAATC TACCACCAAT 3 780 

CCCGATCGGC TCTTATTAGC TCCCCGCTCC ACAAGACACC TGTGACAT CC TCCAGGGCCA 3 840 

CAGGAGCACG TGCTGACCAG TTTTCCCTTC CAGTTCCTGC ACAAAAAGTG TCCAGAGGGC 3 900 

TGTTTGCAAA CACTAGTGCA CTTTGTAGCT TTTCACCCTC TGTCCCAGGG AATCTAGGAG 3960 

75 AGATGAGGCC CGTCAGAGTC AAGAGATGTC ATCCCCCCAG GGTCT CCAAG GCATTTCCAC 4020 

ACTATTGGTG GCACCTGGAG GACATGCACC AAGGCTTGCC AGAGCCAACA GGAAGTGAGC 4080 

CCAGAGCATG GCACATGAGC ATCACCCGCT GATGGTGGCC TGCTGTGCCT GGTGCCAACA 4140 

GGGGCATCCC GGCCCGTACC CCTCCAGACA GGAAGCATGG GTTTGCCCAC AGACCTGTCG 4200 

GGTGCTCCTG TGAGTGGCCT CCAGATGTCT TTGTGCATAG GCACAAGTGG GCCAGGGCTG 42 60 

80 GAGGGAGGTG GGAAACCTCA T CAT CCGGTG GGCCCTGCCA ATCTTAACCC AGAACCCTTA 4320 

GGTATTCCTG GCAGTAGCCA TGACATTGGA GCACCTTCCT CTCCAGCCAG AGGCTGACCT 4380 

GAGGGCCACT GTCCTCAGAT GACACCACCC AGGAGCACCC TAGGTGAGGG GTGAGGGCCC 4440 

CCTTATGTGA ACCTCTTGCC TCTTCCTTTC TCCCATCAGA GTGGTTGGAT GGAGCCATTG 4500 

GCCTCCTTTT CTTCAGCGGG CCCTTCAACC TCTCTGCACC ATGTTGTCTG GCTGAGGAGC 4560 

85 TACTAGAAAA GCTGAGTGGA GTCTCCTTTC CAACAGGATG ATGCATTTGC TCAATTCTCA 4620 

GGGCTGGAAT GAGCCGGCTG GTCCCCCAGA AAGCTGGAGT GGGGTACAGA GTTCAGTTTT 4680 

CCT CTCTGTT TACAGCTCCT TGACAGTCCC ACGCCCATCT GGAGTGGGAG CTGGGAGTTA 4740 
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GTGTTGGAGA AGAAACAACA AAAGCCAATT 
TGCACAGATA CTCTTCAAGC ACTGGACGTG 
GGTAGGAGTG CCGCCTCTAC CCACTTGTGA 
GGTGTTCAAT AGGCTGGGAG TTTTATTTAT 
TTGTCTTGGG CTTTCGTCAT TAAACCAAAG 
TTAGTCTTGG TCATCAGAAC CTCACTTGGT 
GGAAAAAATA AACTCTTCCA TCCCTTAAAG 
TGGGCTGTAT GTATATTGTT CTTCCTCCTT 
AACTTTTCAT GGACACAATT TCCACAACCT 
GAACTTCCAA ACTCAGGAAG TTTGCAGAGA 
AGTTGGTCGA CAGATGTTAG ATGTATCCTA 
GCCCCCAGAT CCCACAGTCA GAACTGAATC 
GGAAGGAAGC CATGGCTGTG GTTCAGAGAG 
CTCCTTCCGC CCCAGGTTTC TTCTTCTCTT 
CTTCATGCTG CCTTCAAAGC TAGATCATGT 
GCCCCAGTGC TTGGCGATGC ATTTACAGAT 
AGCCCTGGTG GGCAGGGTTG GGGGGTCTGT 
GGTGTACAGA ATCAACAATA AATAATATAC 

Seq ID NO: 286 Protein sequence: 
Protein Accession #: NP_570843.1 



PCT/US02/12476 



AGAACCACTA 
GATTCTCTCT 
TGGGGTACAG 
CTCTTCAAAC 
GAAATGGAAG 
ACCATATAGA 
AATAGAATAG 
AGAATTTAGA 
TTCAGATGCT 
GCAGACAGCT 
GCTTTTAGCC 
TGCGTTGTTG 
GGTGGGCTGG 
AAGGAGAGAT 
TTGCCTTGCT 
TTCTAGGCCC 
CTTCTGCTGG 
ATGTAT 



MPLKHYLLLL 
LNTHITELNE 
FQGLDSLESL 
GKNSLTHISP 
FHNNHNLQRL 
YDNHISSLPD 
FRMTjANLQNI 
YDNPWRCDSD 
VPSYPETPWY 
IAAIVIGIVA 



11 

I 

VGCQAWGAGL 
SPFLNISALI 
LLSSNQLLQI 
RVFQHLGNLQ 
YLSNNHISQL 
NVFSNLRQLQ 
SLQNNRLRQL 
ILPLRNWLLL 
PDTPSYPDTT 
LACSLAACVG 



21 

1 

AYHGCPSECT 
ALRIEKNELS 
QPAHFSQCSN 
VLRLYENRLT 
PPSIFMQLPQ 
VLILSRNQIS 
PGNIFANVNG 
NQPRLGTDTV 
SVSSTTELTS 
CCCCKKRSQA 



31 
I 

CSRASQVECT 
RITPGAFRNI) 
LKELQLHGNH 
DIPMGTFDGL 
LNRLTLFGNS 
FISPGAFNGL 
LMAIQLQNNQ 
PVCFSPANVR 
PVEDYTDLTT 
VLMQMKAPNE 



Seq ID NO: 287 DNA sequence 
Nucleic Acid Accession #: NM_002362 
Coding sequence: 1..954 



ATGTCTTCTG 
GAGGCCCTGG 
TCCTCCTCCT 
GGTCCTCCCC 
TGGAGGCAAC 
GACGCAGAGT 
CTGCTCCGCA 
ATCAAAAATT 
ATGATCTTTG 
ACCTGCCTGG 
GGCCTTCTGA 
GAAATCTGGG 
GGGGAGCCCA 
CAGGTACCCG 
GAAACCAGCT 
GCCTACCCAT 



11 
I 

AGCAGAAGAG 
GCCTGGTGGG 
CTCCTCTGGT 
AGAGTCCTCA 
CCAATGAGGG 
CCTTGTTCCG 
AGTATCGAGC 
ACAAGCGCTG 
GCATTGACGT 
GCCTTTCCTA 
TAATCGTCCT 
AGGAGCTGGG 
GGAAACTGCT 
GCAGTAATCC 
ATGTGAAAGT 
CCCTGCGTGA 



21 
I 

TCAGCACTGC 
TGCACAGGCT 
CCCTGGCACC 
GGGAGCCTCT 
TTCCAGCAGC 
AGAAGCACTC 
CAAGGAGCTG 
CTTTCCTGTG 
GAAGGAAGTG 
TGATGGCCTG 
GGGCACAATT 
TGTGATGGGG 
CACCCAAGAT 
TGCGCGCTAT 
CCTGGAGCAT 
AGCAGCTTTG 



31 
I 

AAGCCTGAGG 
CCTACTACTG 
CTGGAGGAAG 
GCCTTACCCA 
CAAGAAGAGG 
AGTAACAAGG 
GTCACAAAGG 
ATCTTCGGCA 
GACCCCGCCA 
CTGGGTAATA 
GCAATGGAGG 
GTGTATGATG 
TGGGTGCAGG 
GAGTTCCTGT 
GTGGTCAGGG 
TTAGAGGAGG 



TTTTTAAAAA 


G1GG1 1AGJ.G 


4800 


U 1 AGl_(_C_ 1 (_A 


GGAGGGG 1 GG 


4860 


AGGGA(_± 1GG 


TCTTCTGCAT 


4 92 0 


1 1 1G1AGAAG 




4980 


CCATTCCCCT 


biTGCTCTCC 


5040 


I GAAAAGG I I 


Ibi AAGGAGA 


5100 


TTTGTCCCTC 


TCATGGGAAT 


5160 


GA 1 AG AAGAG 


1 1L1AL1 1 AG 


5220 


GATGTAGAGC 


TATTGGGAAA 


5280 


AGAGATAACT 


CGGGACCCAG 


5340 


ATAAACCACT 


CAAAGATTCA 


5400 


GGAAGCCAGC 


AGTGGCCTTG 


5460 


CAAGCCACTT 


CCGGGGAAAA 


5520 


TGTTCTCACC 


AACCCGCTGC 


5580 


TAGAGAATTA 


CTGCAAATCA 


5640 


TCAGGGTTTT 


GTAGAGTGTG 


5700 


ATGCTGCTTG 


TAATCCATTT 


5760 


41 


51 




GAK J. V A V f 1 f 


Jj ir* W W AM o Jj y J. 


60 


f CT .T3 VT , OT . TiM 
bobKl LioJjAJN 


TvTVT ri\7T,t>mT. 
IN Jxij y V Jj ir JLlxU 


12 0 


LEYIPDGAFD 


HLVGLTKLNL 


180 


VNLQELALQQ 


NQIGLLSPGTj 


240 


IiKELSLGIFG 


PMPNLRELWL 


300 


TPT ,p r?r , OT ,T-TT 
1 Cj1jK.xMJ Oi-jrl 1 


Tja t ,nriT ."nn^n/ 


3 60 


LENLPLGIFD 


HLGKLCELRL 


420 


GQSLIIINVN 


VAVPSVHVPE 


480 


IQVTDDRSVW 
C 


GMTQAQSGLA 


540 


41 

t 


51 
I 




1 

AAGGGG1 


1 

G G G G GAAG AA 


60 


7V 7V f~* 7V 0/**l TV 

AGGAGGAGGA 


GG CTGCTGTC 




TGCCTGCTGC 


TG AGT CAGG A 


180 


CTACCATCAG 


CTTCACTTGC 


240 


AGGGGGGAAG 


LACL 1 GGGG 1 




I GGAJ. GAG 1 1 


GGCTCATTTT 




CAGAAATGCT 


GGAG AGAG 1 L. 


90 


AAGGG 1 LLbA 


G 1 GGG i GAAG 


480 


GGAAGAG<- 1 A 


vj/iv_v_v_ 1 ibl L 


540 


Al GAGA 1L1 1 




600 


GCGACAGCGC 


CT CTGAGGAG 


660 


GGAGGGAGCA 


CACTGTCTAT 


720 


AAAACTACCT 


GGAGTACCGG 


780 


GGGGTCCAAG 


GGCTCTGGCT 


840 


TCAATGCAAG 


AGTTCGCATT 


900 


AAGAGGGAGT 


CTGA 





60 
65 
70 
75 
80 
85 



Seq ID NO: 288 Protein sequence: 
Protein Accession U : NP 002353.1 



MSSEQKSQHC 
GPPQSPQGAS 
LLRKYRAKEL 
TCLGLSYDGL 
GEPRKLLTQD 
AYPSLREAAL 



11 
I 

KPEEGVEAQE 
ALPTTISFTC 
VTKAEMLERV 
LGNNQIFPKT 
WVQENYIiEYR 
LEEEEGV 



21 
I 

EALGLVGAQA 
WRQPNEGSSS 
IKNYKRCFPV 
GLLIIVLGTI 
QVPGSNPARY 



31 

I 

PTTEEQEAAV 
QEEEGPSTSP 
IFGKASESIiK 
AMEGDSASEE 
EFLWGPRAIjA 



41 

I 

SSSSPLVPGT 
DAESLFREAL 
MIFGIDVKEV 
EIWEELGVMG 
ETSYVKVLEH 



51 
I 

LEEVPAAESA 
SNKVDELAHF 
DPASNTYTLV 
VYDGREHTVY 
WRVNARVR I 



Seq ID NO: 2 89 DNA sequence 
Nucleic Acid Accession #: NMJ302362 
Coding sequence: 46.. 1344 



CGGCGGCCGC 
GGCGACCTGA 
CATCAGCGCG 
CT CAACAGAC 
TTGACCAGAA 
CAGCCCATCG 
GGCCCCAGCA 
GTTCTACCTG 
AAATCCCATC 



11 

I 

GCCCTGGTTG 
AGCAGGCGCT 
GCAGCAGCAC 
ATAATATTGT 
ATGTGCAGTC 
ATTTGAGTGC 
GTGAAAATCT 
CAGCTGAATT 
TCCTCGATTA 



21 

1 

GGTCCCCACT 
TCCCTGTGTG 
TGCAAAGAAA 
GTTTGGTGAT 
TGTGTCTATT 
ATGCACTGTT 
GGAGGAAGAG 
CCATGGGCTT 
TGTGATGACA 



31 
I 

GCTCTCGGGG 
GCCGAGTCGC 
GAAGACATAA 
TACACATGGA 
ATTGACACAG 
GCACTTCACA 
ACAGAAAACA 
TGGGACAGCT 
ACTTTACTGT 



41 

I 

GCGCCATGGA 
CAACGGTCCA 
ACCTGAGTGT 
CTGAGTTTGA 
AATTAAAGGT 
TTTTCCAGCT 
TAATTGCAGC 
TGGTATACGA 
TTT CAGACAA 



51 

I 

CGAGGCCGTG 
CGTGGAGGTG 
TAGAAAGCTA 
TGAACCTTTT 
TAAAGACTCA 
GAATGAAGAT 
AAATCACTGG 
TGTGGAAGTC 
GAACGTCAAC 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 



294 



WO 02/086443 

AGCAACCTCA TCACCTGGAA CCGGGTGGTG CTGCTCCACG GTCCTCCTGG CACTGGAAAA 600 

ACATCCCTGT GTAAAGCGTT AGCCCAGAAA TTGACAATTA GACTTTCAAG CAGGTACCGA 660 

TATGGCCAAT TAATTGAAAT AAACAGCCAC AGCCTCTTTT CTAAGTGGTT TTCGGAAAGT 720 

GGCAAGCTGG TAACCAAGAT GTTTCAGAAG ATTCAGGATT TGATTGATGA TAAAGACGCC 780 

CTGGTGTTCG TGCTGATTGA TGAGGTGGAG AGTCTCACAG CCGCCCGAAA TGCCTGCAGG 840 

GCGGGCACCG AGCCATCAGA TGCCATCCGC GTGGTCAATG CTGTCTTGAC CCAAATTGAT 900 

CAGATTAAAA GGCATTCCAA TGTTGTGATT CTGACCACTT CTAACATCAC CGAGAAGATC 960 

GACGTGGCCT TCGTGGACAG GGCTGACATC AAGCAGTACA TTGGGCCACC CTCTGCAGCA 102 0 

GCCATCTTCA AAATCTACCT CTCTTGTTTG GAAGAACTGA TGAAGTGTCA GAT CAT AT AC 10 8 0 

CCTCGCCAGC AGCTGCTGAC CCTCCGAGAG CTAGAGATGA TTGGCTTCAT TGAAAACAAC 1140 

GTGTCAAAAT TGAGCCTTCT TTTGAATGAC ATTTCAAGGA AGAGCGAGGG CCTCAGCGGC 12 00 

CGGGTCCTGA GAAAACTCCC CTTTCTGGCT CATGCGCTGT ATGTCCAGGC CCCCACCGTC 12 60 

ACCATAGAGG GGTTCCTCCA GGCCCTGTCT CTGGCAGTGG ACAAGCAGTT TGAAGAGAGA 1320 

AAGAAGCTTG CAGCTTACAT CTGATCCTGG GCTTCCCCAT CTGGTGCTTT TCCCATGGAG 13 8 0 

AACACACAAC CAGTAAGTGA GGTTGCCCCA CACAGCCGTC TCCCAGGGAA TCCCTTCTGC 1440 

AAACCAAACG TTACTTAGAC TGCAAGCTAG AAAGCCACCA AGGCCAGGCT TTGTTAAAAG 150 0 

AAGTGTATTC TATTTATGTT GTTTTAAAAT GCATACTGAG AGACAAACAT CTTGTCATTT 15 60 

TCACTGTTTG TAAAAGATAA TTCAGATTGT TTGTCTCCTT GTGAAGAACC ATCGAAACCT 162 0 

GTTTGTTCCC AGCCCACCCC CAGTGGATGG GATGCATAAT GCCAGCAAGT TTTGTTTAAC 168 0 

AGCAAAAAAG GAAGATTAAT GCAGGTGTTA TAGAAGCCAG AAGAGAAACT GTGTCACCCT 174 0 

AAAGAAGCAT ATAATCATAG CATTAAAAAT GCACACATTA CTCCAGGTGG AAGGTGGCAA 18 0 0 

TTGCTTTCTG ATATCAGCTC GTTTGATTTA GTGCAAAAAT GTTTTCAAGA CTATTTAATG 18 60 

GATGTAAAAA AGCCTATTTC TAG AT TAT AC CAACTGAGAA AAAAATGGTC GGTAAAGTGT 1920 

TCTTTCATAA TAAATAATCA AGACATGGTC CCATTTGCAG GAAAAGTGCA GACTCTGAGT 1980 

GTTCCAGGGA AACACATGCT GGACATCCCT TGTAACCCGG TATGGGCGCC CCTGCATTGC 2040 

TGGGATGTTT CTGCCCACGG TTTTGTTTGT GCAATAACGT TATCACATTT CTAATGAGGA 2100 

TTCACATTAA TATAATATAA AATAAATAGG TCAGTTACTG GTCTCTTTCT GCCGAATGTT 2160 
ATGTTTTGCT TTTATCTCAC AGTAAAATAA ATATAATTAA AAA 



Seq ID NO: 2 90 Protein sequence: 
Protein Accession #: NP_004228 

1 11 21 31 41 51 

I I I I I I 

MDEAVGDLKQ ALPCVAESPT VHVEVHQRGS STAKKEDINL SVRKLLNRHN IVFGDYTWTE 60 
FDEPFLTRNV QSVSIIDTEL KVKDSQPIDL SACTVALHIF QLNEDGPSSE NLEEETENII 12 0 
AANHWVLPAA EFHGLWDSLV YDVEVKSHLL DYVMTTLLFS DKNVNSNLIT WNRWLLHGP 18 0 
PGTGKTSLCK ALAQKLTIRL SSRYRYGQLI EINSHSLFSK WFSESGKLVT KMFQKIQDLI 240 
DDKDALVFVL IDEVESLTAA RNACRAGTEP SDAIRWNAV LTQIDQIKRH SNWILTTSN 3 00 
ITEKIDVAFV DRADIKQYIG PPSAAAIFKI YLSCLiEELMK CQIIYPRQQL LTLRELEMIG 3 60 
FIENNVSKLS LLLNDISRKS EGLSGRVLRK LPFLAHALiYV QAPTVTIEGF LQALSLAVDK 42 0 
QFEERKKLAA YI 



Seq ID NO: 291 DNA sequence 

Nucleic Acid Accession NM_002658 . 1 

Coding sequence: 77-1372 

1 11 21 31 41 51 

I I I I I I 

GTCCCCGCAG CGCCGTCGCG CCCTCCTGCC GCAGGCCACC GAGGCCGCCG CCGTCTAGCG 60 

CCCCGACCTC GCCACCATGA GAGCCCTGCT GGCGCGCCTG CTTCTCTGCG TCCTGGTCGT 120 

GAG CGACTCC AAAGGCAGCA ATGAACTTCA TCAAGTTCCA TCGAACTGTG ACTGTCTAAA 180 

TGGAGGAACA TGTGTGTCCA ACAAGTACTT CTCCAACATT CACTGGTGCA ACTGCCCAAA 24 0 

GAAATTCGGA GGG CAGC ACT GTGAAATAGA TAAGTCAAAA ACCTGCTATG AGGGGAATGG 3 00 

TCACTTTTAC CGAGGAAAGG CCAGCACTGA CACCATGGGC CGGCCCTGCC TGCCCTGGAA 3 60 

CTCTGCCACT GTCCTTCAGC AAACGTACCA TGCCCACAGA TCTGATGCTC TTCAGCTGGG 420 

CCTGGGGAAA CATAATTACT GCAGGAACCC AGACAACCGG AGGCGACCCT GGTGCTATGT 480 

GCAGGTGGGC CTAAAGCCGC TTGTCCAAGA GTGCATGGTG CATGACTGCG CAGATGGAAA 540 

AAAGCCCTCC TCTCCTCCAG AAGAATTAAA ATTTCAGTGT GGCCAAAAGA CTCTGAGGCC 600 

CCGCTTTAAG ATTATTGGGG GAGAATTCAC CACCATCGAG AACCAGCCCT GGTTTGCGGC 660 

CATCTACAGG AGGCACCGGG GGGGCTCTGT CACCTACGTG TGTGGAGGCA GCCTCATCAG 720 

CCCTTGCTGG GTGATCAGCG CCACACACTG CTTCATTGAT TACCCAAAGA AGGAGGACTA 780 

CATCGTCTAC CTGGGTCGCT CAAGGCTTAA CTCCAACACG CAAGGGGAGA TGAAGTTTGA 840 

GGTGGAAAAC CTCATCCTAC ACAAGGACTA CAGCGCTGAC ACGCTTGCTC ACCACAACGA 900 

CATTGCCTTG CTGAAGATCC GTTCCAAGGA GGGCAGGTGT GCGCAGCCAT CCCGGACTAT 960 

ACAGACCATC TGCCTGCCCT CGATGTATAA CGATCCCCAG TTTGGCACAA GCTGTGAGAT 102 0 

CACTGGCTTT GGAAAAGAGA ATTCTACCGA CTAT CTCT AT CCGGAGCAGC TGAAAATGAC 10 80 

TGTTGTGAAG CTGATTTCCC ACCGGGAGTG TCAGCAGCCC CACTACTACG GCTCTGAAGT 1140 

CACCACCAAA ATGCTATGTG CTGCTGACCC CCAATGGAAA ACAGATTCCT GCCAGGGAGA 1200 

CTCAGGGGGA CCCCTCGTCT GTTCCCTCCA AGGCCGCATG ACTTTGACTG GAATTGTGAG 12 60 

CTGGGGCCGT GGATGTGCCC TGAAGGACAA GCCAGGCGTC TACACGAGAG TCTCACACTT 1320 

CTTACCCTGG ATCCGCAGTC ACACCAAGGA AGAGAATGGC CTGGCCCTCT GAGGGTCCCC 1380 

AGGGAGGAAA CGGGCACCAC CCGCTTTCTT GCTGGTTGTC ATTTTTGCAG TAGAGTCATC 1440 

TCCATCAGCT GTAAGAAGAG ACTGGGAAGA TAGGCTCTGC ACAGATGGAT TTGCCTGTGG 1500 

CACCACCAGG GTGAACGACA ATAGCTTTAC CCTCACGGAT AGGCCTGGGT GCTGGCTGCC 1560 

CAGACCCTCT GGCCAGGATG GAGGGGTGGT CCTGACTCAA CATGTTACTG ACCAGCAACT 1620 

TGTCTTTTTC TGGACTGAAG CCTG CAGGAG TTAAAAAGGG CAGGGCATCT CCTGTGCATG 1680 

GGCT CGAAGG GAGAGCCAGC TCCCCCGACC GGTGGGCATT TGTGAGGCCC ATGGTTGAGA 1740 

AATGAATAAT TTCCCAATTA GGAAGTGTAA GCAGCTGAGG TCTCTTGAGG GAGCTTAGCC 1800 

AATGTGGGAG CAGCGGTTTG GGGAGCAGAG ACACTAACGA CTTCAGGGCA GGGCTCTGAT 18 60 

ATTCCATGAA TGTATCAGGA AATATATATG TGTGTGTATG TTTGCACACT TGTTGTGTGG 192 0 

GCTGTGAGTG TAAGTGTGAG TAAGAGCTGG TGTCTGATTG TTAAGTCTAA ATATTTCCTT 1980 

AAACTGTGTG GACTGTGATG CCACACAGAG TGGTCTTTCT GGAGAGGTTA TAGGT CACTC 2040 

CTGGGGCCTC TTGGGTCCCC CACGTGACAG TGCCTGGGAA TGTACTTATT CTGCAGCATG 2100 

ACCTGTGACC AGCACTGTCT CAGTTTCACT TTCACATAGA TGTCCCTTTC TTGGCCAGTT 2160 

ATCCCTTCCT TTTAGCCTAG TTCATCCAAT CCTCACTGGG TGGG GTGAGG ACCACTCCTT 222 0 

ACACTGAATA TTTATATTTC ACTATTTTTA TTTATATTTT TGTAATTTTA AATAAAAGTG 2280 



295 



WO 02/086443 

ATCAATAAAA TGTGATTTTT CTGA 

Seq ID NO: 292 Protein sequence: 
Protein Accession # :NP_002S49 . 1 



PCT/US02/12476 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 



1 
I 

MRAIiLARLLIj 
HCEIDKSKTC 
YCRNPDNRRR 
GGEFTTIENQ 
RSRLNSNTQG 
PSMYNDPQFG 
CAADPQWKTD 
SHTKEENGLA 



11 
I 

CVLWSDSKG 
YEGNGHFYRG 
PWCYVQVGLK 
PWFAAIYRRH 
EMKFEVENL I 
TSCE I TGFGK 
SCQGDSGGPL 
L 



21 

I 

SNELHQVPSN 
KASTDTMGRP 
PL.VQECMVHD 
RGGSVTYVCG 
LHKDYSADTL 
ENSTDYLYPE 
VCSLQGRMTL 



31 

1 

CDCLNGGTCV 
CLPWNSATVL 
CADGKKPSSP 
GSLISPCWVI 
AHHNDIALLK 
QLKMTWKIiI 
TGIVSWGRGC 



41 

I 

SNKYFSN I HW 
QQTYHAHRSD 
PEELKFQCGQ 
SATHCFIDYP 
IRSKEGRCAQ 
SHRECQQPHY 
ALKDKPGVYT 



51 

I 

CNCPKKFGGQ 
ALQLGLGKHN 
KTLRPRFKI I 
KKEDYIVYLG 
PSRTIQTICL 
YGSEVTTKML 
RVSHFL.PWIR 



Seq ID NO: 2 93 DNA sequence 
Nucleic Acid Accession #: NM_001498 
Coding sequence: 93.. 2006 



GGCACGAGGC 
AGGAGGAGGA 
TGAGCTGGGA 
TCCTGCACAT 
AGGTGGAATA 
CTGGGGAGAA 
CTACCCTTTG 
ACGGAGGAAC 
AGGCTACTTC 
TAGGCTGTCC 
CCAAGTCCCT 
CAAGAAATAT 
ACAAGAATAC 
CTTCTAAGCC 
T C C AGGTG AC 
CTACTATCTG 
TGT CAGACAT 
AGGAGCGAGG 
ACTCAATAGA 
TAGATAAAGA 
ATGTTGCTCA 
ATGATGCTAA 
GATTTAAGCC 
AGGTGCAATT 
GAGTGAT CCT 
TGAAGGTAGC 
TTTGCAAAGG 
TCGCTGCAGA 
TGTTTCCTGG 
ACACCAGATG 
TAATGACAGT 
ACAGTGTCAT 
ATGAATTATG 
GAAGTAAAAC 
ACTGGCTACA 
CTGTACTGTT 
TCTAGAGTTT 
AATAACATAT 
CAACCTACTG 
GAGTTAAAAT 
TGAAATTTTT 
TGTCTACATT 
GCACCTTATC 
GCTTTCCCCT 



11 
I 

TGAGTGTCCG 
GGAGGAGGAG 
GGAAACCAAG 
CTACCACGCC 
CATGTTGGTA 
AGTTCTTGAA 
GAGACCAGAG 
AATGTCCGAG 
TATATTAGAA 
TGGGTTCACA 
CTTCTTTCCA 
CCGACATAGG 
ACCATCTCCA 
GGATCATATT 
ATTCCAAGCC 
TCCAATTGTT 
TGATTGTCGC 
ACTGGAGCCA 
CAGCTATTTA 
GATCTACGAA 
TCTCTTTATT 
TGAGTCTGAC 
CCCTCCTCCA 
AACAGACTTT 
TTCCTACAAA 
ACAGAAAAGA 
TGGCAATGCA 
GGAGTACACC 
ACTGATCCCA 
TAGTATTCTG 
TGCCAGATGG 
AACTGATGAA 
TGAATGCCCA 
TGACTCATCC 
GTACCATGCC 
TTCTGGGCCA 
ATACAGTGTA 
CTAAAGTCAT 
T CTAAGCAGT 
GTTTACTGTA 
CTCTTTAAAA 
AAATCACTTG 
TATGATGTTT 
CTGAATAAAT 



21 
I 

TCTCGCGCCC 
GAGGGGGCGG 
CGCCATGCCG 
GTCAAGGACC 
TCTTTTGATC 
ACTCTGCAAG 
TATGGGAGTT 
TTCAATACAG 
GAAAATCAGG 
CTGCCCGAGG 
GATGAAGCAA 
AGAGGAGAAA 
TTTATAGAAA 
TACATGGATG 
TGCAGTATAT 
ATGGCTTTGA 
TGGGGAGTGA 
TTGAAGAACA 
TCTAAGTGTG 
CAGCTGTTGC 
AGAGACCCAC 
CATTTTGAGA 
AACTCAGACA 
GAGAACTCTG 
TTGGATTTTC 
GATGCTGTCT 
GTGGTGGATG 
CTCATGAGCA 
ATTCTGAACT 
AACTACCTAA 
ATGAGGGAGT 
ATGAATTATA 
GAGTTACTTG 
AACTAGACAT 
TCTCAGCCCG 
GTGAGCCAGA 
CATGTACATA 
CATGAACTGG 
TTTGTAAATG 
AATTTTTGTT 
ACATTTTCTC 
AATCCATTGA 
CTTTTGCAAT 
ACCCATTGAA 



31 

I 

GGAAGCGGGC 
CCATGGGGCT 
ACCACGTGCG 
GGCACAAGGA 
ATGAAAATAA 
AGAAGGGGGA 
ACATGATTGA 
TTGAGGCCAA 
CTCTTTGCAC 
TCAAACCCAA 
TAAACAAGCA 
AGGTTGTCAT 
CATTTACTGA 
CCATGGGATT 
CTGAGGCCAG 
GTGCTGCATC 
TTTCTGCATC 
ATAACTATAG 
GTGAGAAATA 
AGGAAGGCAT 
TGACACTGTT 
ATATTCAGTC 
TTGGATGGAG 
CCTATGTGGT 
TCATTCCACT 
TGCAGGGAAT 
GTTGTGGCAA 
TAGACACCAT 
CTTACCTTGA 
AGCTAATTAA 
TTATCGCAAA 
GCCTTATTTT 
GATCAGCATT 
TCTACAGAAA 
TGTGTATAAT 
AATTGATTAA 
GTAAAGTATT 
CTTGTACATT 
TACTGGTAAT 
CTTTTAAAGA 
TCGTTAATTT 
AAGTGCTTCA 
TGGAATAATC 
CTCTGAAAAA 



41 

I 

GACCGCCGTC 
GCTGTCCCAG 
GCGGCACGGG 
CGTTCTCAAG 
AAAAGTCCGG 
AAGGACAAAC 
AGGGACACCA 
CATGCGAAAA 
AATAACTTCA 
CCCAGTGGAA 
CCCTCGCTTC 
CAATGTACCA 
GGATGATGAA 
TGGAATGGGC 
ATACCTTTAT 
TCCCTTTTAC 
TGTAGATGAT 
GATCAGTAAA 
TAATGACATC 
TGATCAT CTC 
TGAAGAGAAA 
CACAAATTGG 
AGTAGAATTT 
GTTTGTGGTA 
GTCAAAGGTT 
GTTTTATTTC 
GGCCCAGAAC 
CATCAATGGG 
AAACATGGAA 
GAAGAGAGCA 
CCATCCTGAC 
GAAGTGTAAC 
TAGGAAAGTA 
GAAAAATGCA 
ATGAAGACCA 
GGCTTTCTTT 
TTTGATTAAC 
TTTAAATTCT 
TGTACAATAC 
CTACCTGGGA 
TCCTTTGTCA 
AGGGTAATCT 
ACTTGGTCAC 
AAAAAAAAAA 



51 
I 

AGCCCGGAGG 
GGCTCGCCGC 
ATCCTCCAGT 
TGGGGCGATG 
TTGGTCCTGT 
CCAAACCATC 
GGACAGCCCT 
CGCCGGAAGG 
TTTCCCAGAT 
GGAGGAGCTT 
AGTACCTTAA 
ATATTTAAGG 
GCTTCAAGGG 
AATTGCTGTC 
GATCAGTTGG 
CGAGGCTATG 
AGAACT CGGG 
TCCCGATATG 
GACTTGACGA 
CTGGCCCAGC 
ATACACCTGG 
CAGACAATGA 
CGACCCATGG 
CTGCTCACCA 
GATGAGAACA 
AGGAAAGATA 
AGCACGGAGC 
AAGGAAGGTG 
GTGGATGTGG 
TCTGGAGAAC 
TACAAGCAAG 
CAAATTGCAA 
AAATATAGTG 
TTATTGACGA 
AATGATAGAA 
GGTAGGTAAA 
AATGTATTTT 
TACTCTGGAG 
TTGCATTCCA 
CCTGATTTAT 
TTTCCTTTGT 
TGGGTTTCTA 
CTTGCCCCAA 
AAAA 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 



70 



Seq ID NO: 2 94 Protein sequence: 
Protein Accession #: NP_001489 



75 
80 
85 



MGLLSQGSPL 
ENKKVRLVLS 
EANMRKRRKE 
NKHPRFSTLT 
MGFGMGNCCL 
SASVDDRTRE 
EG I DHIjIjAQH 
GWRVEFRPME 
QGMFYFRKDI 
YLENMEVDVD 
LILKCNQIAN 



11 
I 

SWEETKRHAD 
GEKVLETLQE 
ATSILEENQA 
RNIRHRRGEK 
QVTFQACSIS 
ERGLEPLKNN 
VAHLFIRDPL 
VQLTDFENSA 
CKGGNAWDG 
TRCSILNYLK 
ELCECPELLG 



21 
I 

HVRRHGILQF 
KGERTNPNHP 
LCTITSFPRL 
WINVPIFKD 
EARYLYDQLA 
NYRI SKSRYD 
TLFEEKIHIiD 
YWFWLLTR 
CGKAQNSTEL 
LIKKRASGEL 
SAFRKVKYSG 



31 
I 

LHIYHAVKDR 
TLWRPEYGSY 
GCPGFTLPEV 
KNTPSPFIET 
TICPIVMALS 
SIDSYLSKCG 
DANE SDHFEN 
VILSYKLDFL 
AAEEYTLMSI 
MTVARWMREF 
SKTDSSN 



41 

! 

HKDVIjKWGDE 
MIEGTPGQPY 
KPNPVEGGAS 
FTEDDEASRA 
AASPFYRGYV 
EKYNDIDLTI 
IQSTNWQTMR 
IPLSKVDENM 
DTIINGKEGV 
IANHPDYKQD 



51 

I 

VEYMLVSFDH 
GGTMSEFNTV 
KSLFFPDEAI 
SKPDH I YMDA 
SDIDCRWGVI 
DKEIYEQLLQ 
FKPPPPNSDI 
KVAQKRDAVL 
FPGLIPILNS 
SVITDEMNYS 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



296 



WO 02/086443 

Seq ID NO: 295 DNA sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence: 247-816 



PCT/US02/12476 



10 



15 



20 



25 



30 



35 



AGTGTTCGGC 
GGCCAAACGG 
CCTAGGGGGC 
GGGAGGCGCC 
GAAACAATGA 
CCCAGGGAAT 
AAACAAGGAG 
ATGACAGGAC 
TTCAG CAAAG 
ACCAG CAGTT 
CAACGAGAAA 
CAAAAATATG 
AAGCGATTTT 
AAGCACCTTA 
CACACCCCAA 
TTCTACAATG 
CTTCCAGAGG 
TTGAAAGGAT 
TAACCCATTA 



11 

I 

TGGGGCAGGC 
GATCGGTGCT 
ACATTTCCCA 
ACAACTTCAC 
CCGATAAAAC 
GTGACAGTCC 
CAGGAGACAG 
ATGCTATTCC 
ATAGGATGAT 
TCTCTGGAGA 
TTAATGCTGA 
AAAAAATCTT 
TTGAATCCAT 
AGAAGAAACT 
ATGCATAATC 
GAG CAGGAT A 
CTAAGAAATT 
AACTTGTGTT 
GGTAAATACT 



21 

1 

ACGCTGTGGC 
TCTGGTGAGA 
CAACTCCCAG 
TGCCATTTTG 
AGAGAAGGTG 
TTCGTATCAG 
CCTTATTGCA 
ACCCAGCCAA 
GCAGAAACCT 
TGACCTAGAA 
TATAAAACGT 
CGAAATGCTT 
CAT C AAGGAA 
GAAACGTATG 
TCGTTAATGA 
TTGCTGAAGT 
TCTGTTAGTA 
TTGGTTATTT 
ATTACAGTCG 



31 
I 

TGGCTACTTC 
CGCCTCCCCA 
AGGGCAGGTT 
TGAGGTGCCG 
GCTGTAGATC 
AAAAGGCAGA 
GGCTCTGCCA 
TTGGATTCTC 
GGTAGCAATG 
TGCAGAGAAA 
AAATTAGTGA 
GAAGGAGTGC 
GCAGCAAGAT 
ATTTGAGAAT 
TTGAGGAGAG 
CTCCTGGCAT 
AAAGATGTTC 
TGTATTCCCA 
TGGTTTCTGC 



41 

1 

CCTTCCTCCC 
TGCACATCAC 
TCTAGAAAGT 
CCGTCTCTCC 
CTGAAACTGT 
GGATGGCCCT 
TGTCCAAAGA 
AGATTGATGA 
CACCTGTGGG 
CAGCCTCCTC 
AGGAACTCCG 
AAGGACCTAC 
GTATGAGACG 
ACTTGTCCCT 
AAAAGGATCA 
ATGTTACCGA 
TTTTTCCCAA 
CCTGTGCTGG 
A 



51 
I 

ATCCCCCTTG 
TCCCAGGTGC 
GCCACCAGTG 
TCCAGCAAGG 
GTTTAAACGT 
GTTGGCAAGG 
AAAGAAGCTT 
CTTCACTGGT 
AGGAAACGTT 
TCCCAAAAGC 
ATGCGTTGGA 
TGCAGTCAGG 
AGACTTTGTT 
GGAGGATTAT 
GATTGCTGTT 
ATCAAATAGC 
AGCATTTTAT 
TAGATATTAT 



Seq ID NO: 296 Protein sequence: 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I I 1 I I 1 

MTDKTEKVAV DPETVFKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKEKKLMT 

GHAIPPSQLD SQIDDFTGFS KDRMMQKPGS NAP VGGNVT S SFSGDDLECR ETASSPKSQR 

EINADIKRKL VKELRCVGQK YEKIFEMLEG VQGPTAVRKR FFESIIKEAA RCMRRDFVKH 
LKKKLKRMI 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



60 
120 
180 



40 



Seq ID NO: 2 97 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 247-815 



45 
50 
55 
60 



AGTGTTCGGC 
GGCCAAACGG 
CCTAGGGGGC 
GGGAGGCGCC 
GAAACAATGA 
CCCAGGGAAT 
AAACAAGGAG 
ATGACAGGAC 
TTCAGCAAAG 
ACCAGCAGTT 
CAACAAGAAA 
CAAAAATATG 
AAACGATTTT 
AAGCACCTTA 
CACACCCCAA 
TTCTACAATG 
CTTCCAGAGG 
TTGAAAGGAT 
TAACCCATTA 



11 
I 

TGGGGCAGGC 
GATCGGTGCT 
ACATTTCCCA 
ACAACTTCAC 
CCGATAAAAC 
GTGACAGTCC 
CAGGAGACAG 
ATGCTATTCC 
ATAGGATGAT 
TCTCTGGAGA 
TTAATGCTGA 
AAAAAATCTT 
TTGAATCCAT 
AGAAGAAACT 
ATGCATAATC 
GAGCAGGATA 
CTAAGAAATT 
AACTTGTGTT 
GGTAAATACT 



21 
I 

ACGCTGTGGC 
TCTGGTGAGA 
CAACTCCCAG 
TGCCATTTTG 
AGAGAAGGTG 
TTCGTATCAG 
CCTTATTGCA 
ACCCAGCCAA 
GCAGAAACCT 
TGACCTAGAA 
TATAAAACGT 
CGAAATGCTT 
CATCAAGGAA 
GAAACGTATG 
TCATTAATGA 
TTGCTGAAGT 
TCTGTTAGTA 
TTGGTTATTT 
ATTACAGTCG 



31 
I 

TGGCTACTTC 
CGCCTCCCCA 
AGGGCAGGTT 
TGAGGTGCCG 
GCTGTAGATC 
AAAAGGCAGA 
GGCTCTGCCA 
TTGGATTCTC 
GGTAGCAATG 
TGCAGAGAAA 
AAATTAGTGA 
GAAGGAGTGC 
GCAGCAAGAT 
ATTTGAGAAT 
TTGAGGAGAG 
CTCCTGGCAT 
AAAGATGTTC 
TGTATTCCCA 
TGGTTTCTGC 



41 
I 

CCTTCCTCCC 
TGCACATCAC 
TCTAGAAAGT 
CCGTCTCTCC 
CTGAAACTGT 
GGATGGCCCT 
TGTCCAAAGA 
AGATTGATGA 
CACCTGTGGG 
CAGCCTCCTC 
AGGAACTCCG 
AAGGACCTAC 
GTATGAGACG 
ACTTGTCCCT 
AAAAGGATCA 
ATGTTACCGA 
TTTTTCCCAA 
CCTGTGCTGG 
A 



51 
I 

ATCCCCCTTG 
TCCCAGGTGC 
GCCACCAGTG 
TCCAGCAAGG 
GTTTAAACGT 
GTTGGCAAGG 
AAAGAAGCTT 
CTTCACTGGT 
AGGAAACGTT 
TCCCAAAAGC 
ATGCGTTGGA 
TGCAGTCAGG 
AGACTTTGTT 
GGAGGATTAT 
GATTGCTGTT 
ATCAACTGGC 
AGCGTTTTAT 
TAGATATTAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



65 



Seq ID NO: 2 98 Protein sequence: 
Protein Accession #: Eos sequence 



1 11 21 31 41 51 

1 I I I 1 I 

MTDKTEKVAV DPETVFKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKEKKLMT 
70 GHAIPPSQLD SQIDDFTGFS KDRMMQKPGS NAP VGGNVT S SFSGDDLECR ETASSPKSQQ 

EINADIKRKL VKELRCVGQK YEKIFEMLEG VQGPTAVRKR FFESIIKEAA RCMRRDFVKH 
LKKKLKRMI 



60 
120 
180 



Seq ID NO: 2 99 DNA sequence 
75 Nucleic Acid Accession #: Eos sequence 
Coding sequence: 247-815 



80 
85 



AGTGTTCGGC 
GGCCAAACGG 
CCTAGGGGGC 
GGGAGGCGCC 
GAAACAATGA 
CCCAGGGAAT 
AAACAAGGAG 
TGACAGGACA 



11 
I 

TGGGGCAGGC 
GATCGGTGCT 
ACATTTCCCA 
ACAACTTCAC 
CCGATAAAAC 
GTGACAGTCC 
CAGGAGACAG 
TGCTATTCCA 



21 
I 

ACGCTGTGGC 
TCTGGTGAGA 
CAACTCCCAG 
TGCCATTTTG 
AGAGAAGGTG 
TTCGTATCAG 
CCTTATTGCA 
CCCAGCCAAT 



31 
I 

TGGCTACTTC 
CGCCTCCCCA 
AGGGCAGGTT 
TGAGGTGCCG 
GCTGTAGATC 
AAAAGGCAGA 
GGCTCTGCCA 
TGGATTCTCA 



41 

I 

CCTTCCTCCC 
TGCACATCAC 
TCTAGAAAGT 
CCGTCTCTCC 
CTGAAACTGT 
GGATGGCCCT 
TGTCCAAAGC 
GATTGATGAC 



51 

I 

ATCCCCCTTG 
TCCCAGGTGC 
GCCACCAGTG 
TCCAGCAAGG 
GTTTAAACGT 
GTTGGCAAGG 
AAAGAGCTTA 
TTCACTGGTT 



60 
120 
180 
240 
300 
360 
420 
480 



297 



10 



15 



20 



WO 02/086443 

TCAGCAAAGA TAGGATGATG CAGAAACCTG GTAGCAATGC ACCTGTGGGA 
CCAGCAGTTT CTCTGGAGAT GACCTAGAAT GCAGAGAAAC AGCCTCCTCT 
AACAAGAAAT TAATGCTGAT ATAAAACGTA AATTAGTGAA GGAACTCCGA 
AAAAATATGA AAAAATCTTC GAAATGCTTG AAGGAGTGCA AGGACCTACT 
AACGATTTTT TGAATCCATC ATCAAGGAAG CAGCAAGATG TATGAGACGA 
AGCACCTTAA GAAGAAACTG AAACGTATGA TTTGAGAATA CTTGTCCCTG 
ACACCCCAAA TGCATAATCT CATTAATGAT TGAGGAGAGA AAAGGATCAG 
TCTACAATGG AGCAGGATAT TGCTGAAGTC TCCTGGCATA TGTTACCGAA 
TTCCAGAGGC TAAGAAATTT CTGTTAGTAA AAGATGTTCT TTTTCCCAAA 
TGAAAGGATA ACTTGTGTTT TGGTTATTTT GTATTCCCAC CTGTGCTGGT 
AACCCATTAG GTAAATACTA TTACAGTCGT GGTTTCTGCA 

Seq ID NO: 3 00 Protein sequence: 
Protein Accession #: Eos sequence 



31 



41 



GGAAACGTTA 
CCCAAAAGCC 
TGCGTTGGAC 
GCAGTCAGGA 
GACTTTGTTA 
GAGGATTATC 
ATTGCTGTTT 
TCAACTGGCC 
GCGTTTTATT 
AGATATTATT 



51 



1 11 21 

I I I I I 

MTDKTEKVAV DPETVFKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKAKKLMT 
GHAIPPSQLD SQIDDFTGFS KDRMMQKPGS NAPVGGNVTS SFSGDDLECR ETASSPKSQQ 
EINADIKRKL VKELRCVGQK YEKIFEMLEG VQGPTAVRKR FFESIIKEAA RCMRRDFVKH 
LKKKLKRMI 



540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



PCT/US02/12476 



60 
120 
180 



25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



Seq ID NO: 301 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 247-812 



AGTGTTCGGC 
GGCCAAACGG 
CCTAGGGGGC 
GGGAGGCGCC 
GAAACAATGA 
CCCAGGGAAT 
AAACAAGGAG 
TGACAGGACA 
TCAGCAAAGA 
CCAGCAATTT 
AACAAGAAAT 
AATATGAAAA 
GATTTTTTGA 
ACCTTAAGAA 
CCCCAAATGC 
ACAATGGAGC 
CAGAGG CTAA 
AAGGATAACT 
CCATTAGGTA 



11 
1 

TGGGGCAGGC 
GATCGGTGCT 
ACATTTCCCA 
ACAACTTCAC 
CCGATAAAAC 
GTGACAGTCC 
CAGGAGACAG 
TGCTATTCCA 
TGGGATGATG 
CTCTGGAGAT 
TAATGCTGAT 
AATCTTCGAA 
ATCCATCATC 
GAAACTGAAA 
ATAATCTCAT 
AGGATATTGC 
GAAATTTCTG 
TGTGTTTTGG 
AATACTATTA 



21 
I 

ACGCTGTGGC 
TCTGGTGAGA 
CAACTCCCAG 
TGCCATTTTG 
AGAGAAGGTG 
TTCGTATCAG 
CCTTATTGCA 
CCCAGCCAAT 
CAGAAACCTG 
GACCTAGAAT 
ATAAAATGTC 
ATGCTTGAAG 
AAGGAAGCAG 
CGTATGATTT 
TAATGATTGA 
TGAAGTCTCC 
TTAGTAAAAG 
TTATTTTGTA 
CAGTCGTGGT 



31 

I 

TGGCTACTTC 
CGCCTCCCCA 
AGGGCAGGTT 
TGAGGTGCCG 
GCTGTAGATC 
AAAAGGCAGA 
GGCTCTGCCA 
TGGATTCTCA 
GTAGCAATGC 
GCAGAGGAAT 
AAGTAGTGAA 
GAGTGCAAGG 
CAAGATGTAT 
GAGAATACTT 
GGAGAGAAAA 
TGGCATATGT 
ATGTTCTTTT 
TTCCCACCTG 
TTCTGCA 



41 
I 

CCTTCCTCCC 
TGCACATCAC 
TCTAGAAAGT 
CCGTCTCTCC 
CTGAAACTGT 
GGATGGCCCT 
TGTCCAAAGA 
GATTGATGAC 
ACCTGTGGGA 
AGCCTCCTCT 
GGAAATCCGA 
ACCTACTGCA 
GAGACGAGAC 
GTCCCTGGAG 
GGATCAGATT 
TACCGAATCA 
TCCCAAAGCG 
TGCTGGTAGA 



51 
I 

ATCCCCCTTG 
TCCCAGGTGC 
GCCACCAGTG 
TCCAGCAAGG 
GTTTAAACGT 
GTTGGCAAGG 
AAAGAGCTTA 
TTCACTGGTT 
GGAAATGTTA 
CCCAAAAGCC 
TGCCTTGGAC 
GTCAGGAAAC 
TTTGTTAAGC 
GATTATCACA 
GCTGTTTTCT 
ACTGGCCTTC 
TTTTATTTGA 
TATTATTAAC 



Seq ID NO: 302 Protein sequence: 
Protein Accession #: Eos sequence 



11 



21 



31 



41 



51 



I 

MTDKTEKVAV DPETVFKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKEKKLMT 
GHAIPPSQLD SQIDDFTGFS KDGMMQKPGS NAPVGGNVTS NFSGDDLECR GIASSPKSQQ 
EINADIKCQV VKEIRCLGQY EKI FEMLEGV QGPTAVRKRF FESIIKEAAR CMRRDFVKHL 
KKKLKRMI 

Seq ID NO: 303 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 247-815 



AGTGTTCGGC 
GGCCAAACAG 
CCTAGGGGGC 
GGGAGGCGCC 
GAAACAATGA 
CCCAGGGAAT 
AAACAAGGAG 
TGACAGGACA 
TCAGCAAAGA 
CCAGCAGTTT 
AACAAGAAAT 
AAAAATATGA 
AACGATTTTT 
AGCACCTTAA 
ACACCCCAAA 
TCTACAATGG 
TTCCAGAGGC 
TGAAAGGATA 
AACCCATTAG 



11 
I 

TGGGACAGGC 
GATCGGTGCT 
ACATTTCCCA 
ACAACTTCAC 
CCGATAAAAC 
GTGACAGTCC 
CAGGAGACAG 
TGCTATTCCA 
TAGGATGATG 
CTCTGGAGAT 
TAATGCTGAT 
AAAAATCTTC 
TGAATCCATC 
GAAGAAACTG 
TGCATAATCT 
AGCAGGATAT 
TAAGAAATTT 
ACTTGTGTTT 
GTAAATACTA 



21 
i 

ACGCTGTGGC 
TCTGGTGAGA 
CAACTCCCAG 
TGCCATTTTG 
AGAGAAGGTG 
TTCGTATCAG 
CCTTATTGCA 
CCCAGCCAAT 
CAGAAACCTG 
GACCTAGAAT 
ATAAAACGTA 
GAAATGCTTG 
ATCAAGGAAG 
AAACGTATGA 
CGTTAATGAT 
TGCTGAAGTC 
CTGTTAGTAA 
TGGTTATTTT 
TTACAGTCGT 



31 
I 

TGGCTACTTC 
CGTCTCCCCA 
AGGGCAGGTT 
TGAGGTGCCG 
GCTGTAGATC 
AAAAGGCAGA 
GGCTCTGCCA 
TGGATTCTCA 
GTAGCAATGC 
GCAGAGAAAC 
AATTAGTGAA 
AAGGAGTGCA 
CAGCAAGATG 
TTTGAGAATA 
TGAGGAGAGA 
TCCTGGCATA 
AAGATGTTCT 
GTATTCCCAC 
GGTTTCTGCA 



41 
I 

CCTTCCTTCC 
TGCACATCAC 
TCTAGAAAGT 
CCGTCTCTCC 
CTGAAACTGT 
GGATGGCCCT 
TGTCCAAAGC 
GATTGATGAC 
ACCTGTGGGA 
AGCCTCCTCT 
GGAACTCCGA 
AGGACCTACT 
TATGAGACGA 
CTTGTCCCTG 
AAAGGATCAG 
TGTTACCGAA 
TTTTCCCAAA 
CTGTGCTGGT 



51 

1 

ATCCCCCTTG 
TCCCAGATGC 
GCCACCAGTG 
TCCAGCAAGG 
GTTTAAACGT 
GTTGGCAAGG 
AAAGAGCTTA 
TTCACTGGTT 
GGAAACGTTA 
CCCAAAAGCC 
TGCGTTGGAC 
GCAGTCAGGA 
GACTTTGTTA 
GAGGATTATC 
ATTGCTGTTT 
TCAACTGGCC 
GCGTTTTATT 
AGATATTATT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



Seq ID NO: 304 Protein sequence: 
Protein Accession #: Eos sequence 



298 



WO 02/086443 



PCT/US02/12476 



1 11 21 31 41 51 

'III 

MTDKTEKVAV DPETVFKRPR ECDSPSYQKR QRMALLARKQ GAGDSLIAGS AMSKAKKLMT 60 

GHAIPPSQLD SQIDDFTGFS KDRMMQKPGS NAP VGGNVT S SFSGDDLECR ETASSPKSQQ 12 0 

5 EINADIKRKL VKELRCVGQK YEKI FEMLEG VQGPTAVRKR FFESIIKEAA RCMRRDFVKH 180 
LKKKLKRMI 

Seq ID NO: 305 DNA sequence 
Nucleic Acid Accession #: Eos sequence 
10 Coding sequence: 87-689 

1 11 21 31 41 51 

1 I I I I I 

CGTGGAGGCA GCTAGCGCGA GGCTGGGGAG CGCTGAGCCG CGCGTCGTGC CCTGCGCTGC 60 

15 CCAGACTAGC GAACAATACA GTCAGGATGG CTAAAGGTGA CCCCAAGAAA CCAAAGGGCA 120 

AGATGTCCGC TTATGCCTTC TTTGTGCAGA CATGCAGAGA AGAACATAAG AAGAAAAACC 180 

CAGAGGTCCC TGTCAATTTT GCGGAATTTT CCAAGAAGTG CTCTGAGAGG TGGAAGACGA 240 

TGTCCGGGAA AGAGAAATCT AAATTTGATG AAATGGCAAA G G CAG AT AAA GTGCGCTATG 300 

ATCGGGAAAT GAAGGATTAT GGACCAG CT A AGGGAGGCAA GAAGAAGAAG GATCCTAATG 3 60 

20 CTCCCAAAAG GCCACCGTCT GGATTCTTCC TGTTCTGTTC AGAATTCCGC CCCAAGATCA 420 

AATCCACAAA CCCCGGCATC TCTATTGGAG ACGTGGCAAA AAAGCTGGGT GAGATGTGGA 480 

ATAATTTAAA TGACAGTGAA AAGCAGCCTT ACATCACTAA GGCGGCAAAG CTGAAGGAGA 540 

AGTATGAGAA GGATGTTGCT GACTATAAGT CGAAAGGAAA GTTTGATGGT GCAAAGGGTC 60 0 

CTGCTAAAGT TGCCCGGAAA AAGGTGGAAG AGGAAGATGA AGAAGAGGAG GAGGAAGAAG 6 60 

25 AGGAGGAGGA GGAGGAGGAG GATGAATAAA GAAACTGTTT ATCTGTCTCC TTGTGAATAC 72 0 

TTAGAGTAGG GGAGCGCCGT AATTGACACA TCTCTTATTT GAGAAGTGTC TGTTGCCCTC 780 

ATTAGGTTTA ATTACAAAAT TTGATCACGA TCATATTGTA GTCTCTCAAA GTGCTCTAGA 840 

AATTGTCAGT GGTTTACATG AAGTGGCCAT GGGTGTCTGG AGCACCCTGA AACTGTATCA 900 

AAGTTGTACA TATTTCCAAA CATT TTTAAA ATGAAAAGGC ACTCTCGTGT TCTCCTCACT 960 

30 CTGTGCACTT TGCTGTTGGT GTGACAAGGC ATTTAAAGAT GTTTCTGGCA TTTTCTTTTT 1020 

ATTTGTAAGG TGGTGGTAAC TATGGTTATT GGCTAGAAAT CCTGAGTTTT CAACTGTATA 1080 

TATCTATAGT TTGTAAAAAG AACAAAACAA CCGAGACAAA CCCTTGATGC TCCTTGCTCG 1140 

GCGTTGAGGC TGTGGGG AAG ATGCCTTTTG GGAGAGGCTG TAGCTCAGGG CGTGCACTGT 1200 

GAGGCTGGAC CTGTTGACTC TGCAGGGGGC ATCCATTTAG CTTCAGGTTG TCTTGTTTCT 1260 

35 GTATATAGTG ACATAGCATT CTGCTGCCAT CTTAGCTGTG GACAAAGGGG GGTCAGCTGG 1320 

CATGAGAATA TTTTTTTTTT TAAGTGCGGT AGTTTTTAAA CTGTTTGTTT TTAAACAAAC 1380 

TAT AGAACT C TTCATTGTCA GCAAAGCAAA GAGTCACTGC ATCAATGAAA GTTCAAGAAC 1440 

CTCCTGTACT TAAACACGAT TCGCAACGTT CTGTTATTTT TTTTGTATGT TTAGAATGCT 15 0 0 

GAAATGTTTT TGAAGTTAAA TAAACAGTAT TACATTTTTA AAACTCTTCT CTATTATAAC 1560 

40 AGTCAATTTC TGACTCACAG CAGTGAACAA ACCCCCACTC CATTGTATTT GGAGACTGGC 162 0 

CTCCCTATAA ATGTGGTAGC TTCTTTTATT ACTCAGTGGC CAGCTCACTT AGGGCTGAGA 1680 

TGAAGGAGAG GGCTACTTGA AGCTACTGTG TGATTT TGTT TGTGTCTGAG TGGCATTCAG 1740 

ATGAAGT CTG GAGGAGTTAG GAGAACGACA TAGGCAAGGT TCAGCAGCCT TCCAAGGTAT 1800 

AGGAAGGTGG GTGATTAGGA CTGAGGCTAT CTAGGTTTAA CTTTTGTCCC ACCTCCACCC 1860 

45 CCTATTTTGT GGGGCCAAAT GCATTGCTAA ACAGCAATTT CAGAGTGTAT GGTGTGTCAA 1920 

AAATTAAGGC CTTATTGTTT TTCTCTTTCA CCCCTACCCC CCGTGCTCCT GGCACATATC 1980 

ACATTATTTG TGGTGCCCAA CATTTGGGGT CTTGAGCCTG CTGCTGGTCT CCTGGATGCC 2 040 

AGTGAGGGTA TGTGGGATGG GGTGGTGGGG TAGGGGACGG TATCCTTTTT TTGCTCCTAC 2100 

TTGGAAACAC CAAACACCCC AAGGAAGATG ATAGGCTCCA TCTTGGGCCA CCTGAGCTAT 2160 

50 AGGGCAGGCT AATGGAATCA ACCATTTCTG AGCACTAAAT GTATCATGAA AAGTTGAATG 2220 

GCCTGCTCAT AAGTTTAGCT CATTCACTGG AAATGTAGAT TGATGTTCAA TGTTAAACTG 2280 

GAAGGAGCTT GGTTTGTGTG TCAGTGGTTA TATTAGTGGG TAGTGTAACA TTTTATCCAG 2 3 40 

GTTGGGGTGA GGGGAGATGG CCACAGTAGC AAGTGGTGAC ACTAAATACC ATTTTGAAGG 240 0 

CTGATGTGTA TATACATCAT TACTGTCCGT AGCAATGAAG GATACAGTAC TGTGTTGTGG 2460 

55 GTGAGTGTTG CTATTGCCCA GCATTAATAT TTGGGTGTGT ATGTTTGAGG CTATGAAACA 252 0 

CGCAGGAGTG TTTTTGTGCT ATTAATTTTA AGAGAAAGCA GCTTTTTCTT AAAATTCACT 2580 

GTTGAGAAAC TTGCATGTCT GGAGGCGGTG TCCTCTCCGC CCTGTCGGGT CCTGGATGAG 2640 

TACGAGTTAT GGTCACGGTC ACAGCCTGAT CTCTTATGTG TTCATAGCCA TTCGCTCTCC 270 0 

CATCAGAACT GTTTGTCCTG AATGTGTTCC TCTAGT TCTA GAAAATGACC ACTAATTTAA 2760 

60 AAAACTCGGT TGTGAGGTTT GCCCAGAGGC ACTTGTTCCA GAATTTCCCC TCCTGCTTCA 2 82 0 

GCCATGTCCT TGTCACTTGG CATT CTAAGC TAAAGCTTTA GCTT CCCAAT TCGTGATGTG 2 880 

CTAGGCCAAG ATTCGGGAGC TGTTGCCAGC CTCGTCAAAT ATGGAAGAGA AACAACCTGC 2 940 

GGTCAAAAGG GAGTGATTTG TTAAGTGGTG CGCGTCTATC TCATAACTAG ATGTACCAAC 3000 

CAGGGAAGGG CCAAGGATGG AAAGGGGTAA CTTTTGTGCT TCCAAAGTAG CTAAGCAGAA 3 060 

65 GTGGGGGAGC AGTTTAGCCA GATGAT CTTT GATTAGGCAA ACATTGAGTT TTAAAGAGGC 312 0 

TGTCAAGTTG AGGCCACTTG GTCCATTAGC TGGGGCAGCA AGATCACTAC TCAACGTTTT 3180 

CACACTGTGG CAAGATTGCT CTTCTAGTGG AATAATGCCC TAGTTTCTCT GAGATGATGT 3240 

AAGTGGCATG ATGTTACCTA AGGCTTAGGC TTAGCTTGAT TTCTGGGCCC ACTGTCTGTG 33 00 

TTCTTAAGAT GCCAACCTGT TGCTTTTTTT TTTTTTTTCC CCCATTTAAA AGGATAGTAC 3360 

70 CTACTCCCTC TAACCACCTC ACCCCATTCT TGAATGACAT TTTATCCTTC GGAAAGAACA 3420 

AGGCTGTGAT GTAGTGACTA TTGT CTGTGT CTCCTGTGTG TGTCTGTTCT TGTCACAAAT 3480 
GTATTTGGGG ACGTTGGATG CATT CATTTT CTGTAATAAA G 

Seq ID NO: 306 Protein sequence: 
75 Protein Accession #: NP_005333.1 

1 11 21 31 41 51 

I I i I I I 

MAKGDPKKPK GKMSAYAFFV GTCREEHKKK NPEVPVNFAE FSKKCSERWK TMSGKEKSKF 60 

80 DEMAKADKVR YDREMKDYGP AKGGKKKKDP NAPKRPPSGF FLFCSEFRPK IKSTNPGISI 120 

GDVAKKLGEM WNNLND SEKQ PYITKAAKLK EKYEKDVADY KSKGKFDGAK GPAKVARKKV 180 
EEEDEEEEEE EEEEEEEEDE 

Seq ID NO: 307 DNA sequence 
85 Nucleic Acid Accession #: NM_022342 
Coding sequence: 1..2178 



299 



WO 02/086443 



1 11 21 31 41 51 

I 1 I 1 I I 

ATGGGTACTA GGAAAAAAGT TCATGCATTT GTCCGTGTCA AACCCACCGA TGACTTTGCT 60 

CATGAAATGA TCAGATACGG AGATGACAAA AGAAGCATTG ATATTCACTT AAAAAAAGAC 120 

ATTCGGAGAG GAGTTGTCAA TAACCAACAG ACAGACTGGT CGTTTAAGTT GGATGGAGTT 180 

TTCACGATG CCTCCCAGGA CTTGGTTTAT GAGACAGTTG CAAAGGATGT GGTTTCTCAG 240 

CCCTCGATG GCTATAATGG CACCATCATG TGTTATGGGC AGACGGGAGC TGGCAAGACA 300 

ACACCATGA TGGGGG CAAC TGAGAATTAC AAGCACCGGG GGATCCTCCC TCGTGCCCTG 360 

AGCAGGTTT TTAGGATGAT CGAAGAACGC CCCACACATG CCATCACTGT GCGTGTTTCC 42 0 

ACTTGGAAA TCTATAATGA GAGCCTGTTT GATCTCCTGT CCACTCTGCC CTATGTTGGA 480 

CCTCAGTCA CACCAATGAC CATCGTGGAA AACCCTCAAG GAGTCTTCAT TAAGGGCTTG 540 

CAGTTCACC TCACAAGTCA GGAGGAGGAT GCATTCAGCC TCCTTTTTGA GGGTGAGACC 600 

ACAGGATTA TAGCCTCCCA CACTATGAAC AAAAACTCTT CCAGATCACA CTGCATTTTC 660 

CCATCTACT TAGAGGCCCA TTCCCGGACC TTATCAGAGG AAAAGTACAT CACTTCCAAA 72 0 

TTAACTTGG TGGATCTGGC AGGCTCAGAG AGGCTGGGGA AGTCTGGGTC TGAGGGCCAA 780 

TCCTGAAGG AAGCCACCTA CATCAACAAA TCGCTCTCAT TCCTGGAGCA GGCCATCATT 840 

CCCTTGGGG ACCAGAAGCG GGACCACATC CCCTTTCGGC AGTGCAAGCT CACCCACGCT 900 

TGAAGGACT CGTTAGGGGG AAACTGCAAT ATGGTCCTCG TGACAAACAT CTATGGAGAA 960 

CTGCCCAGT TAGAAGAAAC GCTATCTTGA CTGAGATTTG CCAGCAGGAT GAAGCTAGTC 1020 

CCACTGAGC CTGCCATCAA TGAAAAGTAT GATGCTGAGA GAATGGT CAA GAACCTGGAG 1080 

AGGAACTAG CACTACTCAA GCAGGAG CTG GCTATCCATG ACAGCCTGAC CAACCGCACC 1140 

TTGTGACCT ATGACCCCAT GGATGAAATC CAGATTGCTG AGATCAACTC CCAGGTGCGG 120 0 

GGTACCTGG AGGGGACACT GGACGAGATC GACATAATCA GCCTTAGACA GATCAAGGAG 12 60 

TGTTCAACC AGTTCCGGGT GGTTCTGAGC CAACAGGAAC AGGAAGTGGA GTCCACTTTG 1320 

GCAGGAAGT ACACCCTCAT TGACAGGAAT GACTTTGCAG CCATTTCTGC TATCCAGAAG 1380 

CGGGGCTTG TGGATGTTGA TGGCCACCTA GTGGGTGAGC CTGAAGGACA AAACTTTGGA 1440 

TCGGAGTCG CCCCTTTCTC TACCAAACCT GGGAAGAAAG CCAAGTCCAA GAAGACATTC 1500 

AAGAGCCAC TCAGGCCCGA CACCCCACCC TCCAAACCAG TGGCCTTTGA GGAGTTTAAG 1560 

ATGAGCAAG GTAGTGAGAT CAACCGAATT TTCAAAGAAA ACAAATCCAT CTTGAATGAA 1620 

GGAGGAAAA GGGCCAGCGA GACCACACAG CACAT CAATG CC AT CAAGCG GGAGATTGAT 1680 

TGACCAAGG AGGCCCTGAA TTTCCAGAAG TCACTACGGG AGAAGCAAGG CAAGTACGAA 1740 

ACAAGGGGC TGATGATCAT CGATGAGGAA GAATTCCTGC TGATCCTCAA GCT CAAAGAC 1800 

TCAAGAAGC AGTACCGCAG CGAGTACCAG GACCTGCGTG ACCTCAGGGC TGAGAT CCAG 1860 

ATTGCCAGC ACCTAGTGGA TCAGTGTCGC CACCGCCTGC TCATGGAATT TGACATCTGG 1920 

ACAATGAGT CCTTTGTCAT CCCTGAGGAC ATGCAGATGG CACTGAAGCC AGGCGGCAGC 1980 

TCCGGCCAG GCATGGTCCC TGTGAACAGG ATTGTGTCTC TGGGAGAAGA TGACCAGGAC 2040 

AATTCAGCC AGCTGCAGCA GAGGGTGCTT CCTGAGGGCC CTGATTCCAT CTCCTTCTAC 2100 

ATGCCAAAG T CAAGATAG A GCAGAAGCAT AATTACTTGA AAACCATGAT GGGCCTCCAG 2160 
AGGCACATA GAAAATAG 



Seq ID NO: 3 08 Protein sequence: 
Protein Accession #: NP_071737 



1 li 21 31 41 51 

I l i l l l 

MGTRKKVHAF VRVKPTDDFA HEMIRYGDDK RSIDIHIUKKD IRRGWNNQQ TDWSFKLDGV 60 

LHDASQDLVY ETVAKDWSQ ALDGYNGTIM CYGQTGAGKT YTMMGATENY KHRGILPRAL 120 

QQVFRMIEER PTHAITVRVS YLEIYNESLF DLLSTLPYVG PSVTPMTIVE NPQGVFIKGL 180 

SVHLTSQEED AFSLLFEGET NRI I ASHTMN KNSSRSHCIF TIYLEAHSRT LSEEKYITSK 240 

INLVDLAGSE RLGKSGSEGQ VLKEATYINK SLSFLEQAII ALGDQKRDHI PFRQCKLTHA 300 

LKD S LGGNCN MVLVTNIYGE AAQLEETLSS LRFASRMKLV TTEPAINEKY DAERMVKNLE 360 

KELiALLKQEL AIHDSLTNRT FVTYDPMDEI QIAEINSQVR RYLEGTLDEI DIISLRQIKE 420 

VFNQFRWLS QQEQEVESTL RRKYTLIDRN DFAAISAIQK AGLVDVDGHL VGEPEGQNFG 480 

LGVAPFSTKP GKKAKSKKTF KEPLRPDTPP SKPVAFEEFK NEQGSEINRI FKENKSILNE 540 

RRKRASETTQ HINAIKREID VTKEADNFQK SLREKQGKYE NKGLMI IDEE EFLLILKLKD 600 

LKKQYRSEYQ DLRDLRAEIQ YCQHLVDQCR HRLLMEFDIW YNESFVIPED MQMALKPGGS 660 

IRPGMVPVNR IVSLGEDDQD KFSQLQQRVL PEGPDSISFY NAKVKIEQKH NYLKTMMGLQ 72 0 
QAHRK 



Seq ID NO: 309 DNA sequence 

Nucleic Acid Accession #: CAT cluster 



1 11 21 31 41 51 

I I I I I I 

TTTTTTTTTT TTTTTTTTAA TGCCTGCTGT CATGCTCTGT CTACCAGGGT GAATTTCCAA 60 

AAATTTCTGC ATAGCAATTT TAGCCAAAAC TATATATGTT CTGGGGAGGA TAGGCATAGG 12 0 

CACATTGAAG ACCAAAGGAA AGAGTGAAGA AGTGTAGTTG GGTCATTGTG AATGGATGTT 180 

TAGATTGTCA AGAAAAGTGG GCCAGAGGCC CCACCTCACA CTAGGACGGC AATTGCCTCT 240 

CATTAGTATC TCAGGCACCA TGGGTCT TAT TTGGTGTCAT AAGAAACACC CTCAACAAAG 300 

TAATGAACCC TCAGCCTCCA GCTTCTCTTC TTCGGGATTC TTCTTAGGGC CTCCTTTTTC 3 60 

CTTTTATGTT TCCAGTACCC TGAATTTCTT ATTCCCATCC CCCATTAAAA TCTGCTTCAA 42 0 

AGAAAAAACA AGAAGGACAC ATTCACTTTA AGATCCAAAT GAATGATAAG AGCTTAAAAC 480 
ATTATACTTA TCAGTATTAT TTGCATTTTT ATAGAAACCA AAACCATATT TCAACAAC 



Seq ID NO: 310 DNA sequence 

Nucleic Acid Accession #: NM_018622.2 

Coding sequence: 1-1140 

1 11 21 31 41 51 

I I I I I i 

ATGGCGTGGC GAGGCTGGGC GCAGAGAGGC TGGGGCTGCG GCCAGGCGTG GGGTGCGTCG 60 
GTGGGCGGCC GCAGCTGCGA GGAGCTCACT GCGGTCCTAA CCCCGCCGCA GCTCCTCGGA 120 
CGCAGGTTTA ACTTCTTTAT TCAACAAAAA TGCGGATTCA GAAAAGCACC CAGGAAGGTT 180 
GAACCTCGAA GATCAGACCC AGGGACAAGT GGTGAAGCAT ACAAGAGAAG TGCTTTGATT 240 
CCT CCTGTGG AAGAAACAGT CTTTTATCCT TCTCCCTATC CTATAAGGAG TCTCATAAAA 300 
CCTTTATTTT TTACTGTTGG GTTTACAGGC TGTGCATTTG GATCAGCTGC TATTTGGCAA 3 60 



300 



10 



15 



WO 02/086443 

TATGAATCAC TGAAAT CCAG GGTCCAGAGT TATTTTGATG GTATAAAAGC TGATTGGTTG 420 

GATAGCATAA GACCACAAAA AGAAGGAGAC TTCAGAAAGG AGATTAACAA GTGGTGGAAT 480 

AACCTAAGTG ATGGCCAGCG GACTGTGACA GGTATTATAG CTGCAAATGT CCTTGTATTC 540 

TGTTTATGGA GAGTACCTTC TCTGCAGCGG ACAATGATCA GATATTTCAC AT CGAATCCA 600 

GCCTCAAAGG TCCTTTGTTC TCCAATGTTG CTGTCAACAT TCAGTCACTT CTCCTTATTT 660 

CACATGGCAG CAAATATGTA TGTTTTGTGG AGCTTCTCTT CCAG CATAGT GAACATTCTG 720 

GGTCAAGAGC AGTTCATGGC AGTGTACCTA TCTGCAGGTG TTATTTCCAA TTTTGTCAGT 780 

TACCTGGGTA AAGTTGCCAC AGGAAGATAT GGACCATCAC TTGGTGCATC TGGTGCCATC 840 

ATGACAGTCC TCGCAGCTGT CTGCACTAAG ATCCCAGAAG GGAGGCTTGC CATTATTTTC 900 

CTTCCGATGT TCACGTTCAC AGCAGGGAAT GCCCTGAAAG CCATTATCGC CATGGATACA 960 

GCAGGAATGA TCCTGGGATG GAAATTTTTT GATCATGCGG CACATCTTGG GGGAGCTCTT 102 0 

TTTGGAATAT GGTATGTTAC TTACGGTCAT GAACTGATTT GGAAGAACAG GGAGCCGCTA 10 80 
GTGAAAATCT GGCATGAAAT AAGGACTAAT GGCCCCAAAA AAGGAGGTGG CTCTAAGTAA 



Seq ID NO: 311 Protein sequence: 
Protein Accession #: NP_061092.2 



PCT/US02/12476 



20 



25 



MAWRGWAQRG 
EPRRSDPGTS 
YESLKSRVQS 
CLWRVPSLQR 
GQEQFMAVYL 
LPMPTFTAGN 
VKIWHEIRTN 



11 

1 

WGCGQAWGAS 
GEAYKRSALI 
YFDGIKADWL 
TMIRYFTSNP 
SAGVISNFVS 
ALKAI IAMDT 
GPKKGGGSK 



21 
I 

VGGRSCEELT 
PPVEETVFYP 
DSIRPQKEGD 
ASKVLCSPML 
YLGKVATGRY 
AGMILGWKFF 



31 

I 

AVLTPPQLLG 
SPYPIRSLIK 
FRKEINKWWN 
LSTFSHFSLF 
GPSLGASGAI 
DHAAHLGGAL 



41 
I 

RRFNFFIQQK 
PLFFTVGFTG 
NLSDGQRTVT 
HMAANMYVLW 
MTVLAAVCTK 
FGIWYVTYGH 



51 
1 

CGFRKAPRKV 
CAFGSAAIWQ 
GIIAANVLVF 
SFSSSIVNIL 
IPEGRLAIIF 
ELIWKNREPL 



60 
120 
180 
240 
300 
360 



Seq ID NO: 312 DNA sequence 
30 Nucleic Acid Accession #: NM_000625 
Coding sequence: 195.. 3656 



35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



CTCTCGGCCA 
CACAGGTCTC 
GCCCCACAGT 
GTAAAGCCAT 
ATGCAATGAA 
CCAGTCCAGT 
CCCCGCAGCC 
CAACCCCATT 
TCCAAGACAC 
TGGGGTCCAT 
CAGATGAGCT 
AGGCAAAAAT 
CAGTAACCTA 
ATGCCCCACG 
GCTGTTCCAC 
ACAATGGCAA 
ACTTCCGGGT 
GCATCAGAGG 
AGCCCAAGTA 
CTGAGCTCTT 
ACGAGTGGTT 
TGCTTGAGGT 
CAGAGATCGG 
GCAGGAGAAT 
TTGAGATCAA 
ACCACTCGGC 
GCTGCCCGGC 
TTCACCAGGA 
GGAAAACCCA 
AAGTCTTGGT 
GAGTCAGAGT 
ACCTGGGGGC 
GGCTGAGCTG 
GAGACTGCCC 
ACAACAAATT 
CCTTTGCTCA 
TGGGAGAAGG 
CCTTCAAGGC 
AGCTCTACAC 
AGCCTTTGGA 
GGCTCAAATC 
AACTCTCCTG 
GCCCAGGCAA 
CACCCCACCA 
ACAAGAGGCT 
CACCCCCAAC 
GACAGAGGCT 
GCCCCACATT 
TGCTTTCCCA 
ACACGCCCAC 
AGGGTCCCCT 
CAGTGCCCTG 



11 

I 

CCTTTGATGA 
TTCCTGGTTT 
GAAGAACATC 
AGAGATGGCC 
TGGGGAAAAA 
GACACAGGAT 
CCTCGTGGAG 
GTCCTCCCCA 
ACTTCACCAT 
TATGACTCCC 
TCTACCTCAA 
AGAGGAACAT 
CCAACTGACG 
CTGCATTGGG 
TGCCCGGGAA 
CATCAGGTCG 
GTGGAATGCT 
GGACCCTGCC 
CGGCCGCTTC 
CGAAATCCCA 
TCGGGAACTG 
GGGCGGCCTG 
AGTCCGGGAC 
GGGCCTGGAA 
CATTGCTGTG 
TGCAGAATCC 
AGACTGGATT 
GATGCTGAAC 
TGTCTGGCAG 
CAAAGCTGTG 
CACCATCCTC 
CTTATT CAGC 
CCTGGAGGAG 
TGGCAATGGA 
CAGGTACGCT 
TGACATTGAT 
GGATGAGCTC 
AGCCTGTGAG 
CTCCAATGTG 
CCTCAGCAAA 
TCGGCAGAAT 
TGAGGATGGC 
CCAGCCGGCC 
GGCAGTGCGC 
GCCCCCCTGC 
CCAGCTGCTG 
GGAGGCCCTG 
CCTGGAGGTG 
GCTCCCCATT 
GGAGATCCAC 
GCACCACGGC 
CTTTGTGCGG 



21 

I 

GGGGACTGGG 
GACTGTCCTT 
TGAGCTCAAA 
TGT CCTTGGA 
GGCATCAACA 
GACCTTCAGT 
ACGGGAAAGA 
CGGCATGTGA 
AAGGCCAAAG 
AAAAGTTTGA 
GCTATCGAAT 
CTGGCCAGGG 
GGAGATGAGC 
AGGAT CCAGT 
ATGTTTGAAC 
GCCATCACCG 
CAGCTCATCC 
AACGTGGAAT 
GATGTGGTCC 
CCTGACCTTG 
GAGCTAAAGT 
GAGTTCCCAG 
TTCTGTGATG 
ACGCACAAGC 
CTCCATAGTT 
TTCATGAAGT 
TGGCTGGTCC 
TACGTCCTGT 
GACGAGAAGC 
CTCTTTGCCT 
TTTGCGACAG 
TGTGCCTTCA 
GAACGGCTGC 
GAGAAACTGA 
GTGTTTGGCC 
CAGAAGCTGT 
AGTGGGCAGG 
ACGTTTGATG 
ACCTGGGACC 
GCCCTCAGCA 
CTACAAAGTC 
CAAGGCCTGA 
CTGGTCCAAG 
CTGGAGGCCC 
TCACTCAGCC 
CTCCAAAAGC 
TGCCAGCCCT 
CTAGAGGAGT 
CTGAAGCCCA 
CTGACTGTGG 
GTCTGCAGCA 
AATGCCAGCG 



31 

i 

CAGTTCTAGA 
ACCCCGGGGA 
TCCAGATAAG 
AATTTCTGTT 
ACAATGTGGA 
ATCACAACCT 
AGTCTCCAGA 
GGATCAAAAA 
GGATTTTAAC 
CCAGAGGACC 
TTGTCAACCA 
TGGAAGCGGT 
TCATCTTCGC 
GGTCCAACCT 
ACATCTGCAG 
TGTTCCCCCA 
GCTATGCTGG 
TCACTCAGCT 
CCCTGGTCCT 
TGCTTGAGGT 
GGTACGCCCT 
GGTGCCCCTT 
TCCAGCGCTA 
TGGCCTCGCT 
TCCAGAAGCA 
ACATGCAGAA 
CTCCCATGTC 
CCCCTTTCTA 
GG AG AC C C AA 
GTATGCTGAT 
AGACAGGAAA 
ACCCCAAGGT 
TGTTGGTGGT 
AGAAATCGCT 
TCGGCTCCAG 
CCCACCTGGG 
AGGACGCCTT 
TCCGAGGCAA 
CGCACCACTA 
GCATGCATGC 
CGACATCCAG 
ACTACCTGCC 
GCATCCTGGA 
TGGATGAGAG 
AGGCCCTCAC 
TGGCCCAGGT 
CAGAGTACAG 
TCCCGTCCCT 
GGTTCTACTC 
CCGTGGTCAC 
CATGGCTCAA 
GCTTCCACCT 



41 

I 

CAGTCCCGAA 
GGCAGTGCAG 
TGACATAAGT 
CAAGACCAAA 
GAAAGCCCCC 
CAGCAAGCAG 
ATCTCTGGTC 
CTGGGGCAGC 
TTGCAGGTCC 
CAGGGACAAG 
ATATTACGGC 
AACAAAGGAG 
CACCAAGCAG 
GCAGGTCTTC 
ACACGTGCGT 
GCGGAGTGAT 
CTACCAGATG 
GTGCATCGAC 
GCAGGCCAAT 
GGCCATGGAA 
GCCTGCAGTG 
CAATGGCTGG 
CAACATCCTG 
CTGGAAAGAC 
GAATGTGACC 
TGAATACCGG 
TGGGAGCATC 
CTACTATCAG 
GAGAAGAGAG 
GCGCAAGACA 
ATCAGAGGCG 
TGTCTGCATG 
GACCAGTACG 
CTTCATGCTG 
CATGTACCCT 
GGCCTCTCAG 
CCGCAGCTGG 
ACAGCACATT 
CAGGCTCGTG 
CAAGAACGTG 
CCGTGCCACC 
GGGGGAGCAC 
GCGAGTGGTG 
TGGCAGCTAC 
CTACTTCCTG 
GGCCACAGAA 
CAAGTGGAAG 
GCGGGTGTCT 
CATCAGCTCC 
CTACCACACC 
CAGCCTGAAG 
CCCCGAGGAT 



51 
1 

GTTCTCAAGG 
CCAGCTGCAA 
GACCTGCTTT 
TTCCACCAGT 
TGTGCCACCT 
CAGAATGAGT 
AAGCTGGATG 
GGGATGACTT 
AAATCTTGCC 
CCTACCCCTC 
TCCCTCAAAG 
ATAGAAACAA 
GCCTGGCGCA 
GATGCCCGCA 
TACTCCACCA 
GGCAAGCACG 
CCAGATGGCA 
CTGGGCTGGA 
GGCCGTGACC 
CATCCCAAAT 
GCCAACATGC 
TACATGGGCA 
GAGGAAGTGG 
CAGGCTGTCG 
ATCATGGACC 
TCCCGTGGGG 
ACCCCCGTGT 
GTAGAGGCCT 
ATTCCATTGA 
ATGGCGTCCC 
CTGGCCTGGG 
GATAAGTACA 
TTTGGCAATG 
AAAGAGCTCA 
CGGTTCTGCG 
CTCACCCCGA 
GCCGTGCAAA 
CAGATCCCCA 
CAGGACTCAC 
TTCACCATGA 
ATCCTGGTGG 
CTTGGGGTTT 
GATGGCCCCA 
TGGGT CAGTG 
GACATCACCA 
GAGCCTGAGA 
TTCACCAACA 
GCTGGCTTCC 
CCCCGGGATC 
CGAGATGGCC 
CCCCAAGACC 
CCCTCCCATC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 



301 



10 



15 



WO 02/086443 

CTTGCATCCT CATCGGGCCT GGCACAGGCA TCGCGCCCTT CCGCAGTTTC 
GGCTCCATGA CTCCCAGCAC AAGGGAGTGC GGGGAGGCCG CATGACCTTG 
GCCGCCGCCC AGATGAGGAC CACATCTACC AGGAGGAGAT GCTGGAGATG 
GGGTGCTGCA TGCGGTGCAC ACAGCCTATT CCCGCCTGCC TGGCAAGCCC 
TTCAGGACAT CCTGCGGCAG CAGCTGGCCA GCGAGGTGCT CCGTGTGCTC 
CAGGCCACCT CTATGTTTGC GGGGATGTGC GCATGGCCCG GGACGTGGCC 
AGCAGCTGGT GGCTGCCAAG CTGAAATTGA ATGAGGAGCA GGT CGAGGAC 
AGCT CAAGAG CCAGAAGCGC TAT CACGAAG ATATCTTTGG TGCTGTATTT 
CGAAGAAGGA CAGGGTGGCG GTGCAGCCCA GCAGCCTGGA GATGT CAGCG 
CTACAGGAGG GGTTAAAGCT GCCGGCACAG AACTTAAGGA TGGAGCCAGC 
CTGAGGTCAC AGGGCCTGGG GAG AT GG AGG AAAGTGATAT CCCCCAGCCT 
TTCCTCAACG TTGCTCCCCA TCAAGCCCTT TACTTGACCT CCTAACAAGT 
ATTGATCGGA GCCTC 

Seq ID NO: 313 Protein sequence: 
Protein Accession #: NP^OOOeie 



PCT/US02/12476 



TGGCAGCAAC 

GCCCAGAAGG 
AAGGTCTATG 
CACAAGGAGC 
CACACCCTGA 
TATTTCTTTC 
CCTTACGAGG 
CTCTGAGGGC 
TCTGCATTAT 
CAAGTCTTAT 
AGCACCCTGG 



3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 



20 
25 
30 
35 
40 
45 



MACPWKFLFK 
VETGKKSPES 
TPKSLTRGPR 
LTGDELIFAT 
RSAITVFPQR 
RFDWPLVLQ 
GLEFPGCPFN 
AVLHSFQKQN 
LNYVLSPFYY 
ILFATETGKS 
NGEKLKKSLF 
ELSGQEDAFR 
SKALSSMHAK 
PALVQGILER 
LLLQKLAQVA 
PILKPRFYSI 
VRNASGFHLP 
EDHIYQEEML 
VCGDVRMARD 
VAVQPSSLEM 



11 
I 

TKFHQYAMNG 
LVKLDATPLS 
DKPTPPDELL 
KQAWRNAPRC 
SDGKHDFRVW 
ANGRDPELFE 
GWYMGTEIGV 
VTIMDHHSAA 
YQVEAWKTHV 
EALAWDLGAL 
MLKELNNKFR 
SWAVQTFKAA 
NVFTMRLKSR 
WDGPTPHQA 
TEEPERQRLE 
SSPRDHTPTE 
EDPSHPCILI 
EMAQKGVLHA 
VAHTLKQLVA 
SAL 



21 

I 

E KG I NNNVEK 
SPRHVRIKNW 
PQAIEFVNQY 
IGRIQWSNLQ 
NAQLIRYAGY 
IPPDLVbEVA 
RDFCDVQRYN 
ESFMKYMQNE 
WQDEKRRPKR 
FSCAFNPKW 
YAVFGLGSSM 
CET FDVRGKQ 
QNLQSPTSSR 
VRLEALDESG 
ALCQPSEYSK 
IHLTVAWTY 
GPGTGIAPFR 
VHTAYSRLPG 
AKLKLNEEQV 



31 
i 

APCATSSPVT 
GSGMTFQDTL 
YGSLKEAKIE 
VFDARSCSTA 
QMPDGSIRGD 
MEHPKYEWFR 
I LEEVGRRMG 
YRSRGGCPAD 
REIPLKVLVK 
CMDKYRLSCL 
YPRFCAFAHD 
HIQIPKLYTS 
ATILVELSCE 
SYWVSDKRLP 
WKFTNSPTFL 
HTRDGQGPLH 
SFWQQRLHDS 
KPKVYVQDIIi 
EDYFFQLKSQ 



41 
I 

QDDLQYHNLS 
HHKAKG I LTC 
EHLARVEAVT 
REMFEHICRH 
PANVEFTQLC 
ELELKWYALP 
LETHKLASLW 
WIWLVPPMSG 
AVLFACMLMR 
EEERLIiLWT 
IDQKLSHLGA 
NVTWDPHHYR 
DGQGLNYLPG 
PCSLSQALTY 
EVLEEFPSLR 
HGVCSTWLNS 
QHKGVRGGRM 
RQQLASEVLR 
KRYHEDIFGA 



51 
I 

KQQNESPQPL 
RSKSCLGSIM 
KEIETTVTYQ 
VRYSTNNGNI 
I DLGWKPKYG 
AVANMLLEVG 
KDQAWEINI 
SITPVFHQEM 
KTMASRVRVT 
STFGNGDCPG 
SQLTPMGEGD 
LVQDSQPLDL 
EHLGVCPGNQ 
FLDITTPPTQ 
VSAGFLLSQL 
LKPQDPVPCF 
TLVFGCRRPD 
VLHKEPGHLY 
VFPYEAKKDR 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



Seq ID NO: 314 DNA sequence 
Nucleic Acid Accession #: XM_087254 
Coding sequence: 47.. 2332 



50 
55 
60 
65 
70 
75 
80 
85 



AGAGTACGTG 
ATGTTCAATT 
AACACCAGAC 
CTTAT CCCAT 
AATTAAAGAA 
CAATGTTCAA 
GTTGGAGTAC 
TGGTATTGTG 
GGAACGGTAC 
AATTGTTCAG 
TCTCCCTAAA 
TTTGAAAGGG 
GGAAATAGAT 
GGCAGCTGTT 
AGACAGACTA 
AGTATGGGTA 
CCATTTTCAT 
TGCTGAACAA 
GCTGGTAGTG 
GGAAGTTTGC 
AAAAGTAATA 
TGGTGCTAAT 
AGAAGGAAGA 
CAAATTGCTT 
TTTTTTTTAT 
GTTTTCTCAG 
TTCCCTACCT 
AAATAAGCCC 
TCTTTATTGG 
ACTAATAGGG 
TGGCACTTTG 
TCATTTTTGG 
ATTTTCCTTG 
TGTGTTTATT 
TACATGTCTA 
TACTGAAAAG 
CTGTTTCCCG 
AGGAAGATGT 
CAACGACAGG 



11 

i 

TTTACAGATA 
AATGGCATGA 
TCTTCAGAAG 
CTTACAACCA 
CATGATCTCT 
ACTGACTGCA 
TATGCATCTT 
TTTATTGGCA 
AAACTGCTTC 
GCACCTTCAG 
TGTATAGGTG 
CTAAGAACTC 
AAACGCATAT 
TTCCAGTTCA 
CAAGATAAAG 
CTTACTGGGG 
AGAACCATGA 
TTGAGGCAGC 
GATGGGACCA 
AGAAATTGTT 
AGACTAATAA 
GACGTAAGCA 
CAGGCTGCAA 
TTTGTTCATG 
AAGAATGTGT 
CAAACATTGT 
ATTCTGATAT 
ACCCTTTATC 
ACCATCCTGG 
AAAGATACAT 
GTCTTCACAG 
ACTTGGATCA 
TTTTATGGAG 
CAGCTCCTGT 
TTTCTTGATA 
GCACAGCTTA 
GAAGGAGAAG 
AGTCCAACCC 
AGCATCTTGA 



21 
I 

AAAC TGGT AC 
AATACCAAGA 
GAAACTTATC 
GTTCCTCTTT 
TCTTTAAAGC 
CTGGTGATGG 
CACCAGATGA 
ATTCTGAAGA 
ATATTCTGGA 
GTGAGAAGTT 
GAGAAATAGA 
TGTGTATAGC 
TTGAAGCCAG 
TAGAGAAAGA 
TTCGAGAAAC 
ATAAACATGA 
ACATCCTTGA 
TTGCCAGAAG 
GCCTATCTCT 
CAGCTGTATT 
AAATATCACC 
TGATACAAGA 
GAAACAGTGA 
GTCATTTTTA 
GCTTTATCAC 
ATGACAGCGT 
ATAGTCTTTT 
GAGACATTAG 
GCTTCAGTCA 
CTCTGCTTGG 
TCATGGTTAT 
ACCATCTCGT 
GGATTCTCTG 
CAAGTGGTTC 
T CAT AAAG AA 
CTGAAACAAA 
CAGCGTGTGC 
ACATCAGCAG 
CTCTCTCCAC 



31 

I 

ACTGACAGAA 
AATTAATGGT 
TTATCTTAGT 
CAGAACCAGT 
AGTCAGT CTC 
TCCCTGGCAA 
AAAGGCT CTA 
AACTATGGAG 
ATTTGATTCA 
ATTATTTGCT 
AAAAACCAGA 
ATATAGAAAA 
GACTGCCTTG 
CCTGATATTA 
TATTGAAGCA 
AACAGCTGTT 
ACTTATAAAC 
AATTACAGAG 
TGCACTCAGG 
ATGCTGTCGT 
TGAGAAACCT 
AGCCCATGTT 
CTATGCAATA 
TTATATTAGA 
ACCCCAGTTT 
GTACCTGACT 
GGAACAGCAT 
TAAAAACCGC 
TGCCTTTATT 
AAATGGCCAG 
TACAGTCACA 
TACCTGGGGA 
GCCATTTTTG 
TGCTTGGTTT 
GGTCTTTGAC 
TGCAGGTATC 
ATCTGTTGGA 
ATCATGGAGT 
AATGGACTCA 



41 

AATGAGATGC 
AGACT TGTAC 
AGTTTATCCC 
CCTGAAAATG 
TGTCACACTG 
TCCAACCTGG 
GTAGAAGCTG 
GTTAAAACTC 
GAT CGT AGGA 
AAAGGAGCTG 
ATTCATGTAG 
TTTACATCAA 
CAGCAGCGGG 
CTTGGAGCCA 
TTGAGAATGG 
AGTGTGAGTT 
CAGAAATCAG 
GATCATGTGA 
GAGCATGAAA 
ATGGCTCCAC 
ATAACATTGG 
GGCATAGGAA 
GCCAGATTTA 
ATAGCTACCC 
TTATATCAGT 
TTATACAATA 
GTAGACCCTC 
CTCTTAAGTA 
TTCTTTTTTG 
ATGTTTGGAA 
GTAAAGATGG 
TCTATTATAT 
GGCTCCCAGA 
GCCATAATCC 
CGACACCTCC 
AAGTGCTTGG 
AGAATGCTGG 
GCATCGGATC 
TCTACTTGTT 
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AGTTT CGGG A 
CCGAAGGACC 
ATCTTAACAA 
AAACTGAACT 
TACAGATTAG 
CACCATCGCA 
CTGCAAGGAT 
TTGGAAAACT 
GAATGAGTGT 
AGTCAT CAAT 
ATGAATTTGC 
AAGAGTATGA 
AAGAGAAATT 
CAGCAGTAGA 
CTGGTATCAA 
TATCATGTGG 
ACAGCGAGTG 
TTCAGCATGG 
AACTATTTAT 
TGCAGAAAGC 
CTGTTGGTGA 
TCATGGGTAA 
AGTTCCTCTC 
TTGTACAGTA 
TCTACTGTTT 

ATGTGTTACA 
TTAAAACATT 
GATCCTATTT 
ACTGGACATT 
CTCTGGAAAC 
TTTATTTTGT 
ATATGTATTT 
TCATGGTTGT 
ACCCTACAAG 
ACTCCATGTG 
AACGAGTTAT 
CTTTCTATAC 
AAAGGGG CAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
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TAGTACTTTG TGGGAGCCAG TTCACCTCCT TTCCTAAAAT TCAGTGTGAT CACCCTGTTA 2400 

ATGGCCACAC TAGCTCTGAA ATTAATTTCC AAAATCTTTG TAGTAGTTCA TACCCACTCA 2460 

GAGTTATAAT GGCAAACAAA CAGAAAGCAT TAGTACAAGC CCCTCCCAAC ACCCTTAATT 252 0 

TGAATCTGAA CATGTTAAAA TTTGAGAATA AAGAGACATT TTTCATCTCT TTGTCTGGTT 2 580 

TGTCCCTTGT GCTTATGGGA CTCCTAATGG CATTTCAGTC TGTTGCTGAG GCCATTATAT 2 640 

TTTAATATAA ATGTAGAAAA AAGAGAGAAA TCTTAGTAAA GAGTATTTTT TAGTATTAGC 2700 

TTGATTATTG ACTCTTCTAT TTAAATCTGC TTCTGTAAAT TATGCTGAAA GTTTGCCTTG 2760 

AGAACTCTAT TTTTTTATTA GAGTTATATT TAAAGCTTTT CATGGGAAAA GTTAATGTGA 2820 

ATACTGAGGA ATTTTGGTCC CTCAGTGACC TGTGTTGTTA ATT CATTAAT GCATTCTGAG 2880 

TTCACAGAGC AAATTAGGAG AATCATTTCC AACCATTATT TACTGCAGTA TGGGGAGTAA 2940 

ATTTATACCA ATTCCTCTAA CTGTACTGTA ACACAGCCTG TAAAGTTAGC CATATAAATG 3000 

CAAGGGTATA TCATATATAC AAATCAGGAA TCAGGTCCGT TCACCGAACT TCAAATTGAT 3060 

GTTTACTAAT ATTTTTGTGA CAGAGTATAA AGACCCTATA GTGGGTAAAT TAGATACTAT 3120 

TAGCATATTA TTAATTTAAT GTCTTTATCA TTGGATCTTT TGCATGCTTT AATCTGGTTA 3180 

ACATATTTAA ATTTGCTTTT TTTCTCTTTA CCTGAAGGCT CTGTGTATAG TATTTCATGA 3240 

CATCGTTGTA CAGTTTAACT ATATCAATAA AAAGTTTGGA CAGTATTTAA ATATTGCAAA 3300 

TATGT TTAAT TATACAAATC AGAATAGTAT GGGTAATTAA ATGAATACAA AAAGAAGAGC 3360 

CTCTTTCTGC AGCCGACTTA GACATGCTCT TCCCTTTCTA TAAG CTAGAT TTTAGAATAA 3420 

AGGGTTTCAG TTAATAATCT TATTTTCAGG TTATGTCATC TAACTTATAG CAAACTACCA 348 0 

CAATACAGTG AGTTCTGCCA GTGTCCCAGT ACAAGGCATA TTTCAGGTGT GGCTGTGGAA 3540 

TGTAAAAATG CTCAACTTGT ATCAGGTAAT GTTAGCAATA AATTAAATGC TAAGAATGAT 3600 

TAATCGGGTA CATGTTACTG TAATTAACTC ATTGCACTTC AAAACCTAAC TTCCATCCTG 3 660 

AATTTATCAA GTAGTT CAGT ATTGTCATTT GTTTTTGTTT TATTGAAAAG TAATGTTGT C 372 0 

TTAAGATTTA GAAGTGATTA TTAGCTTGAG AACTATTACC CAGCTCTAAG CAAATAATGA 3780 

TTGTATACAT ATTAAGATAA TGGTTAAATG CGGTTTTACC AAGTTTTCCC TTGAAAATGT 3840 

AATTCCTTTA TGGAGATTTA TTGTGCAGCC CTAAGCTTCC TTCCCATTTC ATGAATATAA 3900 

GGCTTCTAGA ATTGGACTGG CAGGGGAAAG AATGGTAGAG ACAGAAATTA AGACTTTATC 3 960 

CTTGTTTGCT TGTAAACTAT TATTTTCTTG CTAATGTAAC ATTTGTCTGT TCCAGTGATG 402 0 

TAAGGATATT AAGTTATTAA GCTAAATATT AATTTTCAAA AATAGTCCTT CTTTAACTTA 4080 

GATATTTCAT AGCTGGATTT AGGAAGATCT GTTATTCTGG AAGTACTAAA AAGAATAATA 4140 

CAACGTACAA TGTCTGCATT CACTAATTCA TGTTCCAGAA GAGGAAATAA TGAAGATATA 4200 

C T CAGT AG AG TACTAGGTGG GAGGATATGG AAATTTGCTC ATAAAATCTC TTATAAAACG 4260 

TGCATATAAC AAAATGACAC CCAGTAGGCC TGCATTACAT TTACATGACC GTGTTTATTT 4320 

GCCATCAAAT AAACTGAGTA CTGACACCAG ACAAAGACTC CAAAGTCATA AAATAGCCTA 43 80 

TGACCAACTG CAGCAAGACA GGAGGTCAGC TCGCCTATAA TGGTGCTTAA AGTGTGATTG 4440 

ATGTAATTTT CTGTACT C AC CATTTGAAGT TAGTTAAGGA GAACTTTATT TTTTTAAAAA 4500 

AAGTAAATGG CAACCACTAG TGTGCTCATC CTGAACTGTT ACTCCAAATC CACTCCGTTT 4560 

TTAAAGCAAA ATTATCTTGT GATTTTAAGA AAAGAGTTTT CTATTTATTT AAGAAAGTAA 462 0 

CAATGCAGTC TGCAAGCTTT CAGTAGTTTT CTAGTGCTAT ATTCATCCTG TAAAACTCTT 4680 

ACTACGTAAC CAGTAATCAC AAGGAAAGTG TCCCCTTTGC ATATTTCTTT AAAATTCTTT 4740 

CTTTGGAAAG TATGATGTTG ATAATTAACT TACCCTTATC TGCCAAAACC AGAGCAAAAT 4800 

GCTAAATACG TTATTGCTAA TCAGTGGTCT CAAATCGATT TGCCTCCCTT TGCCTCGTCT 4860 

GAGGGCTGTA AGCCTGAAGA TAGTGGCAAG CACCAAGTCA GTTTCCAAAA TTGCCCCTCA 4920 

GCTGCTTTAA GTGACTCAGC ACCCTGCCTC AGCTTCAGCA GGCGTAGGCT CACCCTGGGC 4 980 

GGAGCAAAGT ATGGGCCAGG GAGAACTACA GCTACGAAGA CCTGCTGTCG AGTTGAGAAA 5040 

AGGGGAGAAT TTATGGTCTG AATTTTCTAA CTGTCCTCTT TCTTGGGTCT AAAGCTCATA 5100 

ATACACAAAG GCTT CCAGAC CTGAGCCACA CCCAGGCCCT ATCCTGAACA GGAGACTAAA 5160 

CAGAGGCAAA TCAACCCTAG GAAATACTTG CATTCTGCCC TACGGTTAGT ACCAGGACTG 5220 

AGGTCATTTC TACTGGAAAA GATTGTGAGA TTGAACTTAT CTGATCGCTT GAGACTCCTA 5280 

ATAGGCAGGA GTCAAGGCCA CTAGAAAATT GACAGTTAAG AGCCAAAAGT TTTTAAAATA 5340 

TGCTACTCTG AAAAATCTCG TGAAGGCTGT AGGAAAAGGG AGAATCTTCC ATGTTGGTGT 5400 

TTTTCCTGTA AAGATCAGTT TGGGGTATGA TATAAGCAGG TATTAATAAA AATAACACAC 5460 

CAAAGAGTTA CGTAAAACAT GTTTTATTAA TTTTGGTCCC CACGTACAGA CATTTTATTT 5520 

CTATTTTGAA ATGAGTTATC TATTTTCATA AAAGTAAAAC ACTATTAAAG TGCTGTTTTA 5580 

TGTGAAATAA CTTGAATGTT GTTCCTATAA AAAATAGATC ATAACTCATG ATATGTTTGT 5640 

AATCATGGTA ATTTAG AT TT TTATGAGGAA TGAGTATCTG GAAATATTGT AGCAATACTT 5700 

GGTTTAAAAT TTTGGACCTG AGACACTGTG GCTGTCTAAT GTAATCCTTT AAAAATTCTC 5760 

TGCATTGTCA GTAAATGTAG TATATTATTG TACAGCTACT CATAATTTTT T AAAGTT TAT 5820 
GAAGTTATAT T T ATCAAAT A AAAACTTTCC TATAT 

Seq ID NO: 315 Protein sequence: 
Protein Accession #: XP_0872 54 



1 11 21 31 41 51 

I t 1 I I I 

MQFRECSING MKYQEINGRL VPEGPTPDSS EGNLSYLSSL SHLNNLSHLT TSSSFRTSPE 60 

NETELIKEHD LFFKAVSLCH TVQISNVQTD CTGDGPWQSN LAPSQLEYYA SSPDEKALVE 120 

AAARIGIVFI GNSEETMEVK TLGKLERYKL LHILEFDSDR RRMSVIVQAP SGEKLLFAKG 180 

AESSILPKCI GGEIEKTRIH VDEFALKGLR TLCIAYRKFT SKEYEEIDKR IFEARTALQQ 240 

REEKLAAVFQ FIEKDLILLG ATAVEDRLQD KVRET I EALR MAG I KVWVLT GDKHETAVSV 30 0 

SLSCGHFHRT MNILELINQK SDSECAEQLR QLARRITEDH VIQHGLWDG TSLSLALREH 360 

EKLFMEVCRN CSAVLCCRMA PLQKAKVIRL IKISPEKPIT LAVGDGANDV SMIQEAHVGI 420 

GIMGKEGRQA ARNSDYAIAR FKFIiSKLLFV HGHFYYIRIA TLVQYFFYKN VCFITPQFLY 480 

QFYCLFSQQT LYDSVYLTLY NICFTSLPIL IYSLLEQHVD PHVLQNKPTL YRDISKNRLL 540 

SIKTFLYWTI LGFSHAFIFF FGSYLLIGKD TSLLGNGQMF GNWTFGTLVF TVMVITVTVK 600 

MALETHFWTW INHLVTWGSI IFYFVFSLFY GGILWPFLGS QNMYFVFIQL LSSGSAWFAI 660 

ILMWTCLFL D 1 1 KKVFDRH LHPTSTEKAQ LTETNAGIKC LDSMCCFPEG EAACASVGRM 720 

LERVIGRCSP THISRSWSAS DPFYTNDRSI LTLSTMDSST C 

Seq ID NO: 316 DNA sequence 
Nucleic Acid Accession #: NM_004473 
Coding sequence: 661.. 1791 
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CTCGCCAGCG GTCCGCGGGG CTGGAGACCC ACGCCGTGGA GAGGACCAGC CTCAGGTCGC 60 
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CCCGCCTGGG CCCGCGCCCC GACCTCGCTG CCCCCGCCTC GCCTCTCTGC CCGTGGCGCT 120 

'TACCGCCACC TTGGCCTCGG GGG CAGGGCA TGGGCGGCCC CCGCCAGATC GCCCAGCGCC 180 

AGTACTAACT GCCCTCGCTC TGGCCTTCGA GCCCGAAGCC TCTTCTGCGC GCACAACCTA 240 

GGCAGTAATC CTAAACTAGC GGG C AC C ACA GACCAGCTGC AGCCACCCCA ACCCAGGGAT 300 

CACTTCCGGA CCCCTCGACC GCCCGGCACC AGCG CGCAAG GGACCCTTCA GCCGGAGACC 360 

AGAGTCCAGT CCCGGTCGCG AGGCCACCGC CGCTGCCCGC CTCGAGAAGC ACAACGCGGG 42 0 

CTGAGCCGTC GGCTAGCGGG TCACTCCCGA GCCTCTGTCT GCACCGCGCC AGCCCCAGAC 480 

CACGGACGCT GAGCCT CCAG CGCGCGCCAG CCTGGGCCGC TGGGCTCTCC GGGCCAGCCC 540 

GCGACGATCC CCTGAGCTCT CCGCAGAAGG GCCGAGCGTC CGTTCCGGGG ACGCCAGGCC 60 0 

CGCCCCCGCC CCCCGACAGC CGCGGGGATC CAGAGCCCGG GGGTGCGGGA CGCCCGCGCC 660 

ATGACTGCCG AGAGCGGGCC GCCGCCGCCG CAGCCGGAGG TGCTGGCTAC CGTGAAGGAA 720 

GAGCGCGGCG AGACGGCAGC AGGGGCCGGG GTCCCAGGGG AGGCCACGGG CCGCGGGGCG 780 

GGCGGGCGGC GCCGCAAGCG CCCCCTGCAG CGCGGGAAGC CGCCCTACAG CTACAT CGCG 840 

CTCATCGCCA TGGCCATCGC GCACGCGCCC GAGCGCCGCC TCACGCTGGG CGGCATCTAC 900 

AAGTTCATCA CCGAGCGCTT CCCCTTCTAC CGCGACAACC CCAAAAAGTG GCAGAACAGC 960 

ATCCGCCACA ACCTCACACT CAACGACTGC TTCCTCAAGA TCCCGCGCGA GGCCGGCCGC 102 0 

CCGGGTAAGG GCAACTACTG GGCGCTCGAC CCCAACGCGG AGGACATGTT CGAGAGCGGC 1080 

AGCTTCCTGC GCCGCCGCAA GCGCTTCAAG CGCTCGGACC TCTCCACCTA CCCGGCTTAC 1140 

ATGCACGACG CGGCGGCTGC CGCAGCCGCC GCTGCCGCAG CCGCCGCCGC CGCCGCCGCC 12 0 0 

GCCGCCATCT TCCCAGGCGC GGTGCCCGCC GCGCGCCCCC CCTACCCGGG CGCCGTCTAT 12 60 

GCAGGCTACG CGCCGCCGTC GCTGGCCGCG CCGCCTCCAG TCTACTACCC CGCGGCGTCG 1320 

CCCGGCCCTT GCCGCGTCTT CGGCCTGGTT CGTGAGCGGC CGCTCAGCCC AGAGCTGGGG 13 80 

CCCGCACCGT CGGGGCCCGG CGGCTCTTGC GCCTTTGCCT CCGCCGGCGC CCCCGCTACC 1440 

ACCACCGGCT ACCAGCCCGC AGGCTGCACC GGGGCCCGGC CGGCCAACCC CTCTGCCTAT 1500 

GCGGCTGCCT ACGCGGGCCC CGACGGCGCG TACCCGCAGG GCGCCGGCAG TGCGATCTTT 1560 

GCCGCTGCTG GCCGCCTGGC GGGACCCGCT TCGCCCCCAG CGGGCGGCAG CAGTGGCGGC 162 0 

GTGGAGACCA CGGTGGACTT CTACGGGCGC ACGTCGCCCG GCCAGTTCGG AGCGCTGGGA 1680 

GCCTGCTACA ACCCTGGCGG GCAGCTCGGA GGGGCCAGTG CAGGCGCCTA CCATGCTCGC 1740 

CATGCTGCCG CTTATCCCGG TGGGATAGAT CGGTTCGTGT CCGCCATGTG AGCCAGCGTA 18 00 

GGGACGAAAA CTCATAGACA CATCGGCTGT TCACACGTTC CCCGCAACCT GAGAACGAAC 18 60 

AGGAATGGAG AGAGGACTCA ACTGGGACCC ACGTGGAAAA GACCGAGCAG GCCACAGAGG 192 0 

CTCGGTCTCC CCGCGCACAG CGTAGGCACC CTGTGTACTC TGTAAACGGG AGGAGGTGGG 1980 

GCGAGGCAGC CAGAGCCCTT GGACTGGCAC AGGGACCCTC GATGGAGCGA AGCCCTCAAA 2 040 

CGGGATGCTT TCTGGCATTC TATCGGGGAG GGTCCTTGGC GGTAACCAGA GGGCAGCGTA 210 0 

GTGTCAACAC CAGAGACCAG GATCCAAATT GTGGGGAATC AGTTTCAGCC TTCCATGTGC 2160 

TGCCGGAACT CGGGCCTTTT TACGCGGTTC GTCCTCTAGT GCCTTTAACT GCGTTACTAC 222 0 

AATAAAAGGC TGCGGCAGCG CCTTTCTTCT TAAAGTGAGG AGGACAAATT TGCAAAAGAA 22 80 

ATAGGCTTTT CTTCTTTTTT AAATTGGAGA AATCT CTGCT CTGGTTGACC TGGGCTGGTT 2340 

TTCCCTGTCT CTGAGAACTT GAGACCTAGC TCCGAGTTGA ACTGTG CGTC AGCACTCCAG 240 0 

TCCCATCACC TGAACCTTCA GTCTCCCCCA TCTGTTACAC TAGAGGGCTG CAGGACTCTA 2460 

TCCACCGCCC CCGGGTTATC ATTCAGGGCC CCATCATCTT GGATGCTGCC CTGCGTATTT 252 0 

GGCAG CAATG GTGGGCCACC CAGGGCCTCT GAGTAGCCAC CCAAAGCCTA GCCGCTGTTC 25 80 

TAGGGAACGG AAAAGAGTTC ATGGCCAAGC GTCTAACCTA AAGTCCCAGG ATTGGCTCCA 2 640 

GGCAGCAATT ATATCATAAC TTATTGAACT TTTGAGCAGG ACGTGCTGGT AATTTCATGG 2 70 0 

CTGTTACTGC CCAGTCATAA ATCTGCTTTT CCATTATAAG GCAGAGAGAA GTACATTCGT 2760 

TCATTTGTCC ACTGTTTCTT GTCATCACGC AGCCCTGGAC CCAAAGGGTG AACTAAAGTT 2 82 0 

TAAGGAGATG AGAGGATTCA AGGAGCCCGT TGGTGACGCC TTTCAGTAGC TGGGGAGGGC 2 88 0 

TCTTCCATCC CCAGCACCCC CTGCTACACC TCAGCAGCCT CCCCCATGCA AAAAGGAAAG 2 940 

AGAAAAATTA AGTTAGGGCA GTCAGTAAAG TGAGCTTTAG AAAGAAACTG GAATTTTAAC 3 0 00 

TTCATTTTGT ATCTTGCTTA AGTAGCAGGC TCACTAAAAT TAGAGAAAGT CCAATAACT C 3 0 60 

TCCCCCTTTC CCTTGAGAAA TCTTTAAGTT TCGATTCTGG AGCAAAAACT TTCAGCATTA 312 0 

AATATTTCAG AGGCTCCATT CACAGCTTTC AGATAAACTG GAGTGTTCAG ATGGACTGTT 3180 

TTAATAAAAA TCTTTGAGCA AGTGAGTTAT GGCAAGAGAA ACTCAGCCTC TTTCTGTATA 3240 

AACTTAACAG GGAAGGGCTG GGGTGTGAAA AAGAAGATTG TATGAAAACC ATTGGTAATT 3300 

TTTATTTTTT ATTTTTGGGA CTGCACTATC CTGTTCACGA AGACATGTGA ACTTGGTTCA 33 60 

GTCCAAATGG GGATTTGTAT AAACCAGTGC TCTCCATTAG AAATATGGTG C AAG CCA CAT 3420 

ATGTAATTTT AAATATTCTA GTAGCCACAT TAATAAAGTN AAAAGAAACA AAAAAAAAAA 34 80 
AA 

Seq ID NO: 317 Protein sequence: 
Protein Accession #: NP_004464 

1 11 21 31 41 51 

I I I I I I 

FKHLTHYROI DTRANSCRIP TIONF ACTOR TTFMTAESGP PPPQPEVLAT VKEERGETAA 60 

GAGVPGEATG RGAGGRRRKR PLQRGKPPYS YIALIAMAIA HAPERRLTLG GIYKFITERF 12 0 

PFYRDNPKKW QNSIRHNLTL NDCFLKIPRE AGRPGKGNYW ALDPNAEDMF ESGSFLRRRK 180 

RFKRSDLSTY PAYMHDAAAA AAAAAAAAAA AAAAAIFPGA VPAARPPYPG AVYAGYAPPS 240 

LAAPPPVYYP AASPGPCRVF GLVPERPLSP ELGPAPSGPG GSCAFASAGA PATTTGYQPA 3 00 

GCTGARPANP SAYAAAYAGP DGAYPQGAGS AIFAAAGRLA GPASPPAGGS SGGVETTVDF 360 
YGRTSPGQFG ALGACYNPGG QLGGASAGAY HARHAAAYPG GIDRFVSAM 

Seq ID NO: 318 DNA sequence 
Nucleic Acid Accession #: NM_005688 
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CCGGGCAGGT GGCTCATGCT CGGGAGCGTG GTTGAGCGGC TGGCGCGGTT GTCCTGGAGC 60 

AGGGGCGCAG GAATTCTGAT GTGAAACTAA CAGTCTGTGA GCCCTGGAAC CTCCGCTCAG 12 0 

AGAAGATGAA GGATAT CGAC ATAGGAAAAG AGTATATCAT CCCCAGTCCT GGGTATAGAA 180 

GTGTGAGGGA GAGAACCAGC ACTTCTGGGA CGCACAGAGA CCGTGAAGAT TCCAAGTTCA 240 

GGAGAACTCG ACCGTTGGAA TGCCAAGATG CCTTGGAAAC AGCAGCCCGA GCCGAGGGCC 300 

TCTCTCTTGA TGCCTCCATG CATTCTCAGC TCAGAATCCT GGATGAGGAG CATCCCAAGG 3 60 

GAAAGTACCA TCATGGCTTG AGTGCTCTGA AGCCCATCCG GACTACTTCC AAACACCAGC 420 

ACCCAGTGGA CAATGCTGGG CTTTTTTCCT GTATGACTTT TTCGTGGCTT TCTTCTCTGG 480 
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CCCGTGTGGC CCACAAGAAG GGGGAGCTCT CAATGGAAGA CGTGTGGTCT CTGTCCAAGC 540 

ACGAGTCTTC TGACGTGAAC TGCAGAAGAC TAGAGAGACT GTGGCAAGAA GAGCTGAATG 600 

AAGTTGGGCC AGACGCTGCT TCCCTGCGAA GGGTTGTGTG GATCTTCTGC CGCACCAGGC 660 

TCATCCTGTC CATCGTGTGC CTGATGATCA CGCAGCTGGC TGGCTTCAGT GGACCAGCCT 720 

TCATGGTGAA ACACCTCTTG GAGTATACCC AGGCAACAGA GTCTAACCTG CAGTACAGCT 780 

TGTTGTTAGT GCTGGGCCTC CTCCTGACGG AAATCGTGCG GTCTTGGTCG CTTGCACTGA 840 

CTTGGGCATT GAATTACCGA ACCGGTGTCC GCTTGCGGGG GGCCATCCTA ACCATGG CAT 900 

TTAAGAAGAT CCTTAAGTTA AAGAACATTA AAGAGAAATC CCTGGGTGAG CTCATCAACA 960 

TTTGCTCCAA CGATGGGCAG AGAATGTTTG AGGCAGCAGC CGTTGGCAGC CTGCTGGCTG 1020 

GAGGACCCGT TGTTGCCATC TTAGGCATGA TTTATAATGT AATTATTCTG GGACCAACAG 1080 

GCTTCCTGGG ATCAGCTGTT TTTATCCTCT TTTACCCAGC AATGATGTTT GCATCACGGC 1140 

TCACAGCATA TTTCAGGAGA AAATGCGTGG CCGCCACGGA TGAACGTGTC CAGAAGATGA 1200 

ATGAAGTTCT TACTTACATT AAATTT AT CA AAATGTATGC CTGGGTCAAA GCATTTTCTC 12 60 

AGAGTGTTCA AAAAATCCGC GAGGAGGAGC GTCGGATATT GGAAAAAGCC GGGTACTTCC 1320 

AGGGTATCAC TGTGGGTGTG GCTCCCATTG TGGTGGTGAT TGCCAGCGTG GTGACCTTCT 1380 

CTGTTCATAT GACCCTGGGC TTCGATCTGA CAGCAGCACA GGCTTTCACA GTGGTGACAG 1440 

TCTTCAATTC CATGACTTTT GCTTTGAAAG TAACACCGTT TTCAGTAAAG TCCCTCTCAG 1500 

AAGCCTCAGT GGCTGTTGAC AGATTTAAGA GTTTGTTTCT AATGGAAGAG GTTCACATGA 1560 

TAAAGAACAA ACCAGCCAGT CCTCACATCA AGATAGAGAT GAAAAATGCC ACCTTGGCAT 162 0 

GGGACTCCTC CCACTCCAGT ATCCAGAACT CGCCCAAGCT GACCCCCAAA ATGAAAAAAG 1680 

ACAAGAGGGC TTCCAGGGGC AAGAAAGAGA AGGTGAGGCA GCTGCAGCGC ACTGAGCATC 1740 

AGGCGGTGCT GGCAGAGCAG AAAGGCCACC TCCTCCTGGA CAGTGACGAG CGGCCCAGTC 1800 

CCGAAGAGGA AGAAGGCAAG CACATCCACC TGGGCCACCT GCGCTTACAG AGGACACTGC 1860 

ACAGCAT CGA T CTGG AG AT C CAAGAGGGTA AACTGGTTGG AATCTGCGGC AGTGTGGGAA 1920 

GTGGAAAAAC CTCTCTCATT TCAGCCATTT TAGGCCAGAT GACGCTTCTA GAGGGCAGCA 1980 

TTGCAATCAG TGGAACCTTC G CTTATGTGG CCCAGCAGGC CTGGATCCTC AATGCTACTC 2 040 

TGAGAGACAA CATCCTGTTT GGGAAGGAAT ATGATGAAGA AAGATACAAC TCTGTGCTGA 2100 

ACAGCTGCTG CCTGAGGCCT GACCTGGCCA TTCTTCCCAG CAGCGACCTG ACGGAGATTG 2160 

GAGAGCGAGG AGCCAACCTG AGCGGTGGGC AGCGCCAGAG GATCAGCCTT GCCCGGGCCT 222 0 

TGTATAGTGA CAGGAGCATC TACATCCTGG ACGACCCCCT CAGTGCCTTA GATGCCCATG 2280 

TGGGCAACCA CATCTTCAAT AGTGCTATCC GGAAACATCT CAAGT CCAAG ACAGTTCTGT 2340 

TTGTTACCCA CCAGTTACAG TACCTGGTTG ACTGTGATGA AGTGATCTTC ATGAAAGAGG 2400 

GCTGTATTAC GGAAAGAGGC ACCCATGAGG AACTGATGAA TTTAAATGGT GACTATGCTA 24 60 

CCATTTTTAA TAACCTGTTG CTGGGAGAGA CACCGCCAGT TGAGATCAAT TCAAAAAAGG 252 0 

AAACCAGTGG TTCACAGAAG AAGT CACAAG ACAAGGGTCC TAAAACAGGA TCAGTAAAGA 2580 

AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC AGCTTGTGCA GCTGGAAGAG AAAGGGCAGG 2 640 

GTTCAGTGCC CTGGTCAGTA TATGGTGTCT ACAT CCAGGC TGCTGGGGGC CCCTTGGCAT 2 700 

TCCTGGTTAT TATGGCCCTT TTCATGCTGA ATGTAGGCAG CACCGCCTTC AG CACCTGG T 2 760 

GGTTGAGTTA CTGGATCAAG CAAGGAAGCG GGAACACCAC TGTGACTCGA GG G AACG AG A 2820 

CCTGGGTGAG TGACAGCATG AAGG ACAAT C CT CATATGC A GTACTATGCC AGCATCTACG 2880 

CCCTCTCCAT GGCAGTCATG CTGATCCTGA AAGCCATTCG AGGAGTTGTC TTTGTCAAGG 2 940 

GCACGCTGCG AGCTTCCTCC CGGCTGCATG ACGAGCTTTT CCGAAGGATC CTTCGAAGCC 3000 

CTATGAAGTT TTTTGACACG ACCCCCACAG GGAGGATTCT CAACAGGTTT TCCAAAGACA 30 60 

TGGATGAAGT TGACGTGCGG CTGCCGTTCC AGGCCGAGAT GTTCATCCAG AACGTTATCC 312 0 

TGGTGTTCTT CTGTGTGGGA ATGATCGCAG GAGTCTTCCC GTGGTTCCTT GTGGCAGTGG 3180 

GGCCCCTTGT CATCCTCTTT TCAGTCCTGC ACATTGTCTC CAGGGTCCTG ATTCGGGAGC 3240 

TGAAGCGTCT GGACAATATC ACGCAGTCAC CTTTCCTCTC CCACATCACG TCCAGCATAC 33 00 

AGGGCCTTGC CACCATCCAC GCCTACAATA AAGGGCAGGA GTTTCTGCAC AGATACCAGG 33 60 

AGCTGCTGGA TGACAACCAA GCTCCTTTTT TTTTGTTTAC GTGTGCGATG CGGTGGCTGG 342 0 

CTGTGCGGCT GGACCTCATC AGCATCGCCC TCATCACCAC CACGGGGCTG ATGATCGTTC 3480 

TTATGCACGG GCAGATTCCC CCAGCCTATG CGGGTCTCGC CATCTCTTAT GCTGTCCAGT 3540 

TAACGGGGCT GTTCCAGTTT ACGGT CAGAC TGGCATCTGA GACAGAAGCT CGATTCACCT 3 600 

CGGTGGAGAG GATCAATCAC TACATTAAGA CTCTGTCCTT GGAAGCACCT GCCAGAATTA 3 660 

AGAACAAGGC TCCCTCCCCT GACTGGCCCC AGGAGGGAGA GGTGACCTTT GAGAACGCAG 3 720 

AGATGAGGTA CCGAGAAAAC CTCCCTCTTG TCCTAAAGAA AGTATCCTTC ACGATCAAAC 3 780 

CTAAAGAGAA GATTGGCATT GTGGGGCGGA CAGGATCAGG GAAGTCCTCG CTGGGGATGG 3 840 

CCCTCTTCCG TCTGGTGGAG TTATCTGGAG GCTGC AT CAA GATTGATGGA GTGAGAATCA 3 900 

GTGATATTGG CCTTGCCGAC CTCCGAAGCA AACTCTCTAT CATTCCTCAA GAGCCGGTGC 3 960 

TGTTCAGTGG CACTGTCAGA TCAAATTTGG ACCCCTTCAA CCAGTACACT GAAGACCAGA 402 0 

TTTGGGATGC CCTGGAGAGG ACACACATGA AAGAATGTAT TGCTCAGCTA CCTCTGAAAC 40 80 

TTGAATCTGA AGTGATGGAG AATGGGGATA ACTTCTCAGT GGGGGAACGG CAGCTCTTGT 4140 

GCATAGCTAG AGCCCTGCTC CGCCACTGTA AGATTCTGAT TTTAGATGAA GCCACAGCTG 42 00 

CCATGGACAC AGAGACAGAC TTATTGATTC AAGAGACCAT CCGAGAAGCA TTTGCAGACT 42 60 

GTACCATGCT GACCATTGCC CATCGCCTGC ACACGGTTCT AGGCTCCGAT AGGATTATGG 432 0 

TGCTGGCCCA GGGACAGGTG GTGGAGTTTG ACACCCCATC GGTCCTTCTG TCCAACGACA 43 80 

GTTCCCGATT CTATGCCATG TTTGCTGCTG CAGAGAACAA GGTCGCTGTC AAGGGCTGAC 4440 

TCCTCCCTGT TGACGAAGTC TCTTTTCTTT AGAGCATTGC CATTCCCTGC CTGGGGCGGG 4500 

CCCCTCATCG CGTCCTCCTA CCGAAACCTT GCCTTTCTCG ATTTTATCTT TCGCACAGCA 4560 

GTTCCGGATT GGCTTGTGTG TTTCACTTTT AGGGAGAGTC ATATTTTGAT TATTGTATTT 462 0 

ATTCCATATT CATGTAAACA AAATTTAGTT TTTGTTCTTA ATTGCACTCT AAAAGGTTCA 4680 

GGGAACCGTT ATTATAATTG TATCAGAGGC CTATAATGAA GCTTTATACG TGTAGCTATA 4740 

TCTATATATA ATTCTGTACA TAGCCTATAT TTACAGTGAA AATGTAAGCT GTTTATTTTA 4800 

TATTAAAATA AGCACTGTGC TAATAACAGT GCATATTCCT TTCTATCATT TTTGTACAGT 4860 

TTGCTGTACT AGAGATCTGG TTTTGCTATT AGACTGTAGG AAGAGTAGCA TTTCATTCTT 4920 

CTCTAGCTGG TGGTTTCACG GTGCCAGGTT TTCTGGGTGT CCAAAGGAAG ACGTGTGGCA 4980 

ATAGTGGGCC CTCCGACAGC CCCCTCTGCC GCCTCCCCAC AGCCGCTCCA GGGGTGGCTG 5040 

GAGACGGGTG GGCGGCTGGA GACCATGCAG AGCGCCGTGA GTTCTCAGGG CTCCTGCCTT 5100 

CTGTCCTGGT GTCACTTACT GTTTCTGTCA GGAGAGCAGC GGGGCGAAGC CCAGGCCCCT 5160 

TTTCACTCCC TCCATCAAGA ATGGGG AT C A CAGAGACATT CCTCCGAGCC GGGGAGTTTC 5220 

TTTCCTGCCT TCTTCTTTTT GCTGTTGTTT CTAAACAAGA ATCAGTCTAT CCACAGAGAG 5280 

TCCCACTGCC TCAGGTTCCT ATGGCTGGCC ACTGCACAGA GCTCTCCAGC TCCAAGACCT 5340 

GTTGGTTCCA AGCCCTGGAG CCAACTGCTG CTTTTTGAGG TGGCACTTTT TCATTTGCCT 5400 

ATTCCCACAC CTCCACAGTT CAGTGG CAGG GCTCAGGATT TCGTGGGTCT GTTTTCCTTT 5460 

CTCACCGCAG TCGTCGCACA GTCTCTCTCT CTCTCTCCCC TCAAAGTCTG CAACTTTAAG 5520 

CAGCTCTTGC TAATCAGTGT CTCACACTGG CGTAGAAGTT TTTGTACTGT AAAGAGACCT 5580 

ACCTCAGGTT GCTGGTTGCT GTGTGGTTTG GTGTGTTCCC GCAAACCCCC TTTGTGCTGT 5640 

GGGGCTGGTA GCTCAGGTGG GCGTGGT CAC TGCTGTCATC AGTTGAATGG TCAGCGTTGC 5700 



305 



WO 02/086443 

ATGTCGTGAC CAACTAGACA TTCTGTCGCC TTAGCATGTT TGCTGAACAC CTTGTGGAAG 
CAAAAATCTG AAAATGTGAA TAAAATTATT TTGGATTTTG TAAAAAAAAA AAAAAAAAAA 
AAAAAAAAAA AAAAAAAA 



5760 
5820 



PCT/US02/12476 



Seq ID NO: 319 Protein sequence: 
Protein Accession #: NP_005679 
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MKDIDIGKEY 
LDASMHSQLR 
VAHKKGELSM 
LSIVCLMITQ 
ALNYRTGVRL 
PWAILGMIY 
VLTYIKFIKM 
HMTLGFDLTA 
NKPASPHIKI 
VLAEQKGHLIj 
KTSLISAILG 
CCLRPDLAIL 
NH I FNS AI RK 
FNNLLLGETP 
VPWSVYGVYI 
VSDSMKDNPH 
KFFDTTPTGR 
LVILFSVLHI 
LDDNQAPFFL 
GLFQFTVRIiA 
RYRENLPLVL 
IGLADLRSKL 
SEVMENGDNF 
MLTIAHRLHT 



11 

I 

IIPSPGYRSV 
I LDEEHPKGK 
EDVWSLSKHE 
LAGFSGPAFM 
RGAILTMAFK 
NVIILGPTGF 
YAWVKAFSQS 
AQAFTWTVF 
EMKNATLAWD 
LDSDERPSPE 
QMTLIiEGSIA 
PSSDIiTEIGE 
HLKSKTVLFV 
PVEINSKKET 
QAAGGPLAFL 
MQYYASIYAL 
ILNRFSKDMD 
VSRVLIREDK 
FTCAMRWLAV 
SETEARFTSV 
KKVSFTIKPK 
SIIPQEPVLF 
SVGERQLLCI 
VLGSDRIMVL 



21 
I 

RERTSTSGTH 
YHHGLSALKP 
SSDVNCRRLE 
VKHLLEYTQA 
KILKLKNIKE 
LGSAVFILFY 
VQKIREEERR 
NSMTFALKVT 
SSHSSIQNSP 
EEEGKHIHLG 
ISGTFAYVAQ 
RGANLSGGQR 
THQLQYLVDC 
SGSQKKSQDK 
VIMALFMLNV 
SMAVMLILKA 
EVDVRLPFQA 
RLDNITQSPF 
RLDLISIALI 
ERINHYIKTL 
EKIGIVGRTG 
SGTVRSNLDP 
ARALLRHCKI 
AQGQWEFDT 



31 

1 

RDREDSKFRR 
IRTTSKHQHP 
RLWQEELNEV 
TESNLGYSLL 
KSLGELINIC 
PAMMFASRLT 
I LEKAGYFQG 
PFSVKSLSEA 
KLTPKMKKDK 
HLRLQRTLHS 
QAW I LNATLR 
QRISLARALY 
DEVIFMKEGC 
GPKTGSVKKE 
GSTAFSTWWL 
IRGWFVKGT 
EMFIQNVILV 
LSHITSSIQG 
TTTGLMIVLM 
SLEAPARIKN 
SGKSSLGMAL 
FNQYTEDQIW 
L I LDEATAAM 
PSVLLSNDSS 



41 
I 

TRPLECQDAL 
VDNAGLF S CM 
GPDAASLRRV 
LVLGLLLTEI 
SNDGQRMFEA 
AYFRRKCVAA 
ITVGVAPIW 
SVAVDRFKSL 
RASRGKKEKV 
IDLEIQEGKL 
DNILFGKEYD 
SDRSIYILDD 
ITERGTHEEL 
KAVKPEEGQL 
SYWIKQGSGN 
LRASSRLHDE 
FFCVGMIAGV 
LATIHAYNKG 
HGQIPPAYAG 
KAPSPDWPQE 
FRLVELSGGC 
DALERTHMKE 
DTETDLLIQE 
RFYAMFAAAE 



51 
I 

ETAARAEGLS 
TFSWLSSLAR 
VWIFCRTRLI 
VRSWSLALTW 
AAVGSLLAGG 
TDERVQKMNE 
VIASWTFSV 
FLMEEVHMIK 
RQLQRTEHQA 
VGICGSVGSG 
EERYNSVLNS 
PLSALDAHVG 
MNLNGDYATI 
VQLEEKGQGS 
TTVTRGNETS 
LFRRILRSPM 
FPWFLVAVGP 
QEFLHRYQEL 
LAISYAVQLT 
GEVTFENAEM 
IKIDGVRISD 
CIAQLPLKLE 
TIREAFADCT 
NKVAVKG 



Seq ID NO: 32 0 DNA sequence 

Nucleic Acid Accession #: AK022089.1 

Coding sequence: 181-1488 



AGCAGTTGCA 
ACATCCGAGG 
CCCCACCCCC 
ATGGCTCCCT 
GACATGGATT 
TGTGGGCTGA 
GAGGCTACGT 
GAGAAGTGGA 
AAAGCGTGGG 
CTTCCAGTTC 
TGGGAGCTCA 
GTCAGGAAAA 
GATAATATGG 
GTCAAGAGAA 
GATCGAGTAG 
AGTGAAGTTG 
AG CGAAAGTG 
GATAAGCTCT 
GATGCGGAAG 
GATTTGGAGA 
AAAGAGATTA 
GCCAAGGAAT 
AGAGCGAAGG 
GTATTTAGCA 
CAGGACTCCG 
CTTTCTGACC 
ATATAACACT 
GAAATGTGTT 



11 
I 

CAACTTCCAG 
AGAAGAGAAA 
ACCCTCAGAT 
TTGGAAGAAA 
CAGAAGAGAA 
CTAAACGCAC 
TTGGAGAGAA 
GAGGCTCCGA 
GAGATGAGCA 
CTTTGTGGCG 
GCCCAGCAAA 
CTTTCCGGAA 
AGACATTAGT 
TGAAAGAGCT 
AAAATGATGG 
AGCAAAATCT 
ATGGAATTGA 
CTGCTGAAAT 
GGGAAGCTGC 
AAAGCATGAA 
AATACAGTGA 
TCAATTCACT 
AATCTGAGGT 
ATTACACAAA 
AAACAACAGT 
TGCTTT CATG 
CAAAAAAATG 
GTTTCTG 



21 
I 

CAACTTTCTC 
GCTTCCAGCC 
CACTTTAAGA 
CTTGCTAAAG 
GGAAATTGTG 
CACCTCTGCT 
ACGATTTCTT 
AAGGGTTCTT 
GCCCAATATG 
GACAGCTGAA 
CTACATGAAG 
ACTGGCTAAA 
TCATCTGATC 
GGATCTGGAA 
AGAAAACTAT 
AGACTTGCAG 
ACAGCTGGAA 
AGAAAAAGAG 
AAG TGAACTG 
AGCTGGTTTG 
CTCATTGCTT 
TCACATTAGC 
TCCCAGTAGC 
TGACACAGAC 
AGGAGATGTG 
TTTTAATGTT 
TAAATCATAT 



31 

I 

AGCCGGCTAC 
CTCCTCCCTT 
TAATTTCTTT 
ACTCGGCATA 
GTTTGGGTTT 
GATGTCATCC 
CTGGGGAAGC 

CCTCCACTAA 
CAATTTGTTT 

GCCAAATTAG 
ACTTTACCAC 
ATTAAGCAGG 
ATTTCCCAGG 
ATTGAAAAGT 
GTT CAGGATG 
TATGAGGAAA 
GAACGACTGA 
GTAAAAAGTG 
GAAAGCTCTA 
AAAATTCACT 
CAGATGAAAG 
AACAAAGATG 
AATGGGGAGA 
TCGGACACTG 
GTGCTGTTGT 
TGTTTAATTT 
TGTAGTATTC 



41 
1 

TAATGAGCTG 
CACCCTGGAA 
ATTCGTTTGC 
AAAAC AGAT C 
GCCAAGAAGA 
AGGCTTTGCT 
CCAGTGATTA 
CTAGAATCCT 
TGGTTAAAGC 
TGCAAAACAC 
CAGATAAACA 
ACACAGTTTC 
ACCATACTAT 
GTGAAGCTAA 
CATATTTAAT 
ACCAGACTCT 
AATATTACCG 
TTTGCATTGA 
ATTTAGAGAG 
CTCATTTGAG 
CAAAAGAATA 
GGTGCCAGTT 
TTCCTCCCTT 
GTATCAGTTC 
CAACATAGTT 
AATAGGAAAC 
AATAGTTAAT 



51 

I 

AAAGCCAGGA 
ATCCAGACAC 
CCGACAGACC 
TCCAACTAAA 
GAAGCTTGTC 
TGAGGAACAT 
CTGCAT CATA 
GAAGCTTTGG 
AGATGCTTTT 
AGAAAAATTG 
AAAAAGAATA 
TCATGAT CG A 
TCAT CAGCAA 
GTTCCATCTT 
GCCCAGTTTC 
GGAGGACCTG 
AATACTCATT 
TATAAATGAA 
TGTTAAGTGT 
TGGCATCCAG 
TGAACTCCTG 
AAAGGAAAAC 
TACTCAAAGA 
TAACCACAGT 
CCAATGGCTC 
CTCATTTTAA 
AAAAACTCGA 



Seq ID NO: 321 Protein sequence: 
Protein Accession #: NP_005438.1 



11 



21 



31 



41 



51 



MAP FGRNLLK 
EATFGEKRFL 
LPVPLWRTAE 
DNMETLVHLI 
SEVEQNLDLQ 
DAEGEAASELi 
AKEFNSLHIS 
QDSETTVGDV 



TRHKNRSPTK 
LGKPSDYCII 
AKLVQNTEKL 
ISQDHTIHQQ 
YEENQTLiEDL 
ESSNLESVKC 
NKDGCQLKEN 
VLLST 



DMDSEEKEIV 
EKWRGSERVL 
WELSPANYMK 
VKRMKELDLE 
SESDGIEQLE 
DLEKSMKAGL 
RAKES EVP SS 



VWVCQEEKLV 
PPLTRILKLW 
TLPPDKQKRI 
IEKCEAKFHL 
ERLKYYRILI 
KIHSHLSGIQ 
NGEIPPFTQR 



CGLTKRTTSA 
KAWGDEQPNM 
VRKTFRKIiAK 
DRVENDGENY 
DKLSAEIEKE 
KEIKYSDSDL 
VFSNYTNDTD 



DVIQALLEEH 
QFVLVKADAF 
IKQDTVSHDR 
VQDAYLMPSF 
VKSVCIDINE 
QMKAKEYELL 
SDTGISSNHS 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



60 
120 
180 
240 
300 
360 
420 



Seq ID NO: 322 DNA sequence 

Nucleic Acid Accession #: NM_030920.1 



306 



WO 02/086443 

Coding sequence: 317-1123 



PCT/US02/12476 
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I 

AGCATTGAAG 
GGCGCGTGCG 
AGTTAGAGTC 
CTGGTGGTGG 
AAAATTTTGG 
AGAGCAAGAG 
CCCGGAGGAG 
AGGCCTGAAT 
TTCGCTGGCC 
AATTTCTGGA 
GAGTGGAAAC 
GAAAAGTCTT 
TTTTGAACTA 
GCCGGACTCT 
AAATGAAGCT 
TGAGGATGAG 
AGTGGGCCTC 
TGTTGAAGAA 
GAAACGAGAT 
TTCTCTAATG 
AGCTATCCCT 
TTGCCTTATC 
ATTTTCATTG 
TTCAAGCTTA 
GGTGTAGTAT 
AATATAAAAT 
GGAATCTTTA 
TTTGGTTGCT 
TTCAGCAACA 
TAGTGGTGAC 
CATGCAGGTG 
GTTTTTAGGT 
GTTTCACTCC 
CTTTAGTCTT 
AAGAAATTGA 
TCTCCTTACC 
AAAGAAAAGC 
GCCTGTTGAT 
TTGGATACCA 
TAATAGCATT 
TTAAAATTAA 
AGTGAGTGTG 
CTCCTCTTTT 
TAAAATTTTT 
TTAATTGAAG 
TACTGCTTCT 
CTTGTGGAGA 
TAGTGAGGGG 
ATTCCCTGTT 
ATTTTTAAAA 
ATTAGAGTAC 
TTAGCTTTGT 
AGTTTTCGGT 
CTCAAGAATC 
AG CCATG ACA 



11 
I 

GGGAAGGAAC 
TGCGTGTGTG 
CCAACTCTTG 
TAGTGGGCGT 
GTTGGGGGTA 
GAGGACATGG 
GTGACAGAGT 
GATACTTTCA 
CGGCTTCCCA 
GGCTTGGAAG 
AAAATAAAAG 
GACCTGTTTA 
CTGCAGCAAA 
GAAGAGGAGG 
GGTCCACCGG 
GATGAAGATG 
TCATACTTAA 
GGGGAAGAAG 
GCTGAAGACG 
TTTCTGGGTG 
ACAGAAGATA 
ATTCCAAATA 
ACTCCCATTG 
CTGTATTAGT 
GGTGCATTTC 
GACAATAGTC 
GTTTGAAGGC 
TTTTTGTCAC 
TAGTTCATGG 
TGATAAGATT 
AGCCCTTTTG 
GTGCATTCTT 
CTCCCGCTCT 
ATACCT CAAC 
GTATTTTCTG 
ATAAATGTTT 
AGGGGTGCTT 
ATATTAATAA 
CTCTGCAAAG 
TAATTGGAAT 
GAT TTGTTTT 
TGTTTCCTTA 
GCTATGGAGG 
TCCCTTTGTT 
TTATGCTTCT 
ACAATGTAAA 
AACTAAAAAT 
ATTGGAATCC 
CTAGTTCCTA 
TGAAATCTAT 
TTTGGTTTGA 
GGGTGGTATT 
CCTATTTTAA 
TCCTTAAAAA 
GTGTTAAAAA 



21 



1 

TGCGGGTGTG 
TGCGCGCGCT 
GACTCCATTT 
TTATATTTGC 
TTGGGGAAGG 
AGATGAAGAA 
TAGTCCTTGA 
AAGAACTAGA 
GCTTAAATAA 
TCCTGGCAGA 
ATCTCAGTAC 
ACTGTGAGAT 
TCACATACTT 
ATGATGAGGA 
AAGGATATGA 
AAGATGAAGC 
TGAAAGAAGA 
AGGAAGAAGA 
ATGGAGAGGA 
TGCAATAGAG 
ATGTGTAACT 
AGAACTAGTC 
TGGAATT CCC 
CATTTTTAGC 
ATTCCTTATT 
TCTTGAGTGG 
AGTTACCTTT 
AAGTAACTTG 
ATTTTGTGGA 
TTATTTTTGA 
TCAGGCTGCA 
TTTCTTCTTA 
CTCAAAAGGA 
TACGTTTCAG 
ACATAAGAAT 
CTTGGTTGAC 
TTGGAAGCAG 
GACATTAAAG 
TATTTCTAAC 
ATACTAGGCA 
CAAGTGGATG 
CATGGCTACT 
CTCCATGTTC 
TTGAATATTT 
ATACTGGGAC 
ATGTATGACT 
GTTGTACAAC 
CCAAAAAGAT 
ACAATTCTCA 
ATATTGACTT 
AAACAACACT 
TTCATGCAAA 
TGCTCTTTGT 
CTTTGAAGTT 
CAAAC 



31 
I 

GTGTGTGTAT 
AGTGTGTGGA 
GCTATTCTCT 
GTTCCTTTTC 
CAGGAAAGGG 
GAAGATTAAC 
TAATTGCCTG 
ATTTCTGAGT 
ACTTCGAAAA 
GAAATGT CCA 
AGTAGAAGCT 
CACAAACCTG 
AGATGGATTT 
TGGAGATGAA 
GGAAGAGGAG 
AGGTT CAGAG 
AATT CAGGAT 
GGAAGAAGGA 
AG AAG AT G AC 
TGATCACATC 
TTTTATAGGA 
TGTTAATGAT 
TAGCAATTTA 
CCATAATTAA 
TATAGATTAA 
TAAGTTGGTT 



GAAAATAGAA 
GGTTCTATTC 
AGGAAAAATT 
AATCATGACA 
GCAATTCCTT 
ACTTGGGAAA 
TCCTGTCTGG 
ATATTATCAA 
AACAT CTAAG 
TGTTAGTGTT 
TCAAATTTTA 
CTTTAATTCC 
GCTGGAAAAG 
TCCATTAAAA 
AAATAAAATA 
AAGGCAATGG 
GTAAGTTTTT 
ATATTTAAAT 
TGGTGTTTTA 
TGACCGAAAG 
AACATTTTTC 
TTACATACTA 
TCTTATCAAT 
TAGAGCCTCC 
TAAGTAAGGG 
ATGGCAGTAT 
AATACTTTTG 



41 

I 

GTGTGTGTGT 
CAAGGAGGTG 
TCTTTCTCCC 
ATTCATTTCT 
AAAAGGAGAG 
CTGGAGTTAA 
TGTGTCAATG 
ATGGCTAATG 
TTGGAGCTTA 
AATCTTACCT 
CTGCAAAATC 
GAAGATTATA 
GATCAGGAGG 
GATGATGAAG 
GAGGAAGAGG 
TTGGGAGAGG 
GAAGAAGATG 
GGTCTTCGAG 
TAGATCATTC 
TTTGTTTCTT 
AAAGTGTGGT 
CATATTGTAT 
TTTAGACTTA 
AACATGATCA 
CTGAAATTAC 
ATTTTTTTAG 
TTTTTTTTTG 
GCAGAATAGT 
AGTAATATGG 
GCTTATACTA 
TGCCGATGGT 
TATGATCACC 
CTTGTGAAAC 
GTTTTAAATA 
TACAGTTTTA 
ACAATATTAG 
CCTCAAAAGT 
ATGTTGGCCT 
CAGTTTTAAA 
TATTTGAAAC 
GTAGAAAAAT 
TAATGAGTAT 
CTTTTTAAAT 
AAGAAGTTAG 
ACTGAGTATA 
AAGTAAAAAT 
AAAACCCTTG 
TTCTGCTTTT 
TGCCAGATTA 
CATCTTACTG 
AGATAACTTT 
TGGGTTTTAT 
GTATATATTG 
TGCAACTGTG 



51 
I 

ATGTGTGTGC 
GGGGCAGCTG 
CCACACCTAT 
AAATCTCTTA 
TAGTAGCTGA 
GGAACAGATC 
GGGAAATTGA 
TGGAACTAAG 
GTGATAATAT 
ACCTCAATCT 
TTAAAAATTT 
GAGAAAGTAT 
ATAATGAAGC 
AGGAAGAGGA 
AAGAGGAGGA 
GAGAAGAGGA 
ATGATGACTA 
G GG AG AAG AG 
TAAGACCAGA 
CATGTACGAT 
TTTACTATTT 
GTAGAGAAAA 
ATTTTTTAAA 
CTTTTAAACA 
AGTTTGCTAT 
AGGTGATCCA 
ACTAAGAGTG 
AAAGGTTCTA 
TTCATGGATT 
AGTCCAGAGA 
TGTTTATTTT 
TTCCCTTCTT 
CCAGGAAAAC 
AGTGAAGTAG 
TGCAGTAAGC 
TGGGATGAAG 
CGGAACAATT 
CTCAAATGAT 
ACAGATATAA 
TAAATTGACA 
ATTTGGGATA 
ACAAGTATAT 
CTTGGCTATC 
TGTCAGCAAA 
GTACTGCTGC 
TATGATGTTA 
GGGATAAGTT 
AAAAACTGAA 
CAAAAT ACT T 
TGCAATCAAA 
TAAGACTTAT 
ATTTTGTAGA 
TGTTAAGTTC 
TTTTGAATAA 



41 



51 



Seq ID NO: 323 Protein sequence: 
Protein Accession #: NP_112182.1 

1 11 21 31 

I I l I l I 

MEMKKKINLE LRNRSPEEVT ELVLDNCLCV NGEIEGLNDT FKELEFLSMA NVELSSLARL 
PSLNKLRKLE LSDNIISGGL EVLAEKCPNL TYLNLSGNKI KDLSTVEALQ NLKNIiKSLDL 
FNCEITNLED YRESIFELLQ QITYLDGFDQ EDNEAPDSEE EDDEDGDEDD EEEEENEAGP 
PEGYEEEEEE EEEEDEDEDE DEDEAGSELG EGEEEVGLSY LMKEEIQDEE DDDDYVEEGE 
EEEEEEEGGL RGEKRKRDAE DDGEEEDD 

Seq ID NO: 324 DNA sequence 
Nucleic Acid Accession #: NM_003812 
Coding sequence: 224.. 2722 



1 
I 

TCCTCTGCGT 
GCCCAGCCCC 
CCATGCGCGC 
GCTCTCGCCG 
CAGCTCGCGG 
ACGCGGCCCC 
GCTTCTCGTC 
GGCTGCTGCG 
GGCAGATGAA 
AATGCAGAAA 



11 
I 

CCCGCCCCGG 
GAGCCCCGCG 
CGAGCCGGCG 
GCCACACGGA 
CAGCCGCCCC 
GCCGGCTCGG 
CTTCTCCTGC 
CCCAGCGCTC 
GACAATACAT 
GAAATCACAC 



21 
I 

GAGTGGCTGC 
CCCCGTGCCC 
TGACCGGCTC 
GCGGCGCCCG 
TGGCGGGCTG 
TGCCTGCCAG 
TGCCTCCGCT 
CGCATTGGAA 
TGCAACAGAA 
TGCCTTCAAG 



31 

1 

GAGGCTAGGC 
CGAGCCCGGA 
CGCCCGCGGC 
GGAGCTATGA 
CAGCCTTGCC 
CGCCCCGGCC 
CGCCGCCTCG 
TGAAACTGCA 
TAGCAGCAGT 
ACTCATATAT 



41 
I 

GAGCCGGGAA 
GCCCCCTGCC 
CGCCCCGCAG 
GCCATGAAGC 
GGCGCTTCCT 
CGCACGCCGC 
TCCCGGCCCC 
GAAAAAAATT 
AATATCAGTT 
TACATCAACC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3 060 
3120 
3180 
3240 



60 
120 
180 
240 



51 
I 

AGGGGGCGCC 
CGCGGCGGCA 
CTAGCCCGGC 
CGCCCGGCAG 
GCGGCCCCCA 
CCTGCCGCCT 
GCGCCTGGGG 
TGGGAGTCCT 
ACAGCAATGC 
AAGACT CGG A 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



307 



WO 02/086443 

AAGCCCTTAT CACGTTCTTG ACACAAAGGC AAGACACCAG CAAAAACATA ATAAGGCTGT 660 

CCATCTGGCC CAGG CAAGCT TCCAGAT TGA AGCCTTCGGC TCCAAATTCA TTCTTGACCT 720 

CATACTGAAC AATGGTTTGT TGTCTTCTGA TTATGTGGAG ATTCACTACG AAAATGGGAA 780 

ACCACAGTAC TCTAAGGGTG GAGAGCACTG TTACTACCAT GGAAGCATCA GAGGCGTCAA 840 

5 AGACTCCAAG GTGGCTCTGT CAACCTGCAA TGGACTT CAT GGCATGTTTG AAGATGATAC 90 0 

CTTCGTGTAT ATGATAGAGC CACTAGAGCT GGTTCATGAT GAGAAAAGCA CAGGTCGACC 960 

ACATATAATC CAGAAAACCT TGGCAGGACA GTATTCTAAG CAAATGAAGA ATCTCACTAT 1020 

GGAAAGAGGT GACCAGTGGC CCTTTCTCTC TGAATTACAG TGGTTGAAAA GAAGGAAGAG 10 80 

AGCAGTGAAT CCATCACGTG GTATATTTGA AGAAATGAAA TATTTGGAAC TTATGATTGT 1140 

10 TAATGATCAC AAAACGTATA AGAAGCATCG CTCTTCTCAT GCACATACCA ACAACTTTGC 1200 

AAAGT CCGTG GTCAACCTTG TGGATTCTAT TTACAAGGAG CAGCTCAACA CCAGGGTTGT 12 60 

CCTGGTGGCT GTAGAGACCT GG AC TGAGAA GGATCAGATT GACATCACCA CCAACCCTGT 1320 

GCAGATGCTC CATGAGTTCT CAAAATACCG GCAGCGCATT AAGCAGCATG CTGATGCTGT 13 80 

GCACCTCATC TCGCGGGTGA CATTTCACTA TAAGAGAAGC AGTCTGAGTT ACTTTGGAGG 1440 

15 TGTCTGTTCT CGCACAAGAG GAGTTGGTGT GAATGAGTAT GGTCTTCCAA TGGCAGTGGC 1500 

ACAAGTATTA TCGCAGAGCC TGGCTCAAAA CCTTGGAATC CAATGGGAAC CTTCTAGCAG 1560 

AAAGCCAAAA TGTGACTGCA CAGAATCCTG GGGTGGCTGC AT CATGGAGG AAACAGGGGT 162 0 

GTCCCATTCT CGAAAATTTT CAAAGTGCAG CATTTTGGAG TATAGAGACT TTTTACAGAG 1680 

AGGAGGTGGA GCCTGCCTTT TCAACAGGCC AACAAAGCTA TTTGAGCCCA CGGAATGTGG 1740 

20 AAATGGATAC GTGGAAGCTG GGGAGGAGTG TGATTGTGGT TTTCATGTGG AATGCTATGG 1800 

ATTATGCTGT AAGAAATGTT CCCTCTCCAA CGGGGCTCAC TGCAGCGACG GGCCCTGCTG 1860 

TAACAATACC TCATGTCTTT TTCAGCCACG AGGGTATGAA TGCCGGGATG CTGTGAACGA 192 0 

GTGTGATATT ACTGAATATT GTACTGGAGA CTCTGGTCAG TGCCCACCAA ATCTTCATAA 198 0 

GCAAGACGGA TATGCATGCA ATCAAAATCA GGGCCGCTGC TACAATGGCG AGTGCAAGAC 2 040 

25 CAGAGACAAC CAGTGTCAGT ACAT CTGGGG AACAAAGGCT GCAGGGTCTG ACAAGTTCTG 2100 

CTATGAAAAG CTGAATACAG AAGGCACTGA GAAGGGAAAC TGCGGGAAGG ATGGAGACCG 2160 

GTGGATTCAG TGCAGCAAAC ATGATGTGTT CTGTGGATTC TTACTCTGTA CCAATCTTAC 222 0 

TCGAGCTCCA CGTATTGGTC AACTT CAGGG TGAGATCATT CCAACTTCCT TCTACCATCA 22 80 

AGGCCGGGTG ATTGACTGCA GTGGTGCCCA TGTAGTTTTA GATGATGATA CGGATGTGGG 2340 

30 CTATGTAGAA GATGGAACGC CATGTGGCCC GTCTATGATG TGTTTAGATC GGAAGTGCCT 2400 

ACAAATTCAA GCCCTAAATA TGAGCAGCTG TCCACTCGAT TCCAAGGGTA AAGTCTGTTC 2460 

GGGCCATGGG GTGTGTAGTA ATGAAGCCAC CTGCATTTGT GATTTCACCT GGG CAGGG AC 2520 

AGATTGCAGT ATCCGGGATC CAGTTAGGAA CCTTCACCCC CCCAAGGATG AAGGACCCAA 2 580 

GGGTCCTAGT GCCACCAATC TCATAATAGG CTCCATCGCT GGTGCCATCC TGGTAGCAGC 2 640 

35 TATTGTCCTT GGGGGCACAG GCTGGGGATT TAAAAATGTC AAGAAGAGAA GGTTCGATCC 2700 

TACTCAGCAA GGCCCCATCT GAATCAGCTG CGCTGGATGG ACACCGCCTT GCACTGTTGG 2 760 

ATTCTGGGTA TGACATACTC GCAGCAGTGT TACTGGAACT ATTAAGTTTG TAAACAAAAC 2 82 0 

CTTTGGGTGG TAATGACTAC GGAGCTAAAG TTGGGGTGAC AAGGATGGGG TAAAAGAAAA 2 8 80 

CTGTCTCTTT TGGAAATAAT GTCAAAGAAC ACCTTTCACC ACCTGTCAGT AAACGGGGGA 2 940 

40 GGGGGCAAAA GACCATGCTA TAAAAAGAAC TGTTCCAGAA TCTTTTTTTT TCCCTAATGG 3000 
ACGAAGGAAC AACACACACA CAAAAATTAA ATGCAATAAA GGAATCATTA AAAA 
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45 



Seq ID NO: 325 Protein sequence: 
Protein Accession NP_003803 



11 



21 



31 



41 



51 



50 
55 
60 
65 
70 
75 
80 
85 



MKPPGSSSRQ 
RPRAWGAAAP 
INQDSESPYH 
HYENGKPQYS 
KSTGRPHIIQ 
LELMIVNDHK 
ITTNPVQMLH 
LPMAVAQVLS 
RDFLQRGGGA 
SDGPCCNNTS 
NGECKTRDNQ 
LCTNLTRAPR 
LDRKCLQIQA 
KDEGPKGPSA 



PPLAGCSLAG 
SAPHWNETAE 
VLDTKARHQQ 
KGGEHCYYHG 
KTLAGQYSKQ 
TYKKHRSSHA 
EFSKYRQRIK 
QSLAQNLGIQ 
CLFNRPTKLF 
CLFQPRGYEC 
CQYIWGTKAA 
IGQLQGEIIP 
LNMSSCPLDS 
TNLIIGSIAG 



ASCGPQRGPA 
KNLGVLADED 
KHNKAVHLAQ 
SIRGVKDSKV 
MKNLTMERGD 
HTNNFAKSW 
QHADAVHLIS 
WEPSSRKPKC 
EPTECGNGYV 
RDAVNECD I T 
GSDKFCYEKL 
TSFYHQGRVI 
KGKVCSGHGV 
AILVAAIVLG 



GSVPASAPAR 
NTLQQNSSSN 
ASFQIEAFGS 
ALSTCNGLHG 
QWPFLSELQW 
NLVDSIYKEQ 
RVTFHYKRSS 
DCTESWGGCI 
EAGEE CDCGF 
EYCTGDSGQC 
NTEGTEKGNC 
DCSGAHWLD 
CSNEATCICD 
GTGWGFKNVK 



Seq ID NO: 32 6 DNA sequence 
Nucleic Acid Accession #: AK074418. 
Coding sequence: 244-1515 



CTTTCTCCAA 
ACCTGAGAGC 
AACAGATGGA 
AAATGACAGA 
GTCATGGCGT 
GACTTTACCA 
TTCCCCGCAG 
GTGATATGGA 
ATAAGCAGAT 
GGATCCTTGA 
CACCAGTATG 
GTGATTGATG 
CAAAACCAAG 
TATTCCGATC 
ATCACCAACA 
ACCAAGGCAG 
GCGATGGAGA 
CAATACCGAA 
ACCGAATGGA 



11 
I 

GACGGCCGGC 
CTGTGGAACC 
AAAAGTGAGG 
GCTGGAGTTC 
ATTACCAGGA 
CCTTGCGGGA 
CAGATTCTTC 
AGCGGCCACA 
TTGACATCCA 
CTCAGAACCC 
CTGGCATTTT 
ACCGCCTACC 
AGTTCTGGCC 
TGCACTATGG 
TCCATCTGCA 
GCTCCCTGAT 
ATGGGCTGGT 
GGGGCTGGGA 
GAGGGCGCTG 



21 
I 

CATGCTCTCC 
TGCCCGTCTC 
GACCGGTGAG 
TGCTGTGCCT 
GCCTTCAGTG 
TCACTGCCTG 
CATAGGCCAG 
GGATCTACCA 
ACAAGGAGGC 
ACAGTACAGG 
CCGTTTCCGG 
TGTCCAGGGA 
CTGCCTGCTG 
CTTCCTCGAG 
CTCTTCCCCT 
AACCTGTGCC 
GAGTCTCCAT 
AGAAATTATC 
GAG TG ATGGG 



31 

I 

TCCTCTGCCA 
CCCTCCTCCA 
TGACTTGCTG 
GGAAAGGACC 
GAGACCTCCA 
AGCATGGGCC 
AAGCTGCTCC 
GGGGGTCCTC 
GCAGCTGACT 
CAGAAGATCC 
TTCTGGCAAT 
GATAAATGCC 
GAGAAGGCCT 
GATGCCCTGG 
GTGGACCTGG 
ACTCCAAGTG 
GCCTACACTG 
TCCCTGTGGA 
TCTCAGGAGT 



TPPCRLLLVL 
ISYSNAMQKE 
KFILDLILNN 
MFEDDTFVYM 
LKRRKRAVNP 
LNTRWLVAV 
LSYFGGVCSR 
MEETGVSHSR 
HVECYGLCCK 
PPNLHKQDGY 
GKDGDRWIQC 
DDTDVGYVED 
FTWAGTDCSI 
KRRFDPTQQG 



41 

f 

GTCTCCTCCA 
TCAGACACAC 
CTAAAGTTTA 
TCGGAAGTCT 
TCATCAAGTT 
GGACGTTTAA 
AGGAAAAACG 
CTCACTTCAT 
GCTGGTTCCT 
TGATGGTCCA 
GTGGCCAGTG 
TCTTTGTGCG 
ATGCCAAGCT 
TGGACCTCAC 
TGAAGGCAGT 
GGCCAACAGA 
TGACTGGGGC 
ACCCCTGGGG 
GGGAGGAAAC 



LLLPPLAASS 
ITLPSRLIYY 
GIiLSSDYVEI 
IEPLELVHDE 
SRGIFEEMKY 
ETWTEKDQID 
TRGVGVNEYG 
KFSKCSILEY 
KCSbSNGAHC 
ACNQNQGRCY 
SKHDVFCGFIj 
GTPCGPSMMC 
RDPVRNLHPP 
PI 



51 
I 

CCACTCTCTA 
CTGCCTAGGA 
TACCAGATGC 
TCTAAGGAGA 
CAAAGACCAG 
GGATGAGACA 
CCTCTCCAAT 
CCTGGATGAT 
GGCAGCACTG 
AAGCTTTTCA 
GGTGGAAGTG 
TCCTCGCCAC 
GCTCGGATCC 
AGGAGGCGTG 
GAAGACAGCG 
TACAGCACAG 
TGAGCAGATT 
CTGGGGCGAG 
CTGTGATCCG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
72 0 
780 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



308 



WO 02/086443 

CGGAAAAGCC AGCTACATAA GAAACGGGAA GATGGCGAGT TTTGGATGTC GTGTCAAGAT 12 00 

TTCCAACAGA AATTCATCGC CAT GTTT AT A TGTAGCGAAA TTCCAATTAC CCTGGACCAT 1260 

GGAAACACAC TCCACGAAGG ATGGTCCCAA ATAATGTTTA GGAAG CAAGT GATTCTAGGA 1320 

AACACTGCAG GAGGACCTCG GAATGATGCT CAATTCAACT TCTCTGTGCA AGAGCCAATG 13 80 

5 GAAGGCACCA ATGTTGTCGT GTGCGTCACA GTTGCTGTCA CACCATCAAA TTTGAAAGCA 1440 

GAAGATGCAA AATTTCCACT CGATTTCCAA GTGATTCTGG CTGGCTCACA GAAACACTGT 1500 

CCAAAGCTCA AATAATAAAT TCCGCCGCAA CTTCACCATG ACTTACCATC TGAGCCCTGG 15 60 

GAACTATGTT GTGGTTGCAC AGACACGGAG AAAATCAGCG GAGTTCTTGC TCCGAATCTT 1620 

CCTGAAAATG CCAGACAGTG ACAGGCACCT GAGCAGCCAT TTCAACCTCA GAATGAAGGG 1680 

10 AAGCCCTTCA GAACATGGCT CCCAACAAAG CATTTTCAAC AGATATGCTC AGCAGGTATG 1740 

GTACCT AG CA CCCAGGGGCC TTACGTGGGA TTGGAGAAAG GGGACCTGAG GGAGGGACAG 1800 

CCCTCACAGG CCCTTACTGG GATGCAGAGA GGAGAAGTGA CTTGATGGAC TATTTTACCT 1860 

GCCTCTCTTC CTGGATCGTC TCCAGAACTG CTGTGGCTGC CAAGCTCGGT AGAGACGTGG 1920 

CGCCCCACCC AGTCTCATCC GGGGGACTTC AAGCTGGAAT GCAGAGCTTA GAAAGGGAGG 1980 

15 GGATAATTAT GGGGTGTGAG GTGCATTGCC CTCTAAATCT TTAAACAAGC AATTGGCAGT 2040 

ACCCCGTGAA ACCTTTCCTT CTCCTACTCG- GCCACCTCCC ACCAACCTGG CATCGTTCCT 2100 

CCCGGGAGCT AGCCAGCTTC AGAAAGCACA TACAGCATCC TTGCTGCCAA ACCACCTATG 2160 

TGCACACAGG ATTTCCTTAA TGGCTTAATA AACTGTTATA AAGAACTCCT TGACTTGTCA 2220 

GAATAAAATA GCTGCCAGGG GCTCTGCACA ATGAGCCTCT TACCGTTAAA AAAAAAAAAA 2280 

20 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
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Seq ID NO: 327 Protein sequence: 
Protein Accession #: BAB85075.1 



MAYYQEPSVE 
IWKRPQDLPG 
QYAGIFRFRF 
SDLHYGFLED 
MENGLVSLHA 
KSQLHKKRED 
TAGGPRNDAQ 
KLK 



11 
I 

TSIIKFKDQD 
GPPHFILDDI 
WQCGQWVEW 
ALVDLjTGGV I 
YTVTGAEQIQ 
GEFWMSCQDF 
FNFSVQEPME 



21 
1 

FTTLRDHCLS 
SRFDIQQGGA 
IDDRLPVQGD 
TNIHLHSSPV 
YRRGWEEIIS 
QQKFIAMFIC 
GTNWVCVTV 



31 

I 

MGRTFKDETF 
ADCWFLAALG 
KCLFVRPRHQ 
DLVKAVKTAT 
LWNPWGWGET 
SEIPITLDHG 
AVTPSNLKAE 



41 
1 

PAADSSIGQK 
SLTQNPQYRQ 
NQEFWPCLLE 
KAGSIjITCAT 
EWRGRWSDGS 
NTLHEGWSQI 
DAKFPLDFQV 



51 
1 

LiLQEKRIiSNV 
KILMVQSFSH 
KAYAKLLGSY 
PSGPTDTAQA 
QEWEETCDPR 
MFRKQVILGN 
ILAGSQKHCP 



Seq ID NO: 328 DNA sequence 

Nucleic Acid Accession #: BC017490.1 

Coding sequence: 74-2788 



GTGGGTCACG 
TCGTGGTACT 
GCGTCGGCGA 
TGCCCTCACC 
CCTAGGCACA 
CATGGAAAGG 
TCTGGATGAT 
CATGCGGCAG 
GTATGACAGC 
GGCCACGGAG 
TCTCAAAGGC 
CCACCGCTTC 
GGAGCGCATC 
CTTGGCAGCC 
GCAGATCTTT 
CATCACCAAC 
GCTGAGGCAG 
TGGCGTCCTG 
GGGTCCTTTC 
GTCGGCCGGC 
CCGAATCCAG 
CATTCTCCTC 
CATCTATCAC 
C ACTGT CATC 
GACCGATGAA 
GATCTTTGCC 
TCTGGCCCTG 
TATCAACGTG 
TGAGAAAGTG 
CACGGCGTAT 
GGTTCTGGCT 
CAGAACCAGC 
CGTCACCTCC 
CTACGACCCC 
CTTTGACATC 
CCGCTTCGTG 
GGCCAATGGC 
CCAGGAGGTC 
CCAGATGGAC 
GACAGGCAGC 
CCACGCGCGC 
CGTGATGCTG 
GACTTTTGCC 
GAAGCAGTTA 
CACTATTGAG 



11 
I 

TGAACCACTT 
GCTATGGCGG 
GGCAATGATC 
TCCAGCCCTG 
GAGGGGCCCC 
GACTACCGCG 
GAGGACGTAG 
CGTGACCGGG 
GATGAGGAGG 
GACGGCGAGG 
CACTCTGTGC 
AAGAACTTCC 
AG CG AC ATGT 
AGGGAGCACG 
GATGAGGCTG 
CACATCCATG 
CTGCAT CTGA 
CCCCAGCTCA 
TGCCAGTCCC 
CCCTTTGAGG 
GAGAGTCCAG 
GCAGATCTGG 
AACAACTATG 
CTAGCCAACC 
GATGTGAAGA 
AGCATTGCTC ' 
TTCGGAGGGG 
CTCTTGTGCG 
TCCAGCCGAG 
GTCCAGCGGC 
GACCGAGGAG 
ATCCATGAGG 
CTGCAGGCTC 
TCGCTGACTT 
CTGTGTGTGG 
GTGGGCAGCC 
AGCGCTGCTG 
CTGAAGAAGT 
CAGGACAAGG 
ATCCCCATTA 
ATCCATCTGC 
GAGAGCTTCA 
CGCTACCTTT 
GTGGCAGAGC 
GTCCCTGAGA 



21 
i 

TTCGCGCGAA 
AAT CATCGGA 
CTCTCACCTC 
GCCGTGACCT 
TGGAGGAAGA 
CCATCCCAGA 
AGGAGCTGAC 
AGGCTGGCCG 
ACGAGGAGCG 
AGGACGAGGA 
GCGAGTGGGT 
TGCGCACTCA 
GCAAAGAGAA 
TGCTGGCCTA 
CCCTGGAGGT 
TCCGCATCTC 
ACCAGCTGAT 
GCATGGTCAA 
AGAACCAGGA 
TCAACATGGA 
GCAAAGTGGC 
TGGACAGCTG 
ATGGCTCCCT 
ACGTGGCCAA 
TGATCACTAG 
CTTCCATCTA 
AGCCCAAAAA 
GAGACCCTGG 
CCATCTTCAC 
ACCCTGTCAG 
TGTGTCTCAT 
CCATGGAGCA 
GCTGCACGGT 
TCTCTGAGAA 
TGAGGGACAC 
ACGTCAGACA 
AGCCCGCCAT 
ACATCATCTA 
TGGCCAAGAT 
CGGTGCGGCA 
GGGACTATGT 
TAGACACACA 
CATTCCGGCG 
AGGTGACATA 
AGGACTTGGT 



31 
I 

ACCTGGTTGT 
ATCCTTCACC 
CAGCCCTGGC 
TCCACCATTT 
AGAGGATGGA 
GCTGGACGCC 
GGCCAGTCAG 
GGGCCTGGGC 
CCCTGCCCGC 
GATGATCGAG 
GAGCATGGCG 
CGTCGACAGC 
CCGTGAGAGC 
CTTCCTGCCT 
GGTACTGGCC 
CCACCTGCCT 
CCGCACCAGT 
GTACAACTGC 
GGTGAAACCA 
GGAGACCATC 
GGCTGGCCGG 
CAAGCCAGGA 
CAACACTGCC 
GAAGGACAAC 
CCTCTCCAAG 
TGGTCATGAA 
CCCAGGTGGC 
CACAGCGAAG 
CACTGGCCAG 
CAGGGAGTGG 
TGATGAATTT 
ACAGAGCATC 
CATTGCTGCC 
CGTGGACCTC 
CGTGGACCCA 
CCACCCCAGC 
GCCCAACACG 
CGCCAAGGAG 
GTACAGTGAC 
CATCGAGTCC 
GATCGAAGAC 
GAAGTTCAGC 
TGACAACAAT 
TCAGCGCAAC 
GGATAAGGCT 



41 
I 

TGCTGTAGTG 
ATGGCATCCA 
CGAAGCTCCC 
GAGGATGAGT 
GAGGAGCTCA 
TATGAGGCCG 
AGGGAGGCAG 
CGCATGCGCC 
AAGCGCCGCC 
AGCATCGAGA 
GGCCCCCGGC 
CACGGCCACA 
CTGGTGGTGA 
GAGGCACCGG 
ATGTACCCCA 
CTGGTGGAGG 
GGGGTGGTGA 
AACAAGTGCA 
GGCTCCTGTC 
TATCAGAACT 
CTGCCCCGCT 
GACGAGATAG 
AATGGCTTCC 
AAGGTTGCTG 
GATCAGCAGA 
GACATCAAGA 
AAGCACAAGG 
TCGCAGTTTC 
GGGGCGTCGG 
ACCTTGGAGG 
GACAAGATGA 
TCCATCTCGA 
GCCAACCCCA 
ACAGAGCCCA 
GTCCAGGACG 
AACAAGGAGG 
TATGGCGTGG 
AGGGTCCACC 
CTGAGGAAAG 
ATGATCCGCA 
GACGTCAACA 
GTCATGCGCA 
GAGCTGTTGC 
CGCTTTGGGG 
CGTCAGAT CA 



51 
I 

GCGGAGAGGA 
GCCCGGCCCA 
GGCGTACTGA 
CCGAGGGGCT 
TTGGAGATGG 
AGGGACTGGC 
CAGAGCGGGC 
GTGGGCTCCT 
AGGTGGAGCG 
ACCTGGAGGA 
TGGAGAT CCA 
ACGTCTTCAA 
ACTATGAGGA 
CGGAGCTGCT 
AGTACGACCG 
AGCTGCGCTC 
CCAGCTGCAC 
ATTTCGTCCT 
CTGAGTGCCA 
ACCAGCGTAT 
CCAAGGACGC 
AGCTGACTGG 



TAGGGGAACT 
TCGGAGAGAA 
GAGGCCTGGC 
TACGTGGTGA 
TCAAGTATAT 
CTGTGGGCCT 
CTGGGGCCCT 
ATGACCAGGA 
AGGCTGGCAT 
TAGGAGGGCG 
TCATCTCACG 
AGATGCTGGC 
AGGAGGGGCT 
AGCCCCTGCC 
CGAAGCTCAA 
AATCTATGGC 
TGGCGGAGGC 
TGGCCATCCG 
GCATGCGCAA 
TCTTCATACT 
CCCAGCAGGA 
ACATCCACAA 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 



309 



WO 02/086443 

CCTCTCTGCA TTTTATGACA GTGAGCTCTT CAGGATGAAC AAGTTCAGCC ACGACCTGAA 2760 

AAGGAAAATG ATCCTGCAGC AGTTCTGAGG CCCTATGCCA TCCATAAGGA TTCCTTGGGA 2 820 

TTCTGGTTTG GGGTGGTCAG TGCCCTCTGT GCTTTATGGA CACAAAACCA GAGCACTTGA 2880 

TGAACT CGGG GTACTAGGGT CAGGGCTTAT AGCAGGATGT CTGGCTGCAC CTGGCATGAC 2 940 

TGTTTGTTTC TCCAAGCCTG CTTTGTGCTT CTCACCTTTG GGTGGGATGC CTTGCCAGTG 3 0 00 

TGTCTTACTT GGTTGCTGAA CATCTTGCCA CCTCCGAGTG CTTTGTCTCC ACT CAGTACC 3 060 

TTGGATCAGA GCTGCTGAGT TCAGGATGCC TGCGTGTGGT TTAGGTGTTA GCCTTCTTAC 3120 

ATGGATGTCA GGAGAGCTGC TGCCCTCTTG GCGTGAGTTG CGTATTCAGG CTGCTTTTGC 3180 

TGCCTTTGGC CAGAGAGCTG GTTGAAGATG TTTGTAATCG TTTTCAGTCT CCTGCAGGTT 3240 

TCTGTGCCCC TGTGGTGGAA GAGGGCACGA CAGTGCCAGC GCAGCGTTCT GGGCTCCTCA 33 00 

GTCGCAGGGG TGGGATGTGA GTCATGCGGA TTATCCACTC GCCACAGTTA TCAGCTGCCA 33 60 

TTGCTCCCTG TCTGTTTCCC CACTCTCTTA TTTGTGCATT CGGTTTGGTT TCTGTAGTTT 342 0 
TAATTTTTAA TAAAGTTGAA TAAAATATAA AAAAAAAAAA AAAAAA 

Seq ID NO: 329 Protein sequence: 
Protein Accession #: AAH17490.1 

1 11 21 31 41 51 

I I I I I I 

MAESSESFTM ASSPAQRRRG NDPLTSSPGR SSRRTDALTS SPGRDLPPFE DESEGLLGTE 60 

GPLEEEEDGE ELIGDGMERD YRAIPELDAY EAEGLALDDE DVEELTASQR EAAERAMRQR 120 

DREAGRGLGR MRRGLLYDSD EEDEERPARK RRQVERATED GEEDEEMIES IENLEDLKGH 18 0 

SVREWVSMAG PRIiEIHHRFK NFLRTHVDSH GHNVFKERIS DMCKENRESL WNYEDLAAR 240 

EHVLAYFLPE APAELLQIFD EAALEWLAM YPKYDRITNH IHVRISHLPL VEELRSLRQL 3 00 

HLNQLIRTSG WTSCTGVLP QLSMVKYNCN KCNFVLGPFC QSQNQEVKPG SCPECQSAGP 3 60 

FEVNMEETIY QNYQRIRIQE SPGKVAAGRL PRSKDAILLA DLVDSCKPGD EIELTGIYHN 420 

NYDGSLNTAN GFPVFATVIL ANHVAKKDNK VAVGELTDED VKMITSLSKD QQXGEKIFAS 480 

IAPSIYGHED IKRGLALAliF GGEPKNPGGK HKVRGDINVL LCGDPGTAKS QFLKYIEKVS 540 

SRAIFTTGQG ASAVGLTAYV QRHPVSREWT LEAGALVLAD RGVCLIDEFD KMNDQDRTSI 600 

HEAMEQQSIS ISKAGIVTSL QARCTVIAAA NPIGGRYDPS LTFSENVDLT EPIISRFDIL 660 

CWRDTVDPV QDEMLARFW GSHVRHHPSN KEEEGLANGS AAEPAMPNTY GVEPLPQEVL 72 0 

KKYIIYAKER VHPKLNQMDQ DKVAKMYSDL RKESMATGSI PITVRHIESM IRMAEAHARI 780 

HLRDYVIEDD VNMAIRVMLE SFIDTQKFSV MRSMRKTFAR YLSFRRDNNE LLLFIIiKQLV 840 

AEQVTYQRNR FGAQQDTIEV PEKDLVDKAR QINIHNIjSAF YDSELFRMNK FSHDLKRKMI 90 0 
LQQF 

Seq ID NO: 330 DNA sequence 
Nucleic Acid Accession #: M17254 
Coding sequence: 257-1645 

1 11 21 31 41 51 

I I I I I I 

GTCCGCGCGT GTCCGCGCCC GCGTGTGCCA GCGCGCGTGC CTTGGCCGTG CGCGCCGAGC 60 

CGGGTCGCAC TAACTCCCTC GGCGCCGACG GCGGCGCTAA CCTCTCGGTT ATTCCAGGAT 12 0 

CTTTGGAGAC CCGAGGAAAG CCGTGTTGAC CAAAAGCAAG ACAAATGACT CACAGAGAAA 180 

AAAGATGGCA GAACCAAGGG CAACTAAAGC CGTCAGGTTC TGAACAGCTG GTAGATGGGC 24 0 

TGGCTTACTG AAGGACATGA TTCAGACTGT CCCGGACCCA GCAGCT CAT A TCAAGGAAGC 300 

CTTATCAGTT GTGAGTGAGG ACCAGTCGTT GTTTGAGTGT GCCTACGGAA CGCCACACCT 360 

GGCTAAGACA GAGATGACCG CGTCCTCCTC CAGCGACTAT GGACAGACTT CCAAGATGAG 420 

CCCACGCGTC CCTCAGCAGG ATTGGCTGTC TCAACCCCCA GCCAGGGTCA CCATCAAAAT 480 

GGAATGTAAC CCTAGCCAGG TGAATGGCTC AAGGAACTCT CCTGATGAAT GCAGTGTGGC 540 

CAAAGGCGGG AAGATGGTGG GCAGCCCAGA CACCGTTGGG ATGAACTACG GCAGCTACAT 600 

GGAGGAGAAG CACATGCCAC CCCCAAACAT GACCACGAAC GAGCGCAGAG TTATCGTGCC 660 

AGCAG AT CCT ACGCTATGGA GTACAGACCA TGTGCGGCAG TGGCTGGAGT GGGCGGTGAA 720 

AGAATATGGC CTTCCAGACG TCAACATCTT GTTATTCCAG AACATCGATG GGAAGGAACT 780 

GTGCAAGATG ACCAAGGACG ACTTCCAGAG GCTCACCCCC AGCTACAACG CCGACATCCT 840 

TCTCTCACAT CTCCACTACC TCAGAGAGAC TCCTCTTCCA CATTTGACTT CAGATGATGT 900 

TGATAAAGCC TTACAAAACT CTCCACGGTT AATGCATGCT AGAAACACAG ATTTACCATA 960 

TGAGCCCCCC AGGAGAT CAG CCTGGACCGG TCACGGCCAC CCCACGCCCC AGTCGAAAGC 102 0 

TGCTCAACCA TCTCCTTCCA CAGTGCCCAA AACTGAAGAC CAGCGTCCTC AGTTAG AT CC 10 80 

TTATCAGATT CTTGGACCAA CAAGTAGCCG CCTTGCAAAT CCAGGCAGTG GCCAGATCCA 1140 

GCTTTGGCAG TTCCTCCTGG AGCTCCTGTC GGACAGCTCC AACTCCAGCT GCATCACCTG 1200 

GGAAGGCACC AACGGGGAGT TCAAGATGAC GGATCCCGAC GAGGTGGCCC GGCGCTGGGG 12 60 

AGAGCGGAAG AGCAAACCCA ACATGAACTA CGATAAGCTC AGCCGCGCCC TCCGTTACTA 1320 

CTATGACAAG AACATCATGA CCAAGGTCCA TGGGAAGCGC TACGCCTACA AGTTCGACTT 13 80 

CCACGGG AT C GCCCAGGCCC TCCAGCCCCA CCCCCCGGAG TCATCTCTGT ACAAGTACCC 1440 

CTCAGACCTC CCGTACATGG GCTCCTATCA CGCCCACCCA CAGAAGATGA ACTTTGTGGC 150 0 

GCCCCACCCT CCAGCCCTCC CCGTGACATC TTCCAGTTTT TTTGCTGCCC CAAACCCATA 1560 

CTGGAATTCA CCAACTGGGG GTATATACCC CAACACTAGG CTCCCCACCA GCCATATGCC 1620 

TTCTCATCTG GGCACTTACT ACTAAAGACC TGGCGGAGGC TTTTCCCATC AGCGTGCATT 1680 

CACCAGCCCA TCGCCACAAA CTCTATCGGA GAACATGAAT CAAAAGTGCC TCAAGAGGAA 174 0 

TGAAAAAAGC TTTACTGGGG CTGGGGAAGG AAGCCGGGGA AG AG AT C CAA AGACTCTTGG 1800 

GAGGGAGTTA CTGAAGTCTT ACTACAGAAA TGAGGAGGAT GCTAAAAATG TCACGAATAT 18 60 

GGACATATCA TCTGTGGACT GACCTTGTAA AAGACAGTGT ATGTAGAAGC ATGAAGT CTT 192 0 

AAGGACAAAG TGCCAAAGAA AGTGGTCTTA AGAAATGTAT AAACTTTAGA GTAGAGTTTG 1980 

AATCCCACTA ATGCAAACTG GGATGAAACT AAAGCAATAG AAACAACACA GTTTTGACCT 2 040 

AACATACCGT TTATAATGCC ATTTTAAGGA AAACTACCTG TATTTAAAAA TAGTTTCATA 2100 

TCAAAAACAA GAGAAAAGAC ACGAGAGAGA CTGTGGCCCA TCAACAGACG TTGATATGCA 2160 

ACTGCATGGC ATGTGCTGTT TTGGTTGAAA TCAAATACAT TCCGTTTGAT GGACAGCTGT 2220 

CAGCTTTCTC AAACTGTGAA GATGACCCAA AGTTTCCAAC TCCTTTACAG TATTACCGGG 2280 

ACTATGAACT AAAAGGTGGG ACTGAGGATG TGTATAGAGT GAGCGTGTGA TTGTAGACAG 23 40 

AGGGGTGAAG AAGGAGGAGG AAGAGGCAGA GAAGGAGGAG ACCAGGCTGG GAAAGAAACT 2400 

TCTCAAGCAA TGAAGACTGG ACTCAGGACA TTTGGGGACT GTGTACAATG AGTTATGGAG 2460 

ACTCGAGGGT TCATGCAGTC AGTGTTATAC CAAACCCAGT GTTAGGAGAA AGGACACAGC 2520 

GTAATGGAGA AAGGGAAGTA GTAGAATTCA GAAACAAAAA TGCGCATCTC TTTCTTTGTT 2 580 

TGTCAAATGA AAATTTTAAC TGGAATTGTC TGATATTTAA GAGAAACATT CAGGACCTCA 2 640 

TCATTATGTG GGGGCTTTGT TCTCCACAGG GTCAGGTAAG AGATGGCCTT CTTGGCTGCC 2700 



310 



WO 02/086443 

ACAATCAGAA ATCACGCAGG CATTTTGGGT AGGCGGCCTC CAGTTTTCCT TTGAGTCGCG 2760 

AACGCTGTGC GTTTGTCAGA ATGAAGTATA CAAGTCAATG TTTTTCCCCC TTTTTATATA 2820 

ATAATTATAT AACTTATGCA TTTATACACT ACGAGTTGAT CTCGGCCAGC CAAAGACACA 2880 

CGACAAAAGA GACAATCGAT ATAATGTGGC CTTGAATTTT AACTCTGTAT GCTTAATGTT 2 940 

TACAATATGA AGTTATTAGT TCTTAGAATG CAGAATGTAT GTAATAAAAT AAGCTTGGCC 3000 

T AGCATGG CA AATCAGATTT ATACAGGAGT CTGCATTTGC ACTTTTTTTA GTGACTAAAG 3060 

TTGCTTAATG AAAACATGTG CTGAATGTTG TGGATTTTGT GTTATAATTT ACTTTGTCCA 312 0 
GGAACTTGTG CAAGGGAGAG CCAAGGAAAT AGGATGTTTG GCACCC 

Seq ID NO: 331 Protein sequence 
Protein Accession #j AAA52398 

1 11 21 31 41 51 

I I 1 I I 1 

MIQTVPDPAA HIKEALSWS EDQSLFECAY GTPHLAKTEM TASSSSDYGQ TSKMSPRVPQ 60 

QDWLSQPPAR VTIKMECNPS QVNGSRNSPD ECSVAKGGKM VGSPDTVGMN YGSYMEEKHM 120 

PPPNMTTNER RVIVPADPTL WSTDHVRQWL EWAVKEYGLP DVNILLFQNI DGKELCKMTK 180 

DDFQRLTPSY NADILLSHLH YLRETPLPHL TSDDVDKALQ NSPRLMHARN TDLPYEPPRR 240 

SAWTGHGHPT PQSKAAQPSP STVPKTEDQR PQLDPYQILG PTSSRLANPG SGQIQLWQFL 3 00 

LELLSDSSNS SCITWEGTNG EFKMTDPDEV ARRWGERKSK PNMNYDKLSR ALRY YYDKN I 3 60 

MTKVHGKRYA YKFDFHGIAQ ALQPHPPESS LYKYPSDLPY MGSYHAHPQK MNFVAPHPPA 420 

LPVTSSSFFA APNPYWNSPT GGIYPNTRLP TSHMPSHLGT YY 462 

Seq ID NO: 332 DNA sequence 
Nucleic Acid Accession #: NM_000020 
Coding sequence: 283-1794 

1 11 21 31 41 51 

I l I I I > 

AGGAAACGGT TTATTAGGAG GGAGTGGTGG AGCTGGGCCA GGCAGGAAGA CGCTGGAATA 60 

AGAAACATTT TTGCTCCAGC CCCCATCCCA GTCCCGGGAG GCTGCCGCGC CAGCTGCGCC 12 0 

GAGCGAGCCC CTCCCCGGCT CCAGCCCGGT CCGGGGCCGC GCCGGACCCC AGCCCGCCGT 180 

CCAGCGCTGG CGGTGCAACT GCGGCCGCGC GGTGGAGGGG AGGTGGCCCC GGTCCGCCGA 240 

AGGCTAGCGC CCCGCCACCC GCAGAGCGGG CCCAGAGGGA CCATGACCTT GGGCTCCCCC 300 

AGGAAAGGCC TTCTGATGCT GCTGATGGCC TTGGTGACCC AGGGAGACCC TGTGAAGCCG 360 

TCTCGGGGCC CGCTGGTGAC CTGCACGTGT GAGAGCCCAC ATTG CAAGGG GCCTACCTGC 420 

CGGGGGGCCT GGTGCACAGT AGTGCTGGTG CGGGAGGAGG GGAGGCACCC CCAGGAACAT 480 

CGGGGCTGCG GGAACTTGCA CAGGGAGCTC TGCAGGGGGC GCCCCACCGA GTT CGTCAAC 540 

CACTACTGCT GCGACAGCCA CCTCTGCAAG CACAACGTGT CCCTGGTGCT GGAGGCCACC 600 

CAACCTCCTT CGGAGCAGCC GGGAACAGAT GGCCAGCTGG CCCTGATCCT GGGCCCCGTG 660 

CTGGCCTTGC TGGCCCTGGT GGCCCTGGGT GTCCTGGGCC TGTGGCATGT CCGACGGAGG 72 0 

CAGGAGAAGC AGCGTGGCCT GCACAGCGAG CTGGGAGAGT CCAGTCTCAT CCTGAAAGCA 7 80 

TCTGAGCAGG GCGACACGAT GTTGGGGGAC CTCCTGGACA GTGACTGCAC CACAGGGAGT 840 

GGCTCAGGGC TCCCCTTCCT GGTGCAGAGG ACAGTGGCAC GGCAGGTTGC CTTGGTGGAG 900 

TGTGTGGGAA AAGGCCGCTA TGGCGAAGTG TGGCGGGGCT TGTGGCACGG TGAGAGTGTG 960 

GCCGTCAAGA TCTTCTCCTC GAGGGATGAA CAGTCCTGGT TCCGGGAGAC TGAGATCTAT 102 0 

AACACAGTAT TGCTCAGACA CGACAACATC CTAGGCTTCA TCGCCTCAGA CATGACCTCC 10 8 0 

CGCAACTCGA GCACGCAGCT GTGGCTCATC ACGCACTACC ACGAGCACGG CTCCCTCTAC 1140 

GACTTTCTGC AGAGACAGAC GCTGGAGCCC CATCTGGCTC TGAGGCTAGC TGTGTCCGCG 1200 

GCATGCGGCC TGGCGCACCT GCACGTGGAG ATCTTCGGTA CACAGGGCAA ACCAGCCATT 12 60 

GCCCACCGCG ACTTCAAGAG CCGCAATGTG CTGGTCAAGA GCAACCTGCA GTGTTGCATC 132 0 

GCCGACCTGG GCCTGGCTGT GATGCACTCA CAGGGCAGCG ATTACCTGGA CATCGGCAAC 1380 

AACCCGAGAG TGGGCACCAA GCGGTACATG GCACCCGAGG TGCTGGACGA GCAGATCCGC 1440 

ACGGACTGCT TTGAGTCCTA CAAGTGGACT GACATCTGGG CCTTTGGCCT GGTGCTGTGG 1500 

GAGATTGCCC GCCGG AC CAT CGTGAATGGC AT CGTGGAGG ACTATAGACC ACCCTTCTAT 1560 

GATGTGGTGC CCAATGACCC CAGCTTTGAG GACATGAAGA AGGTGGTGTG TGTGGATCAG 162 0 

CAGACCCCCA CCATCCCTAA CCGGCTGGCT GCAGACCCGG TCCTCTCAGG CCTAGCTCAG 1680 

ATGATGCGGG AGTGCTGGTA CCCAAACCCC TCTGCCCGAC TCACCGCGCT GCGGATCAAG 1740 

AAGACACTAC AAAAAATTAG CAACAGTCCA GAGAAGCCTA AAGTGATTCA ATAGCCCAGG 1800 

AGCACCTGAT TCCTTTCTGC CTGCAGGGGG CTGGGGGGGT GGGGGGCAGT GGATGGTGCC 1860 

CTATCTGGGT AGAGGTAGTG TGAGTGTGGT GTGTGCTGGG GATGGGCAGC TGCGCCTGCC 1920 

TGCTCGGCCC CCAGCCCACC CAGCCAAAAA TACAGCTGGG CTGAAACCTG ATCCCCTGCT 1980 

GTCTGGCCTG CTCAAAGCGG CAGGCTCCCT GACGCCTGGC TCTCTCCCCA CCCCTATGGC 2 040 

CAGCATGGTG CACCCCCTAC CACTCCCGGG ACAGGATGCA AAAGAGGCT C CAGAGTCAGA 2100 

GTGCCAAGCC AGGGAATCCC AGT CCCAG AC TCAGAGCCCG GGCCTGCACT TTGCCCCCTG 2160 

CCCTTGATCA ACCCCACTGC CCCACCAGAG CTGCCAGGGT GGCACAGGGC CCTGTCCAGC 222 0 

CCCTGGCACA CACTTCCCTG CCAGGCCTCA GCCTCTAGCA TAAGCT CCAG AGAGCCAGGG 22 80 

CCCATCAGTT TCTCTCTGTG GAT TTGTATC TCAGCTCCAT GATGCCTTGG GCTTTCTGTC 234 0 

TCCTCAACAA GAGTGCAGCT TGCTGAATGT CAGCTGCCTG AGAGAGCTGG GGCCTGACTT 2400 

ACTAGGGCAT TAAATCCTAA GAGGTCCTAC TGAGGTGTGG CAGGATCACA GGCCAGTGGA 2460 

AAAAGGGCAG GTCAGATGGG CAAGG CCCAG GACTTTCAGA TTAACTGAGA GGATATCGAG 252 0 

GCCAAGCATG GCAGGGGGAA GGTCAGTGGG TGTCAAGAGA CCCAGGTCTG ACCCCGGATG 2580 

TTTGCTCCAT GTGACAAAAG CAGGCCTGTC TCAGGACCTT TTCTTTTCTT TTTTCCTTCT 2 640 

TTTTTTTTTT GACACGGAGT TTCGCTCTTG TTGTCCAGGC TAGAGTGCAA TGGCATGATC 2700 

CCAGCTCACC GCAACGTCTA CCTCCCAGGT TCAAATCATT CTCTTGCCTC AGACTCCCGA 2760 

GTAGCTGGGA TTACAGGCAC ATGCCACCAT GCCTGGCTAA TTTTGTATAT TTAGTAGAAA 2820 

CAGGGTTTCA CCATGCTGGC CATGCTGGTT CTCGAACTCC TGACCTCAGG TGTTCCACCT 2880 

ACCTCAGCCT CCCAAAGTGC TGGGGTTACA GGTGTGAGCC ATCGCGCCTG GCCAGGACCT 2 940 

TTGTTTCTTA TCTACATATT GGAAGATTTG GTCCTGATGT CCTTTGAGGC TTCTTTAGCT 3000 

CTAGTTCTCT GACACTT CAG CCTATAT CAC AGCTAACTTC YTCAGTCTCA TCTATTCCTT 3060 

ATGCTCCAGC CCCTGGCAAT TTGCCTCAAG ATGGGGGTTT GAAAATAACT TTACCTGACT 3120 

CAAGGAGTGT CTGGAGCACC TCCTAGTCTA AGTCTGCAAG CTCCAGTTCT TGCCTAAAAC 3180 

CATGCCAGTG GCCACCCTTG GGCTCAGACA GCTCTGGGCC TTTTGACCAC AAGCCAGCCC 3240 

CTCGCCCTCT CTGTGGCATA GTCTTCTCTG CCCCAGGACT GCAGGGCGGC TTCCTCCAAG 3300 

GCTTCCAAGG CTCAAAAGAA ATTTGGCTCC AT CCAAGAAG GCTCCAGCTC CCCTACTGGC 3360 

CCCTGGCTTC AGGCCCACAC CCCTGGGCCA GGSCCAGAGA GTGTGTCTCA GGAGAATT CA 3420 

ATGGGCTCTA GAGAGACACA CAGAAAGTTT GGGCATTTGG GAAATTTTCA AGGRTGTATG 3480 
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TATGGYTCAC GTATGGWGCA GGTTGTCCTG GTCCYKGGGT GCAGGGAAGT GGGCTGCAGG 3540 

GAAGTGGATT GGAGGGGAGC TTGAGGAATA TAAGGAGCGG GGGTGGAGAC TCAGGCTATG 3 600 

GACAAGGACA GCCCCAAGGT TGGGAAGACC TGGCCTTAGT CGTCCTCAGC CTAGGGCAGG 3660 

GCAGTGAAGA AAGCTCTCCC CGCTCCTGCT GTAATGACCC AGAGTAGCCT CCCCAGGCCG 3720 

GCATCTTATG TGTGTCTTCC ACCATCCTCA TGGTGGCACT TTTCTAGGCC TGTCTCCCAG 3780 

CATTGTGCAA GGCTCGGAAG AGAACCAGGA AGTGAAACTG GGTGAAAACA GAAAGCTCAA 3 840 

TGGATGGGCT AGGTT CCCAG AT CAT T AGGG CAGAGTTTGC ACGTCCTCTG GTTCACTGGG 3 900 

AATCCACCCA GCCCACGAAT CATCTCCCTC TTTGAAGGAT TTTWATTTCT ACTGGGTTTT 3 960 

GGAACAAACT CCTGCTGAGA CCCCACAGCC AGAAACTGAA AGCAGCAGCT CCCCAAAGCC 4020 

TGGAAAATCC CTAAGAGAAG GCCTGGGGGA MAGGAAKTGG AGTGACAGGG GACAGGTAGA 4080 

GAGAAGGGGG CCCAATGGCC AGGGAGTGAA GGAGGTGGCG TTGCTGAGAG CAGTCTGCAC 4140 

ATGCTTCTGT CTGAGTGCAG GAAGGTGTTC CAGGGTCGAA ATTACACTTC TCGTACCTGG 420 0 

AGACGCTGTT TGTGGGAGCA CTGGGCTCAT GCCTGGCACA CAATAGGTCT GCAATAAACC 4260 
ATGGTTAAAT CCTGAAAAAA AAAAAAAAA 

Seq ID NO: 333 Protein sequence 
Protein Accession #: NP_000011 



1 11 21 31 41 51 

I I I I I I 

MTLGSPRKGL LMLLMALVTQ GDPVKPSRGP LVTCTCESPH CKGPTCRGAW CTWLVREEG 60 

RHPQEHRGCG NLHRELCRGR PTEFVNHYCC DSHLCNHNVS LVLEATQPPS EQPGTDGQLA 12 0 

LILGPVLALL ALVALGVLGL WHVRRRQEKQ RGLHSELGES SLILKASEQG DTMLGDLLDS 180 

DCTTGSGSGL PFLVQRTVAR Q VALVE CVGK GRYGEVWRGL WHGESVAVKI FSSRDEQSWF 240 

RETEIYNTVL LRHDNILGFI ASDMTSRNSS TQLWLITHYH EHGSLYDFLQ RQTLEPHLAL 3 00 

RLAVSAACGL AHLHVE I FGT QGKPAIAHRD FKSRNVLVKS NLQCCIADLG LAVMHSQGSD 3 60 

YLDIGNNPRV GTKRYMAPEV LDEQIRTDCF ESYKWTDIWA FGLVLWEIAR RTIVNGIVED 420 

YRPPFYDWP NDPSFEDMKK WCVDQQTPT IPNRLAADPV LSGLAQMMRE CWYPNPSARL 480 
TALRIKKTLQ KISNSPEKPK VIQ 

Seq ID NO: 334 DNA sequence 

Nucleic Acid Accession #: NM_004126.1 

Coding sequence: 10 8-329 

1 11 21 31 41 51 

11)111 

GGCACGAGCT CGTGCCGGCC TTCAGTTGTT TCGGGACGCG CCGAGCTTCG CCGCTCTTCC 60 
AGCGGCTCCG CTGCCAGAGC TAGCCCGAGC CCGGTTCTGG GGCGAAAATG CCTGCCCTTC 120 
ACATCGAAGA TTTGCCAGAG AAGGAAAAAC TGAAAATGGA AGTTGAGCAG CTTCGCAAAG 180 
AAGTGAAGTT GCAGAGACAA CAAGTGTCTA AATGTTCTGA AGAAATAAAG AACTATATTG 240 
AAGAACGTTC TGGAGAGGAT CCTCTAGTAA AGGGAATTCC AGAAGACAAG AACCCCTTTA 3 00 
AAGAAAAAGG CAGCTGTGTT ATTTCATAAA TAACTTGGGA GAAACTGCAT CCTAAGTGGA 360 
AGAACTAGTT TGTTTTAGTT TTCCCAGATA AAACCAACAT GCTTTTTAAG GAAGGAAGAA 420 
TGAAAT TAAA AGGAGACTTT CTTAAGCACC ATATAGATAG GGTTATGTAT AAAAGCATAT 480 
GTGCTACTCA TCTTTGCTCA CTATGCAGTC TTTTTTAAGA GAGCAGAGAG TATCAGATGT 540 
ACAATTATGG AAATAAGAAC ATTACTTGAG CATGACACTT CTTTCAGTAT ATTGCTTGAT 6 00 
GCTTCAAATA AAGTTTTGTC TT 

Seq ID NO: 3 35 Protein sequence 
Protein Accession #: NP_004117.1 

1 11 21 31 41 51 

I I I I I I 

MPALHIEDLP EKEKLKMEVE QLRKEVKLQR QQVSKCSEEI KNYIEERSGE DPLVKGIPED 60 
KNPFKEKGSC VIS 



Seq ID NO: 336 DNA sequence 
Nucleic Acid Accession #: NM_005795 
Coding sequence: 555-1940 

1 11 21 31 41 51 
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GCACGAGGGA ACAACCTCTC TCTCTSCAGC AGAGAGTGTC ACCTCCTGCT TTAGGACCAT 60 

CAAGCTCTGC TAACTGAATC TCAT CCTAAT TGCAGGATCA CATTGCAAAG CTTTCACTCT 120 

TTCCCACCTT GCTTGTGGGT AAATCTCTTC TGCGGAATCT CAGAAAGTAA AGTTCCATCC 180 

TGAGAATATT TCACAAAGAA TTTCCTTAAG AGCTGGACTG GGTCTTGACC CCTGGAATTT 240 

AAGAAATTCT TAAAGACAAT GTCAAATATG ATCCAAGAGA AAATGTGATT TGAGTCTGGA 300 

GACAATTGTG CATATCGTCT AATAATAAAA ACCCATACTA GCCTATAGAA AACAATATTT 360 

GAATAATAAA AACCCATACT AGCCTATAGA AAACAATATT TGAAAGATTG CTACCACTAA 42 0 

AAAGAAAACT ACTACAACTT GACAAGACTG CTGCAAACTT CAATTGGTCA CCACAACTTG 480 

ACAAGGTTGC TATAAAACAA GATTGCTACA ACTTCTAGTT TATGTTATAC AGCATATTTC 540 

ATTTGGGCTT AATGATGGAG AAAAAGTGTA CCCTGTATTT TCTGGTTCTC TTGCCTTTTT 600 

TTATGATTCT TGTTACAGCA GAATTAGAAG AGAGTCCTGA GGACTCAATT CAGTTGGGAG 660 

TTACTAGAAA TAAAATCATG ACAGCTCAAT ATGAATGTTA CCAAAAGATT ATGCAAGACC 720 

CCATTCAACA AGCAGAAGGC GTTTACTGCA ACAGAACCTG GGATGGATGG CTCTGCTGGA 780 

ACGATGTTGC AGCAGGAACT GAAT CAATGC AGCTCTGCCC TGATTACTTT CAGGACTTTG 840 

ATCCATCAGA AAAAGTTACA AAGATCTGTG ACCAAGATGG AAACTGGTTT AGACATCCAG 900 

CAAGCAACAG AACATGGACA AATTATACCC AGTGTAATGT TAACACCCAC GAGAAAGTGA 960 

AGACTGCACT AAATTTGTTT TACCTGACCA TAATTGGACA CGGATTGTCT ATTGCATCAC 102 0 

TGCTTATCTC GCTTGGCATA TTCTTTTATT TCAAGAGCCT AAGTTGCCAA AGGATTACCT 10 80 

TACACAAAAA TCTGTTCTTC TCATTTGTTT GTAACTCTGT TGTAACAATC ATTCACCTCA 1140 

CTGCAGTGGC CAACAACCAG GCCTTAGTAG CCACAAATCC TGTTAGTTGC AAAGTGTCCC 1200 

AGTTCATTCA TCTTTACCTG ATGGGCTGTA ATTACTTTTG GATGCTCTGT GAAGGCAT TT 12 60 

ACCTACACAC ACTCATTGTG GTGGCCGTGT TTGCAGAGAA GCAACATTTA ATGTGGTATT 1320 

ATTTTCTTGG CTGGGGATTT CCACTGATTC CTGCTTGTAT ACATGCCATT GCTAGAAGCT 1380 
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TATATTACAA TGACAATTGC TGGAT CAGTT CTGATACCCA TCTCCTCTAC ATTATCCATG 1440 

GCCCAATTTG TGCTGCTTTA CTGGTGAATC TTTTTTTCTT GTTAAATATT GTACGCGTTC 1S00 

TCATCACCAA GTTAAAAGTT ACACACCAAG CGGAATCCAA TCTGTACATG AAAGCTGTGA 1560 

GAGCTACTCT TATCTTGGTG CCATTGCTTG GCATTGAATT TGTGCTGATT CCATGGCGAC 1620 

CTGAAGGAAA GATTGCAGAG GAGGTATATG ACTACATCAT GCACATCCTT ATGCACTTCC 1680 

AGGGTCTTTT GGTCTCTACC ATTTTCTGCT TCTTTAATGG AGAGGTTCAA GCAATTCTGA 1740 

GAAGAAACTG GAATCAATAC AAAATCCAAT TTGGAAACAG CTTTTCCAAC TCAGAAGCTC 1800 

TTCGTAGTGC GTCTTACACA GTGTCAACAA TCAGTGATGG TCCAGGTTAT AGT CATG ACT 1860 

GTCCTAGTGA ACACTTAAAT GGAAAAAGCA TCCATGATAT TGAAAATGTT CTCTTAAAAC 1920 

CAGAAAATTT ATATAATTGA AAATAGAAGG ATGGTTGTCT CACTGTTTGG TGCTTCTCCT 1980 

AACTCAAGGA CTTGGACCCA TGACTCTGTA GCCAGAAGAC TTCAATATTA AATGACTTTG 2040 

GGGAATGTCA TAAAGAAGAG CCTTCACATG AAATTAGTAG TGTGTTGATA AGAGTGTAAC 2100 

ATCCAGCTCT ATGTGGGAAA AAAGAAATCC TGGTTTGTAA TGTTTGTCAG TAAATACTCC 2160 

CACTATGCCT GATGTGACGC TACTAACCTG ACATCACCAA GTGTGGAATT GGAGAAAAGC 222 0 

ACAATCAACT TTTCTGAGCT GGTGTAAGCC AGTTCCAGCA CACCATTGAT GAATTCAAAC 2280 

AAATGGCTGT AAAACTAAAC ATACATGTTG GGCATGATTC TACCCTTATT CSCCCCAAGA 2340 

GACCTAGCTA AGGTCTATAA ACATGAAGGG AAAATTAGCT TTTAGTTTTA AAACTCTTTA 2400 

TCCCATCTTG ATTGGGGCAG TTGACTTTTT TTTTTTCCCA GAGTGCCGTA GTCCTTTTTG 2460 

TAACTACCCT CTCAAATGGA CAATACCAGA AGTGAATTAT CCCTGCTGGC TTTCTTTTCT 2520 

CTATGAAAAG CAACTGAGTA CAATTGTTAT GATCTACTCA TTTGCTGACA CATCAGTTAT 2580 

ATCTTGTGGC ATAT CCATTG TGGAAACTGG ATGAACAGGA TGTATAATAT G CAATCTT AC 2640 

TTCTATATCA TTAGGAAAAC ATCTTAGTTG ATGCTACAAA ACACCTTGTC AACCTCTTCC 2700 

TGTCTTACCA AACAGTGGGA GGGAATTCCT AGCTGTAAAT ATAAATTTTG CCCTTCCATT 2760 

T CTACTGT AT AAACAAATTA GCAATCATTT TATATAAAGA AAATCAATGA AGGATTTCTT 2820 

ATTTTCTTGG AATTTTGTAA AAAGAAATTG TGAAAAATGA GCTTGTAAAT ACTCCATTAT 2880 

TTTATTTTAT AGTCTCAAAT CAAATACATA CAACCTATGT AATTTTTAAA G CAAAT AT AT 2 940 

AATGCAACAA TGTGTGTATG TTAATATCTG ATACTGTATC TGGGCTGATT TTTTAAATAA 3000 
AATAGAGTCT GGAATGCT 

Seq ID NO: 337 protein sequence 
Protein Accession #: NP_005786.1 

1 11 21 31 41 51 
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MEKKCTLYFL VLLPFFMILV TAELEESPED SIQLGVTRNK IMTAQYECYQ KIMQDPIQQA 60 

EGVYCNRTWD GWLCWNDVAA GTESMQLCPD YFQDFDPSEK VTKICDQDGN WFRHPASNRT 120 

WTNYTQCNVN THEKVKTALN LFYLTIIGHG LSIASLLISL GIFFYFKSLS CQRITLHKNL 180 

FFSFVCNSW T I I HLTAVAN NQALVATNPV SCKVSQFIHL YLMGCNYFWM LCEGIYLHTL 240 

IWAVFAEKQ HLMWYYFLGW GFPLIPACIH AIARSLYYND NCWISSDTHL LYIIHGPICA 300 

ALLVNLFFLL NIVRVLITKL KVTHQAESNL YMKAVRATLI LVPLLGIEFV LIPWRPEGKI 360 

AEEVYDYIMH ILMHFQGLLV STIFCFFNGE VQAILRRNWN QYKIQFGNSF SNSEALRSAS 42 0 
YTVSTISDGP GYSHDCPSEH LNGKSIHDIE NVLLKPENLY N 

Seq ID NOi 338 DNA sequence 
Nucleic Acid Accession #: NM_001795 
Coding sequence: 2 5-2379 

1 11 21 31 41 51 
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GCACGAT CTG TTCCTCCTGG GAAGATGCAG AGGCTCATGA TGCTCCTCGC CACATCGGGC 60 

GCCTGCCTGG GCCTGCTGGC AGTGGCAGCA GTGGCAGCAG CAGGTGCTAA CCCTGCCCAA 120 

CGGGACACCC ACAGCCTGCT GCCCACCCAC CGGCGCCAAA AGAGAGATTG GATTTGGAAC 180 

CAGATGCACA TTGATGAAGA GAAAAACACC TCACTTCCCC ATCATGTAGG CAAGATCAAG 240 

TCAAGCGTGA GTCGCAAGAA TGCCAAGTAC CTGCTCAAAG GAGAATATGT GGGCAAGGTC 3 00 

TTCCGGGTCG ATGCAGAGAC AGGAGACGTG TTCGCCATTG AGAGGCTGGA CCGGGAGAAT 3 60 

ATCTCAGAGT ACCACCTCAC TGCTGTCATT GTGGACAAGG ACACTGGTGA AAACCTGGAG 420 

ACTCCTTCCA GCTTCACCAT CAAAGTTCAT GACGTGAACG ACAACTGGCC TGTGTTCACG 480 

CATCGGTTGT TCAATGCGTC CGTGCCTGAG TCGTCGGCTG TGGGGACCTC AGTCATCTCT 540 

GTGACAGCAG TGGATGCAGA CGACCCCACT GTGGGAGACC ACGCCTCTGT CATGTACCAA 600 

ATCCTGAAGG GGAAAGAGTA TTTTGCCATC GATAATTCTG GACGTATTAT CACAATAACG 660 

AAAAGCTTGG ACCGAGAGAA GCAGGCCAGG TATGAGATCG TGGTGGAAGC GCGAGATGCC 720 

CAGGGCCTCC GGGGGGACTC GGGCACGGCC ACCGTGCTGG TCACTCTGCA AGACAT CAAT 780 

GACAACTTCC CCTTCTTCAC CCAGACCAAG TACACATTTG TCGTGCCTGA AGACACCCGT 840 

GTGGGCACCT CTGTGGGCTC TCTGTTTGTT GAGGACCCAG ATGAGCCCCA GAACCGGATG 900 

ACCAAGTACA GCATCTTGCG GGGCGACTAC CAGGACGCTT TCACCATTGA GACAAACCCC 960 

GCCCACAACG AGGGCATCAT CAAGCCCATG AAGCCTCTGG ATTATGAATA CAT C C AG C AA 1020 

T AC AG C T T C A TCGTCGAGGC CACAGACCCC ACCATCGACC TCCGATACAT GAGCCCTCCC 10 80 

GCGGGAAACA GAG CCCAGGT CATTATCAAC ATCACAGATG TGGACGAGCC CCCCATTTTC 1140 

CAGCAGCCTT TCTACCACTT CCAGCTGAAG GAAAACCAGA AGAAGCCTCT GATTGGCACA 12 00 

GTGCTGGCCA TGGACCCTGA TGCGGCTAGG CATAGCATTG GATACTCCAT CCGCAGGACC 12 60 

AGTGACAAGG GCCAGTTCTT CCGAGTCACA AAAAAGGGGG ACATTTACAA TGAGAAAGAA 132 0 

CTGGACAGAG AAGTCTACCC CTGGTATAAC CTGACTGTGG AGGCCAAAGA ACTGGATTCC 13 80 

ACTGGAACCC CCACAGGAAA AGAATCCATT GTGCAAGTCC ACATTGAAGT TTTGGATGAG 1440 

AATGACAATG CCCCGGAGTT TGCCAAGCCC TACCAGCCCA AAGTGTGTGA GAACGCTGTC 1500 

CATGGCCAGC TGGTCCTGCA GATCTCCGCA ATAGACAAGG ACATAACACC ACGAAACGTG 1560 

AAGT T CAAAT TCACCTTGAA TACTGAGAAC AACTTTACCC TCACGGATAA TCACGATAAC 162 0 

ACGGCCAACA TCACAGTCAA GTATGGGCAG TTTGACCGGG AGCATACCAA GGTCCACTTC 1680 

CTACCCGTGG TCATCTCAGA CAATGGGATG CCAAGTCGCA CGGGCACCAG CACGCTGACC 1740 

GTGGCCGTGT GCAAGTGCAA CGAGCAGGGC GAGTTCACCT TCTGCGAGGA TATGGCCGCC 1800 

CAGGTGGGCG TG AG CAT CCA GGCAGTGGTA GCCATCTTAC TCTGCATCCT CACCATCACA 1860 

GTGATCACCC TGCTCATCTT CCTGCGGCGG CGGCTCCGGA AGCAGGCCCG CGCGCACGGC 192 0 

AAGAGCGTGC CGGAGATCCA CGAGCAGCTG GTCACCTACG ACGAGGAGGG CGG CGGCG AG 1980 

ATGGACACCA CCAGCTACGA TGTGTCGGTG CTCAACTCGG TGCGCCGCGG CGGGGCCAAG 2 040 

CCCCCGCGGC CCGCGCTGGA CGCCCGGCCT TCCCTCTATG CGCAGGTGCA GAAGCCACCG 2100 

AGGCACGCGC CTGGGGCACA CGGAGGGCCC GGGGAGATGG CAGCCATGAT CGAGGTGAAG 2160' 

AAGGACGAGG CGGACCACGA CGGCGACGGC CCCCCCTACG ACACGCTGCA CAT CTACGGC 222 0 

TACGAGGGCT CCGAGT CC AT AGCCGAGTCC CTCAGCTCCC TGGG CACCGA CTCATCCGAC 22 80 
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TCTGACGTGG ATTACGACTT CCTTAACGAC TGGGGACCCA GGTTTAAGAT GCTGGCTGAG 2340 

CTGTACGGCT CGGACCCCCG GGAGGAGCTG CTGTATTAGG CGGCCGAGGT CACTCTGGGC 2400 

CTGGGGACCC AAACCCCCTG CAGCCCAGGC CAGTCAGACT CCAGGCACCA CAGCCTCCAA 2460 

AAATGG CAGT GACTCCCCAG CCCAGCACCC CTTCCTCGTG GGTCCCAGAG ACCTCATCAG 2 520 

CCTTGGGATA GCAAACTCCA GGTTCCTGAA ATATCCAGGA ATATATGTCA GTGATGACTA 2580 

TTGTCAAATG CTGGCAAATC CAGGCTGGTG TTCTGTCTGG GCTCAGACAT CCACATAACC 2640 

CTGTCACCCA CAGACCGCCG TCTAACTCAA AGACTTCCTC TGGCTCCCCA AGGCTGCAAA 270 0 

GCAAAACAGA CTGTGTTTAA CTGCTGCAGG GTCTTTTTCT AGGGTCCCTG AACGCCCTGG 2760 

TAAGGCTGGT GAGGTCCTGG TGCCTATCTG CCTGGAGGCA AAGGCCTGGA CAGCTTGACT 2 82 0 

TGTGGGGCAG GATTCTCTGC AGCCCATTCC CAAGGGAGAC TGACCATCAT GCCCTCTCTC 2880 

GGGAGCCCTA GCCCTGCTCC AACTCCATAC TCCACTCCAA GTGCCCCACC ACTCCCCAAC 2940 

CCCTCTCCAG GCCTGTCAAG AGGGAGGAAG GGGCCCCATG GCAGCTCCTG ACCTTGGGTC 3000 

CTGAAGTGAC CTCACTGGCC TGCCATGCCA GTAACTGTGC TGTACTGAGC ACTGAACCAC 3060 

ATTCAGGGAA ATGCTTATTA AACCTTGAAG CAACTGTGAA TTCATTCTGG AGGGGCAGTG 3120 

GAGATCAGGA GTGACAGATC ACAGGGTGAG GGCCACCTCC ACACCCACCC CCTCTGGAGA 3180 

AGGCCTGGAA GAGCTGAGAC CTTGCTTTGA GACTCCTCAG CACCCCTCCA GTTTTGCCTG 3240 

AGAAGGGGCA GATGTTCCCG GAGATCAGAA GACGTCTCCC CTTCTCTGCC TCACCTGGTC 33 00 

GCCAATCCAT GCTCTCTTTC TTTTCTCTGT CTACTCCTTA TCCCTTGGTT TAGAGGAACC 3360 

CAAGATGTGG CCTTTAGCAA AACTGACAAT GTCCAAACCC ACTCATGACT GCATGACGGA 3420 

GCCGAGCATG TGTCTTTACA CCTCGCTGTT GTCACATCTC AGGGAACTGA CCCTCAGGCA 3480 

CACCTTGCAG AAGGAAGGCC CTGCCCTGCC CAACCTCTGT GGTCACCCAT GCATCATTCC 3540 

ACTGGAACGT TTCACTGCAA ACACACCTTG G AG AAGTGG C ATCAGTCAAC AGAGAGGGGC 3600 

AGGGAAGGAG ACACCAAGCT CACCCTTCGT CATGGACCGA GGTTCCCACT CTGGCAAAGC 3660 

CCCTCACACT GCAAGGGATT GTAGATAACA CTGACTTGTT TGTTTTAACC AATAACTAGC 372 0 

TTCTTATAAT GATTTTTTTA CTAATGATAC TTACAAGTTT CTAGCTCTCA CAGACATATA 3 780 

GAATAAGGGT TTTTGCATAA TAAGCAGGTT GTTATTTAGG TTAACAATAT TAATTCAGGT 3840 

TTTTTAGTTG GAAAAACAAT TCCTGTAACC TTCTATTTTC TATAATTGTA GTAATTGCTC 3 900 

TACAGATAAT GT CT AT AT AT TGGCCAAACT GGTGCATGAC AAGTACTGTA TTTTTTTATA 3 960 
CCTAAATAAA GAAAAATCTT TAGCCTGGGC AACAAAAAAA 

Seq ID NOi 339 Protein sequence 
Protein Accession #: NP_001786 

1 11 21 31 41 51 

1 I I I 1 1 

MQRLMMLLAT SGACLGLLAV AAVAAAGANP AQRDTHSLLP THRRQKRDWI WNQMHIDEEK 60 

NT S L PHHVGK IKSSVSRKNA KYLLKGEYVG KVFRVDAETG DVFAIERLDR ENISEYHLTA 120 

VIVDKDTGEN LETPSSFTIK VHDVNDNWPV FTHRLFNASV PESSAVGTSV I SVTAVDADD 180 

PTVGDHASVM YQILKGKEYF AIDNSGRIIT ITKSLDREKQ AR YE I WEAR DAQGLRGDSG 240 

TATVLVTLQD INDNFPFFTQ TKYTFWPED TRVGTSVGSL FVEDPDEPQN RMTKYSILRG 300 

DYQDAFTIET NPAHNEGIIK PMKPLDYEYI QQYSFIVEAT DPTIDLRYMS PPAGNRAQVI 3 60 

INITDVDEPP IFQQPFYHFQ LKENQKKPLI GT VL AMD PDA ARHSIGYSIR RTSDKGQFFR 420 

VTKKGDIYNE KELDREVYPW YNLTVEAKEL DSTGTPTGKE SIVQVHIEVL DENDNAPEFA 480 

KPYQPKVCEN AVHGQLVLQI SAIDKDITPR NVKFKFTLNT ENNFTLTDNH DNTANITVKY 540 

GQFDREHTKV HFLPWI SDN GMPSRTGTST LTVAVCKCNE QGEFTFCEDM AAQVGVSIQA 600 

WAILLCILT ITVITLLIFL RRRLRKQARA HGKSVPEIHE QLVTYDEEGG GEMDTTSYDV 6 60 

SVLNSVRRGG AKPPRPALDA RPSIiYAQVQK PPRHAPGAHG GPGEMAAMIE VKKDEADHDG 720 

DGPPYDTLHI YGYEGSESIA ESLSSLGTDS SDSDVDYDFL NDWGPRFKML AELYGSDPRE 780 
ELLY 



Seq ID NO: 340 DNA sequence 
Nucleic Acid Accession #: NM_003088 
Coding sequence: 112-1593 

1 11 21 31 41 51 

I I I I I I 

GCGGAGGGTG CGTGCGGGCC GCGGCAGCCG AACAAAGGAG CAGGGGCGCC GCCGCAGGGA 60 

CCCGCCACCC ACCTCCCGGG GCCGCGCAGC GGCCTCTCGT CTACTGCCAC CATGACCGCC 12 0 

AACGGCACAG CCGAGGCGGT GCAGATCCAG TTCGGCCTCA TCAACTGCGG CAACAAGTAC 180 

CTGACGGCCG AGGCGTTCGG GTTCAAGGTG AACGCGTCCG CCAGCAGCCT GAAGAAGAAG 2 40 

CAGATCTGGA CGCTGGAGCA GCCCCCTGAC GAGGCGGGCA GCGCGGCCGT GTGCCTGCGC 300 

AGCCACCTGG GCCGCTACCT GGCGGCGGAC AAGGACGGCA ACGTGACCTG CGAGCGCGAG 3 60 

GTGCCCGGTC CCGACTGCCG TTTCCTCATC GTGGCGCACG ACGACGGTCG CTGGTCGCTG 420 

CAGTCCGAGG CGCACCGGCG CTACTTCGGC GGCACCGAGG ACCGCCTGTC CTGCTTCGCG 480 

CAGACGGTGT CCCCCGCCGA GAAGTGGAGC GTGCACATCG CCATGCACCC TCAGGTCAAC 540 

AT CTACAGTG TCACCCGTAA GCGCTACGCG CACCTGAGCG CGCGGCCGGC CGACGAGATC 600 

GCCGTGGACC GCGACGTGCC CTGGGGCGTC GACTCGCTCA TCACCCTCGC CTTCCAGGAC 660 

CAGCGCTACA GCGTGCAGAC CGCCGACCAC CGCTTCCTGC GCCACGACGG GCGCCTGGTG 720 

GCGCGCCCCG AGCCGGCCAC TGGCTACACG CTGGAGTTCC GCTCCGGCAA GGTGGCCTTC 7 80 

CGCGACTGCG AGGGCCGTTA CCTGGCGCCG TCGGGGCCCA GCGGCACGCT CAAGGCGGGC 840 

AAGGCCACCA AGGTGGGCAA GGACGAGCTC TTTGCTCTGG AGCAGAGCTG CGCCCAGGTC 90 0 

GTGCTGCAGG CGGCCAACGA GAGGAACGTG TCCACGCGCC AGGGTATGGA CCTGTCTGCC 960 

AATCAGGACG AGGAGACCGA CCAGGAGACC TTCCAGCTGG AGAT CGACCG CGACACCAAA 102 0 

AAGTGTGCCT TCCGTACCCA CACGGGCAAG TACTGGACGC TGACGGCCAC CGGGGGCGTG 1080 

CAGTCCACCG CCTCCAGCAA GAATGCCAGC TGCTACTTTG ACATCGAGTG GCGTGACCGG 1140 

CGCATCACAC TGAGGGCGTC CAATGGCAAG TTTGTGACCT CCAAGAAGAA TGGGCAGCTG 12 0 0 

GCCGCCTCGG TGGAGACAGC AGGGGACTCA GAGCTCTTCC TCATGAAGCT CATCAACCGC 12 60 

CCCATCATCG TGTTCCGCGG GGAGCATGGC TTCATCGGCT GCCGCAAGGT CACGGGCACC 1320 

CTGGACGCCA ACCGCTCCAG CTATGACGTC TTCCAGCTGG AGTTCAACGA TGGCGCCTAC 13 80 

AACATCAAAG ACTCCACAGG CAAATACTGG ACGGTGGGCA GTGACTCCGC GGTCACCAGC 1440 

AGCGGCGACA CTCCTGTGGA CTTCTTCTTC GAGTTCTGCG ACTATAACAA GGTGGCCATC 1500 

AAGGTGGGCG GGCGCTACCT GAAGGGCGAC CACGCAGGCG TCCTGAAGGC CTCGGCGGAA 1560 

ACCGTGGACC CCGCCTCGCT CTGGGAGTAC TAGGGCCGGC CCGTCCTTCC CCGCCCCTGC 1620 

CCACATGGCG GCTCCTGCCA ACCCTCCCTG CTAACCCCTT CTCCGCCAGG TGGGCTCCAG 1680 

GGCGGGAGGC AAGCCCCCTT GCCTTTCAAA CTGGAAACCC CAGAGAAAAC GGTGCCCCCA 1740 

CCTGTCGCCC CTATGGACTC CCCACTCTCC CCTCCGCCCG GGTTCCCTAC TCCCCTCGGG 1800 
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TCAGCGGCTG CGGCCTGGCC CTGGGAGGGA TTTCAGATGC CCCTGCCCTC TTGTCTGCCA 1860 

CGGGGCGAGT CTGGCACCTC TTTCTTCTGA CCTCAGACGG CTCTGAGCCT TATTTCTCTG 192 0 

GAAGCGGCTA AGGGACGGTT GGGGGCTGGG AGCCCTGGGC GTGTAGTGTA ACTGGAATCT 1980 

TTTGCCTCTC CCAGCCACCT CCTCCCAGCC CCCCAGGAGA GCTGGGCACA TGT CCCAAGC 2040 

CTGTCAGTGG CCCTCCCTGG TGCACTGTCC CCGAAACCCC TGCTTGGGAA GGGAAGCTGT 2100 

CGGGAGGGCT AGGACTGACC CTTGTGGTGT TTTTTTGGGT GGTGGCTGGA AACAGCCCCT 2160 

CTCCCACGTG GGAGAGGCTC AGCCTGGCTC CCTTCCCTGG AGCGGCAGGG CGTGACGGCC 2 22 0 

ACAGGGTCTG CCCGCTGCAC GTTCTGCCAA GGTGGTGGTG GCGGGCGGGT AGGGGTGTGG 2280 

GGGCCGTCTT CCTCCTGTCT CTTTCCTTTC ACCCTAGCCT GACTGGAAGC AGAAAATGAC 2 340 

CAAATCAGTA TTTTTTTTAA TGAAATATTA TTGCTGGAGG CGTCCCAGGC AAGCCTGGCT 240 0 

GTAGTAGCGA GTGATCTGGC GGGGGGCGTC TCAGCACCCT CCCCAGGGGG TGCATCTCAG 2460 

CCCCCTCTTT CCGTCCTTCC CGTCCAGCCC CAGCCCTGGG CCTGGGCTGC CGACACCTGG 2 520 

GCCAGAGCCC CTGCTGTGAT TGGTGCTCCC TGGGCCTCCC GGGTGGATGA AGCCAGGCGT 2 580 

CGCCCCCTCC GGGAGCCCTG GGGTGAGCCG CCGGGGCCCC CCTGCTGCCA GCCTCCCCCG 2 640 

TCCCCAACAT GCATCTCACT CTGGGTGTCT TGGTCTTTTA TTTTTTGTAA GTGTCATTTG 2 700 

TATAACTCTA AACGCCCATG ATAGTAGCTT CAAACTGGAA ATAGCGAAAT AAAATAACTC 2 760 
AGTCTGC 

Seq ID NO: 341 Protein sequence 
Protein Accession #: NP_003079 



PCT/US02/12476 



MTANGTAEAV 
CLRSHLGRYL 
CFAQTVSPAE 
FQDQRYSVQT 
KAGKATKVGK 
DTKKCAFRTH 
GQLAASVETA 
GAYNIKDSTG 
SAETVDPASL 



11 
I 

QIQFGLINCG 
AADKDGNVTC 
KWSVHIAMHP 
ADHRFLRHDG 
DELFALEQSC 
TGKYWTLTAT 
GDSELFLMKL 
KYWTVGSDSA 
WEY 



21 
I 

NKYLTAEAFG 
EREVPGPDCR 
QVNIYSVTRK 
RLVARPEPAT 
AQWLQAANE 
GGVQSTASSK 
INRPIIVFRG 
VTSSGDTPVD 



31 
I 

FKVNASASSL 
FL I VAHDDGR 
RYAHLSARPA 
GYTLEFRSGK 
RNVSTRQGMD 
NASCYFDIEW 
EHGFIGCRKV 
FFFEFCDYNK 



41 
I 

KKKQIWTLEQ 
WSLQSEAHRR 
DEIAVDRDVP 
VAFRDCEGRY 
LSANQDEETD 
RDRRITLRAS 
TGTLDANRSS 
VAIKVGGRYL 



51 

I 

PPDEAGSAAV 
YFGGTEDRLS 
WGVDSLITLA 
LAPSGPSGTL 
QETFQLEIDR 
NGKFVTSKKN 
YDVFQLEFND 
KGDHAGVLKA 



Seq ID NO: 342 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 
Coding sequence : 660 1705 



CGCTCCGCAC 
GGGATTCTTG 
CCGGGTTGGG 
CTCGAGCGGG 
CCGGCGTTCG 
CACCCCACTG 
GCTACCATGA 
ACCGTGCGTG 
CCCGGAGGGG 
TGCACCCTGG 
CTCGAAAGCG 
TGCGCGGTAC 
ACCTGCGCGC 
GCCCCGGGGC 
TGGACTTCAG 
CAGTTACTTG 
TGTGTCCCTG 
TAGACGACTT 
GCCGCTCTTG 
CCAGGCGCCC 
TCGACGAGAA 
TTCCTGAGAT 
AAGCCGAGTC 
CGACTTCCTC 
TGAGCACAGC 
TCTGCTTTCA 
TGGAGAGTGA 
GGGTGAAAGT 
CCCCTCTTGG 



11 
1 

ACATTTCCTG 
GCAGTTGGGG 
GAAGCCAGCT 
ACAGATCCAA 
CCCTGTGCCT 
CCGACCGTGC 
AGCGGCAGGC 
CGGGCGCCGA 
GCTCCAAAGA 
AGAACGAGCC 
ACACGCTGCA 
TCCAGGCCAC 
CAACGGCTAC 
CGCCTCTAAC 
TCCACCTGGG 
CATCGCGGAC 
CCCCGGGAGG 
GGGAGGCTTT 
TGTGACCAGT 
GCCGGCCACT 
GCTGGGAGAG 
TCCTCGATGG 
AAAGGCCACT 
TGCCACTCCT 
AGTAGTAGTG 
CGAAAGCCCC 
TCCTGAGCCC 
CGGGGACTGT 
CTCTAGTGAT 



21 
I 

TCGCGGCCTA 
GGTCCGTCGG 
GTAGAGGGCG 
GTTGGGAGCA 
CCTCTGGCAG 
TGGCTGCTCG 
GGCCGAGGAG 
GCTGCGCGCT 
CCTGCTGTTC 
TTTGCGGGGT 
GTGGGTGGAG 
CGGTGGGGTC 
CTGTGCAAGT 
TTGAGCTATC 
ACCGAGGTGA 
GAAATCGGCG 
TACCTCCGTG 
GCCTGCGAAT 
GGGGAAGGAC 
GCAACCAGCC 
ACACCACTTG 
GGATCACAGA 
ATCACCCCAT 
CAGGCTTTCG 
TTGGTGATCT 
TCTTCCCAGC 
GCTGCTTTGG 
GATCTGCGGG 
GCATAG 



31 

I 

AGGGAAACTG 
GAGCGAGGGC 
GTGACCGCGC 
GCTCTGCGTG 
GCGCTCTGGC 
GCCTCGGGGG 
GCCTGCATCC 
GTGCTCGCGC 
TGGGTCGCAC 
TTCTCCTGGC 
GAGCCCCAAC 
GAGCCCGCAG 
ACCAGTTTGA 
GCGCGCCCTT 
GTGCGCTCTG 
CTCGCTGGGA 
CTGGCAAATG 
GTGCTACGGG 
AGCCGACCCT 
CCGTGCCGCA 
TCCCTGAACA 
GCACGATGTC 
CAGGGAGCGT 
ACTCCTCCTC 
TGACCATGAC 
CAAGGAAGGA 
GCTCCAGTTC 
ACAGAGCAGA 



41 
1 

TTGGCCGCTG 
GGAGGGGAAG 
TCCAGACACA 
CGGGGCCTCA 
CCGGGCCGGG 
CCTGCTACAG 
TGCGAGGTGG 
TCCTGCGGGC 
TGGAGCGCAG 
TGTCCTCCGA 
GCTCCTGCAC 
CTGGAAGGAG 
GGTCTTGTGT 
CCAGCTGCAC 
CCGGGGACAG 
CAAACTCTCG 
CGCAGAGCTC 
CTTCGAGCTG 
TGGGGGGACC 
GAGAACATGG 
AGACAATTCA 
TACCCTTCAA 
GATTT CCAAG 
TGCCGTGGTC 
AGTACTGGGG 
GTCTATGGGC 
TGCACATTGC 
GGGTGCCTTG 



51 

I 

GGCCCGCGGG 
GGAGGGGGAA 
GCTCTGCGTC 
GAGAATGAGG 
CGGCGGCGAA 
CCTGCACCAC 
GGCGCTCAGC 
AGGCCCAGGG 
GCGTTCCCAC 
CCCCGGCGGT 
CGCGCGGAGA 
ATGCGATGCC 
CCTGCGCCGC 
AGCGCCGCTC 
CTCCCGATCT 
GGCGATGTGT 
CCTAACTGCC 
GGGAAGGACG 
GGGGTGCCCA 
CCAATCAGGG 
GTAACATCTA 
ATGTCCCTTC 
TTTAATTCTA 
TTCATATTTG 
CTTGTCAAGC 
CCGCCGGGCC 
ACAAACAATG 
CTGGCGGAGT 



Seq ID NO: 343 Protein sequence 
Protein Accession #: FGENESH predicted 



MGKDFMTKTP 
IARIYKELEQ 
GLGKPAVEGG 
PHCRPCWLLG 
RRGLQRPAVL 
RGT PGHRWGR 
DFSPPGTEVS 
DDLGGFACEC 
DEKLGETPLV 
TSSATPQAFD 
ESDPEPAALG 



11 
I 

KAFATKAKID 
IYKKKKPTKT 
DRAPDTAIiRP 
LGGLLQPAPR 
GRTGAQAFPL 
ARSWKEMRCH 
ALCRGQLPIS 
ATGFELGKDG 
PEQDNSVTSI 
SSSAWFIFV 
SSSAHCTNNG 



21 
I 

KWDLIKLKSF 
LRTHFLSRPK 
RAGQIQVGSS 
YHEAAGGRGG 
HPGERAFAGF 
LRANGYLCKY 
VTCIADEIGA 
RSCVTSGEGQ 
PEIPRWGSQS 
STAWVLVIL 
VKVGDCDBRD 



31 
I 

CTAKETI IRV 
GNCWPLGPRG 
SACGASENEA 
LHPARWGAQH 
LLAVLRPRRS 
QFEVLCPAPR 
RWDKLSGDVL 
PTLGGTGVPT 
TMSTLQMSLQ 
TMTVLGLVKL 
RAEGALLAES 



41 
I 

NSQPTDWQKT 
DSWQLGGPSG 
GVRPVPPLAG 
RACGRRAARC 
RKRHAAVGGG 
PGAASNLSYR 
CPCPGRYLRA 
RRPPATATSP 
AESKATITPS 



51 

I 

FAIYPSDKGV 
ARAEGKGGGT 
ALARAGRRRT 
ARAPAGRPRA 
APTLLHRAEM 
APFQLHSAAL 
GKCAELPNCL 
VPQRTWPIRV 
GSVISKFNST 
RKESMGPPGL 



60 
120 
180 
240 
300 
360 
420 
480 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



60 
12 0 
180 
240 
300 
360 
420 
480 
540 
600 
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Seq ID NO: 344 DNA sequence 
Nucleic Acid Accession #: NM_012072 
Coding sequence: 149-2107 

1 . 11 21 31 41 51 

AAAGCCCTCA GCCTTTGTGT CCTTCTCTGC GCCGGAGTGG CTGCAGCTCA CCCCTCAGGT 60 

CCCCTTGGGG CCCAGCTGGG AG CCG AG AT A GAAGCTCCTG TCGCCGCTGG GCTTCTCGCC 120 

TCCCGCAGAG GGCCACACAG AGACCGGGAT GGCCACCTCC ATGGGCCTGC TGCTGCTGCT 180 

GCTGCTGCTC CTGACCCAGC CCGGGGCGGG G ACGGG AG CT GACACGGAGG CGGTGGTCTG 240 

CGTGGGGACC GCCTGCTACA CGGCCCACTC GGGCAAGCTG AGCGCTGCCG AGGCCCAGAA 300 

CCACTGCAAC CAGAACGGGG GCAACCTGGC CACTGTGAAG AGCAAGGAGG AGGCCCAGCA 3 60 

CGT CCAGCG A GTACTGGCCC AGCTCCTGAG GCGGGAGGCA GCCCTGACGG CGAGGATGAG 420 

CAAGTTCTGG ATTGGGCTCC AGCGAGAGAA GGGCAAGTGC CTGGACCCTA GTCTGCCGCT 480 

GAAGGGCTTC AGCTGGGTGG GCGGGGGGGA GGACACGCCT TACTCTAACT GGCACAAGGA 540 

GCTCCGGAAC TCGTGCATCT CCAAGCGCTG TGTGTCTCTG CTGCTGGACC TGTCCCAGCC 600 

GCTCCTTCCC AACCGCCTGC CCAAGTGGTC TGAGGGCCCC TGTGGGAGCC CAGGCTCCCC 660 

CGGAAGTAAC ATTGAGGGCT TCGTGTGCAA GTTCAGCTTC AAAGGCATGT GCCGGCCTCT 720 

GGCCCTGGGG GGCCCAGGTC AGGTGACCTA CACCACCCCC TTCCAGACCA CCAGTTCCTC 780 

CTTGGAGGCT GTGCCCTTTG CCTCTGCGGC CAATGTAGCC TGTGGGGAAG GTGACAAGGA 840 

CGAGACT CAG AGTCATTATT TCCTGTGCAA GGAGAAGGCC CCCGATGTGT TCGACTGGGG 900 

CAGCTCGGGC CCCCTCTGTG TCAGCGCCAA GTATGGCTGC AACTTCAACA ATGGGGGCTG 960 

CCACCAGGAC TGCTTTGAAG GGGGGGATGG CTCCTTCCTC TGCGGCTGCC GACCAGGATT 1020 

CCGGCTGCTG GATGACCTGG TGACCTGTGC CTCTCGAAAC CCTTGCAGCT CCAGCCCATG 1080 

TCGTGGGGGG GCCACGTGCG TCCTGGGACC CCATGGGAAA AACTACACGT GCCGCTGCCC 1140 

CCAAGGGTAC CAGCTGGACT CGAGT CAGCT GGACTGTGTG GACGTGGATG AATGCCAGGA 1200 

CTCCCCCTGT GCCCAGGAGT GTGTCAACAC CCCTGGGGGC TTCCGCTGCG AATGCTGGGT 1260 

TGGCTATGAG CCGGGCGGTC CTGGAGAGGG GGCCTGTCAG GATGTGGATG AGTGTGCTCT 1320 

GGGTCGCTCG CCTTGCGCCC AGGGCTGCAC CAACACAGAT GGCTCATTTC ACTGCTCCTG 1380 

TGAGGAGGGC TACGTCCTGG CCGGGGAGGA CGGGACTCAG TGCCAGGACG TGGATGAGTG 144 0 

TGTGGGCCCG GGGGGCCCCC TCTGCGACAG CTTGTGCTTC AACACACAAG GGTCCTTCCA 1500 

CTGTGGCTGC CTGCCAGGCT GGGTGCTGGC CCCAAATGGG GTCTCTTGCA CCATGGGGCC 1560 

TGTGTCTCTG GGACCACCAT CTGGGCCCCC CGATGAGGAG GACAAAGGAG AGAAAGAAGG 1620 

GAGCACCGTG CCCCGCGCTG CAACAGCCAG TCCCACAAGG GGCCCCGAGG GCACCCCCAA 1680 

GGCTACACCC ACCACAAGTA GACCTTCGCT GTCATCTGAC GCCCCCATCA CATCTGCCCC 17 40 

ACTCAAGATG CTGGCCCCCA GTGGGTCCTC AGGCGTCTGG AGGGAGCCCA GCATCCATCA 180 0 

CGCCACAGCT GCCTCTGGCC CCCAGGAGCC TGCAGGTGGG GACTCCTCCG TGGCCACACA 1860 

AAACAACGAT GGCACTGACG GGCAAAAGCT GCTTTTATTC TACATCCTAG GCACCGTGGT 192 0 

GGCCATCCTA CTCCTGCTGG CCCTGGCTCT GGGGCTACTG GTCTATCGCA AGCGGAGAGC 1980 

GAAGAGGGAG GAGAAGAAGG AGAAGAAGCC CCAGAATGCG GCAGACAGTT ACTCCTGGGT 2040 

TCCAGAGCGA GCTGAGAGCA GGGCCATGGA GAACCAGTAC AGTCCGACAC C TGGG AC AG A 210 0 

CTGCTGAAAG TGAGGTGGCC CTAGAGACAC TAGAGTCACC AGCCACCATC CTCAGAGCTT 2160 

TGAACTCCCC ATTCCAAAGG GGCACCCACA TTTTTTTGAA AGACTGGACT GGAATCTTAG 222 0 

CAAACAATTG TAAGTCTCCT CCTTAAAGGC CCCTTGGAAC ATGCAGGTAT TTTCTACGGG 2280 

TGTTTGATGT TCCTGAAGTG GAAGCTGTGT GTTGGCGTGC CACGGTGGGG ATTTCGTGAC 2340 

TCTATAATGA TTGTTACTCC CCCTCCCTTT TCAAATTCCA ATGTGACCAA TTCCGG AT CA 240 0 

GGGTGTGAGG AGGCTGGGGC TAAGGGGCTC CCCTGAATAT CTTCTCTGCT CACTTCCACC 2460 

ATCTAAGAGG AAAAGGTGAG TTGCTCATGC TGATTAGGAT TGAAATGATT TGTTTCTCTT 2520 

CCTAGGATGA AAACTAAATC AATTAATTAT TCAATTAGGT AAGAAGATCT GGTTTTTTGG 2580 

TCAAAGGGAA CATGTTCGGA CTGGAAACAT TTCTTTACAT TTGCATTCCT CCATTTCGCC 2640 

AGCACAAGTC TTGCTAAATG TGATACTGTT GACATCCTCC AGAATGGCCA GAAGTGCAAT 2700 

TAACCTCTTA GGTGGCAAGG AGGCAGGAAG TGCCTCTTTA GTTCTTACAT TTCTAATAGC 2760 

CTTGGGTTTA TTTGCAAAGG AAG CTTG AAA AATATGAGAA AAGTTGCTTG AAGTGCATTA 2820 

CAGGTGTTTG TGAAGTCACA TAATCTACGG GGCTAGGGCG AGAGAGGCCA GGGATTTGTT 2880 

CACAGATACT TGAATTAATT CATCCAAATG TACTGAGGTT ACCACACACT TGACTACGGA 2 940 

TGTGATCAAC ACTAACAAGG AAACAAATTC AAGGACAACC TGTCTTTGAG CCAGGGCAGG 3 0 00 

CCTCAGACAC CCTGCCTGTG GCCCCGCCTC CACTTCATCC TGCCCGGAAT GCCAGTGCTC 3060 

CGAGCTCAGA CAGAGGAAGC CCTGCAGAAA GTTCCATCAG GCTGTTTCCT AAAGGATGTG 3120 

TGAACGGGAG ATGATGCACT GTGTTTTGAA AGTTGTCATT TTAAAGCATT TTAGCACAGT 3180 

TCATAGTCCA CAGTTGATGC AGCATCCTGA GATTTTAAAT CCTGAAGTGT GGGTGGCGCA 3240 

CACACCAAGT AGGGAGCTAG TCAGGCAGTT TGCTTAAGGA ACTTTTGTTC TCTGTCTCTT 3300 

TTCCTTAAAA TTGGGGGTAA GGAGGGAAGG AAGAGGGAAA GAGATGACTA ACTAAAATCA 3360 

TTTTTACAGC AAAAACTGCT CAAAGCCATT TAAATTATAT CCTCATTTTA AAAGTTACAT 3420 

TTGCAAATAT TTCTCCCTAT GATAATGCAG TCGATAGTGT GCACTCTTTC TCTCTCTCTC 3480 

TCTCTCTCAC ACACACACAC ACACACACAC ACACACACAC AGAGACACGG CACCATTCTG 3540 

CCTGGGGCAC TGGAACACAT TCCTGGGGGT CACCGATGGT CAGAGTCACT AGAAGTTACC 3 600 

TGAGTATCTC TGGGAGGCCT CATGTCTCCT GTGGGCTTTT TACCACCACT GTGCAGGAGA 3660 

ACAGACAGAG GAAATGTGTC TCCCTCCAAG GCCCCAAAGC CT CAG AG AAA GGGTGTTTCT 3720 

GGTTTTGCCT TAGCAATGCA TCGGTCTCTG AGGTGACACT CTGGAGTGGT TGAAGGGCCA 3780 

CAAGGTG CAG GGTTAATACT CTTGCCAGTT TTGAAATATA GATGCTATGG TTCAGATTGT 3 840 

TTTTAATAGA AAACTAAAGG GGCAGGGGAA GTGAAAGGAA AGATGGAGGT TTTGTGCGGC 3900 

TCGATGGGGC ATTTGGAACT TCTTTTTAAA GTCATCTCAT GGTCTCCAGT TTTCAGTTGG 3960 

AACTCTGGTG TTTAACACTT AAGGGAGACA AAGGCTGTGT CCATTTGGCA AAACTTCCTT 4020 

GGCCACGAGA CTCTAGGTGA TGTGTGAAGC TGGGCAGTCT GTGGTGTGGA GAGCAGCCAT 4080 

CTGTCTGGCC ATTCAGAGGA TTCTAAAGAC ATGGCTGGAT GCGCTGCTGA CCAACATCAG 4140 

CACTTAAATA AATGCAAATG CAACATTTCT CCCTCTGGGC CTTGAAAATC CTTGCCCTTA 4200 

TCATTTGGGG TGAAGGAGAC ATTTCTGTCC TTGGCTTCCC ACAGCCCCAA CGCAGTCTGT 4260 

GTATGATTCC TGGGATCCAA CGAGCCCTCC TATTTTCACA GTGTTCTGAT TGCTCTCACA 432 0 

GCCCAGGCCC ATCGTCTGTT CTCTGAATGC AGCCCTGTTC TCAACAACAG GGAGGTCATG 4380 

GAACCCCTCT GTGGAACCCA CAAGGGGAGA AATGGGTGAT AAAGAATCCA GTTCCTCAAA 4440 

ACCTTCCCTG GCAGGCTGGG TCCCTCTCCT GCTGGGTGGT GCTTTCTCTT GCACACCACT 4500 

CCCACCACGG GGGGAGAGCC AGCAACCCAA CCAGACAGCT CAGGTTGTGC ATCTGATGGA 4560 

AACCACTGGG CT CAAACACG TGCTTTATTC TCCTGTTTAT TTTTGCTGTT ACTTTGAAGC 462 0 

ATGGAAATTC TTGTTTGGGG GATCTTGGGG CTACAGTAGT GGGTAAACAA ATGCCCACCG 4680 

GCCAAGAGGC CATTAACAAA TCGTCCTTGT CCTGAGGGGC CCCAGCTTGC TCGGGCGTGG 4740 

CACAGTGGGG AAT C CAAGGG TCACAGTATG GGGAGAG GTG CACCCTGCCA CCTGCTAACT 4800 
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TCTCGCTAGA CACAGTGTTT CTGCCCAGGT GACCTGTTCA GCAG CAGAAC AAGCCAGGGC 4860 

CATGGGGACG GGGGAAGTTT TCACTTGGAG ATGGACACCA AGACAATGAA GATTTGTTGT 4920 

CCAAATAGGT CAATAATTCT GGGAGACTCT TGGAAAAAAC TGAATATATT CAGGACCAAC 4980 

TCTCTCCCTC CCCTCATCCC ACATCTCAAA GCAGACAATG TAAAGAGAGA ACATCTCACA 5040 

CACCCAGCTC GCCATGCCTA CTCATTCCTG AATTTCAGGT GCCATCACTG CTCTTTCTTT 5100 

CTTCTTTGTC ATTTGAGAAA GGATGCAGGA GGACAATTCC CACAGATAAT CTGAGGAATG 5160 

CAGAAAAACC AGGGCAGGAC AGTTATCGAC AATGCATTAG AACTTGGTGA GCATCCTCTG 5220 

TAGAGGGACT CCACCCCTGC TCAACAGCTT GGCTTCCAGG CAAGACCAAC CACATCTGGT 52 80 

CTCTGCCTTC GGTGGCCCAC ACAGCTAAGC GTCATCGTCA TTGCCATAGC ATCATGATGC 5340 

AACACATCTA CGTGTAGCAC TACGACGTTA TGTTTGGGTA ATGTGGGGAT GAACTGCATG 540 0 

AGGCTCTGAT TAAGGATGTG GGGAAGTGGG CTGCGGTCAC TGTCGGCCTT GCAAGGCCAC 5460 

CTGGAGGCCT GTCTGTTAGC CAGTGGTGGA GGAGCAAGGC TTCAGGAAGG GCCAGCCACA 552 0 

TGCCATCTTC CCTGCGATCA GGCAAAAAAG TGGAATTAAA AAGTCAAACC TTTATATGCA 5580 

TGTGTTATGT CCATTTTGCA GGATGAACTG AGTTTAAAAG AATTTTTTTT TCTCTTCAAG 5640 

TTGCTTTGTC TTTTCCATCC TCATCACAAG CCCTTGTTTG AGTGTCTTAT CCCTGAGCAA 5700 

TCTTTCGATG GATGGAGATG AT CATTAGGT ACTTTTGTTT CAACCTTTAT TCCTGTAAAT 5760 

ATTTCTGTGA AAACTAGGAG AACAGAGATG AGATTTGACA AAAAAAAATT GAATTAAAAA 5820 

TAACACAGTC TTTTTAAAAC TAACATAGGA AAGCCTTTCC TATTATTTCT CTTCTTAGCT 5880 

TCTCCATTGT CTAAATCAGG AAAACAGGAA AACACAGCTT TCTAGCAGCT GCAAAATGGT 5940 

TTAATGCCCC CTACATATTT CCATCACCTT GAACAATAGC TTTAGCTTGG GAATCTGAGA 6000 

TATGATCCCA GAAAACATCT GTCTCTACTT CGGCTGCAAA ACCCATGGTT TAAATCTATA 6060 

TGGTTTGTGC ATTTTCTCAA CTAAAAATAG AGATGATAAT CCGAATTCTC CATATATTCA 612 0 

CTAATCAAAG ACACTATTTT CATACTAGAT TCCTGAGACA AATACTCACT GAAGGGCTTG 6180 

TTTAAAAATA AATTGTGTTT TGGTCTGTTC TTGTAGATAA TGCCCTTCTA TTTTAGGTAG 6240 

AAGCTCTGGA ATCCCTTTAT TGTGCTGTTG CTCTTATCTG CAAGGTGGCA AGCAGTTCTT 630 0 

TTCAGCAGAT TTTGCCCACT ATTCCTCTGA GCTGAAGTTC TTTGCATAGA TTTGGCTTAA 6360 

GCTTGAATTA GATCCCTGCA AAGGCTTGCT CTGTGATGTC AGATGTAATT GTAAATGTCA 642 0 

GTAATCACTT CATGAATGCT AAATGAGAAT GTAAGTATTT TTAAATGTGT GTATTTCAAA 6480 

TTTGTTTGAC TAATTCTGGA ATTACAAGAT TTCTATGCAG GATTTACCTT CATCCTGTGC 6540 

ATGTTTCCCA AACTGTGAGG AGGGAAGGCT CAGAGATCGA GCTTCTCCTC TGAGTTCTAA 6600 

CAAAATGGTG CTTTGAGGGT CAGCCTTTAG GAAGGTGCAG CTTTGTTGTC CTTTGAGCTT 6660 
TGTGTTATGT GCCTATCCTA ATAAACTCTT AAACACATT 

Seq ID NO: 345 Protein sequence 
Protein Accession #: NP_036204 



PCT/US02/12476 



ii 



21 



31 



41 



51 



MAT SMGLLLL 
ATVKSKEEAQ 
EDTPYSNWHK 
KFSFKGMCRP 
KEKAPDVFDW 
ASRNPCSSSP 
TPGGFRCECW 
DGTQCQDVDE 
PDEEDKGEKE 
SGVWREPSIH 
LGLLVYRKRR 



LLLLLTQPGA 
HVQRVLAQLIi 
ELRNSCISKR 
LALGGPGQVT 
GSSGPLCVSP 
CRGGATCVLG 
VGYEPGGPGE 
CVGPGGPLCD 
GSTVPRAATA 
HATAASGPQE 
AKREEKKEKK 



GTGADTEAW 
RREAALTARM 
CVSLLLDLSQ 
YTTPFQTTSS 
KYGCNFNNGG 
PHGKNYTCRC 
GACQDVDECA 
SLCFNTQGSF 
SPTRGPEGTP 
PAGGDSSVAT 
PQNAADSYSW 



Seq ID NO: 346 DNA sequence 
Nucleic Acid Accession #: Z31560 
Coding sequence: <l-966 



1 
I 

CACAGCGCCC 
ACTTCGGGGG 
AGCCCGGACC 
CGCAAGATGG 
GCCGAGTGGA 
CTGCGAGCGC 
AAGACGCTCA 
AATAGCATGG 
ATGGACAGTT 
CAGCTGGGCT 
ATGCACCGCT 
ATGAACGGCT 
CTTGGCTCCA 
TCTTCCTCCC 
TATCTCCCCG 
CACTACCAGA 
ATGTGAGGGC 
TGGGAGGGGT 
AAAAA 



11 
I 

GCATGTACAA 
GCGGCGGCGG 
GCGTCAAGCG 
CCCAGGAGAA 
AACTTTTGTC 
TGCACATGAA 
TGAAGAAGGA 
CGAGCGGGGT 
ACGCGCACAT 
ACCCGCAGCA 
ACGACGTGAG 
CGCCCACCTA 
TGGGTTCGGT 
ACTCCAGGGC 
GCGCCGAGGT 
GCGGCCCGGT 
CGGACAGCGA 
GCAAAAGAGG 



21 

I 

CATGATGGAG 
CAACTCCACC 
GCCCATGAAT 
CCCCAAGATG 
GGAGACGGAG 
GGAG CACCCG 
TAAGTACACG 
CGGGGTGGGC 
GAACGGCTGG 
CCCGGGCCTC 
CGCCCTGCAG 
CAG CATGTCC 
GGTCAAGTCC 
GCCCTGCCAG 
GCCGGAACCC 
GCCCGGCACG 
ACTGGAGGGG 
AGAGTAAGAA 



CVGTACYTAH 
SKFWIGLQRE 
PLLPNRLPKW 
SLEAVPFASA 
CHQDCFEGGD 
PQGYQLDSSQ 
LGRSPCAQGC 
HCGCLPGWVL 
KATPTTSRPS 
QNNDGTDGQK 
VPERAESRAM 



31 

I 

ACGGAGCTGA 
GCGGCGGCGG 
GCCTTCATGG 
CACAACT CGG 
AAGCGGCCGT 
GATTATAAAT 
CTGCCCGGCG 
GCCGGCCTGG 
AGCAACGGCA 
AATGCGCACG 
TACAACTCCA 
TACTCGCAGC 
GAGGCCAGCT 
GCCGGGGACC 
GCCGCCCCCA 
GCCATTAACG 
GGAGAAATTT 
ACAGCATGGA 



SGKLSAAEAQ 
KGKCLDPSLP 
SEGPCGSPGS 
ANVACGEGDK 
GSFLCGCRPG 
LDCVDVDECQ 
TNTDGSFHCS 
APNGVSCTMG 
LSSDAPITSA 
LLLFYILGTV 
ENQYSPTPGT 



41 

I 

AGCCGCCGGG 
CCGGCGGCAA 
TGTGGTCCCG 
AG AT CAGCAA 
T CAT CG ACG A 
ACCGGCCCCG 
GGCTGCTGGC 
GCGCGGGCGT 
GCTACAGCAT 
GCGCAGCGCA 
TGACCAGCTC 
AGGGCACCCC 
CCAGCCCCCC 
TCCGGGACAT 
GCAGACTTCA 
GCACACTGCC 
TCAAAGA^AA 
GAAAACCCGG 



Seq ID NO: 347 Protein sequence 
Protein Accession #: CAA83435 



HSARMYNMME 
RKMAQENPKM 
KTLMKKDKYT 
QLGYPQHPGL 
LGSMGSWKS 
HYQSGPVPGT 



11 

1 

TELKPPGPQQ 
HNSEISKRLG 
LPGGLLAPGG 
NAHGAAQMQP 
EASSSPPWT 
AINGTLPLSH 



21 
I 

TSGGGGGNST 
AEWKLLSETE 
NSMASGVGVG 
MHRYDVSALQ 
SSSHSRAPCQ 
M 



31 
I 

AAAAGGNQKN 
KRPFIDEAKR 
AGLGAGVNQR 
YNSMTSSQTY 
AGDLRDMISM 



41 
I 

SPDRVKRPMN 
LRALHMKEHP 
MDSYAHMNGW 
MNGSPTYSMS 
YLPGAEVPEP 



NHCNQNGGNL 
LKGFSWVGGG 
PGSNIEGFVC 
DETQSHYFLC 
FRLLDDLVTC 
DSPCAQECVN 
CEEGYVLAGE 
PVSLGPPSGP 
PLKMLAPSGS 
VAILLLLALA 
DC 



51 

I 

CCCGCAGCAA 
CCAGAAAAAC 
CGGGCAGCGG 
GCGCCTGGGC 
GGCTAAGCGG 
GCGGAAAACC 
CCCCGGCGGC 
GAACCAGCGC 
GATGCAGGAC 
GATGCAGCCC 
GCAGACCTAC 
TGGCATGGCT 
TGTGGTTACC 
GATCAGCATG 
CATGTCCCAG 
CCTCTCACAC 
ACGAGGGAAA 
TACGCTCAAA 



51 
I 

AFMVWSRGQR 
DYKYRPRRKT 
SNGSYSMMQD 
YSQQGTPGMA 
AAPSRLHMSQ 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



60 
120 
180 
240 
300 



317 



WO 02/086443 



PCT/US02/12476 



Seq ID NO: 348 DNA sequence 
Nucleotide Accession #: NM_002638 
^ Coding sequence: 120-473 

1 11 21 31 41 51 

111!!) 

CAATACAGCT AAGGAATTAT CCCTTGTAAA TACCACAGAC CCGCCCTGGA GCCAGGCCAA 60 

GCTGGACTGC ATAAAGATTG GTATGGCCTT AGCTCTTAGC CAAACACCTT CCTGACACCA 120 

10 TGAGGGCCAG CAGCTTCTTG ATCGTGGTGG TGTTCCTCAT CGCTGGGACG CTGGTTCTAG 180 

AGGCAGCTGT CACGGGAGTT CCTGTTAAAG GTCAAGACAC TGTCAAAGGC CGTGTTCCAT 240 

TCAATGGACA AGATCCCGTT AAAGGACAAG TTTCAGTTAA AGGTCAAGAT AAAGTCAAAG 300 

CGCAAGAGCC AGTCAAAGGT CCAGTCTCCA CTAAGCCTGG CTCCTGCCCC ATTATCTTGA 360 

TCCGGTGCGC CATGTTGAAT CCCCCTAACC GCTGCTTGAA AGATACTGAC TGCCCAGGAA 420 

15 TCAAGAAGTG CTGTGAAGGC TCTTGCGGGA TGGCCTGTTT CGTTCCCCAG TGAAGGGAGC 4 80 

CGGTCCTTGC TGCACCTGTG CCGTCCCCAG AGCTACAGGC CCCATCTGGT CCTAAGTCCC 540 

TGCTGCCCTT CCCCTTCCCA CACTGT CCAT TCTTCCTCCC ATTCAGGATG CCCACGGCTG 600 
GAGCTGCCTC TCTCATCCAC TTTCCAATAA A 

20 Seq ID NO: 349 Protein sequence: 

Protein Accession #: NP_002629 

1 11 21 31 41 51 

oc I 1 I 1 1 1 

ZD MRASSFLIW VFLIAGTLVL EAAVTGVPVK GQDTVKGRVP FNGQDPVKGQ VSVKGQDKVK 60 
AQEPVKGPVS TKPGSCPIIL IRCAMLNPPN RCLKDTDCPG IKKCCEGSCG MACFVPQ 

Seq ID NO: 350 DNA sequence 
30 Nucleic Acid Accession #: NM_007183 
Coding sequence: 75-2468 

1 11 21 31 41 51 

^ I I I I I I 

3D gaattccgga CAGGACGTGA agatagttgg GTTTGGAGGC GGCCGCCAGG CCCAGGCCCG 60 

GTGGACCTGC CGCCATGCAG GACGGTAACT TCCTGCTGTC GGCCCTGCAG CCTGAGGCCG 120 

GCGTGTGCTC CCTGGCGCTG CCCTCTGACC TGCAGCTGGA CCGCCGGGGC GCCGAGGGGC 180 

CGGAGGCCGA GCGGCTGCGG GCAGCCCGCG TCCAGGAGCA GGTCCGCGCC CGCCTCTTGC 240 

AGCTGGGACA GCAGCCGCGG CACAACGGGG CCGCTGAGCC CGAGCCTGAG GCCGAGACTG 300 

40 CCAGAGGCAC ATCCAGGGGG CAGTACCACA CCCTGCAGGC TGGCTTCAGC TCTCGCTCTC 360 

AGGGCCTGAG TGGGGACAAG ACCTCGGGCT TCCGGCCCAT CGCCAAGCCG GCCTACAGCC 42 0 

CAGCCTCCTG GTCCTCCCGC TCCGCCGTGG ATCTGAGCTG CAGTCGGAGG CTGAGTTCAG 480 

CCCACAATGG GGGCAGCGCC TTTGGGGCCG CTGGGTACGG GGGTGCCCAG CCCACCCCTC 540 

CCATGCCCAC CAGGCCCGTG TCCTTCCATG AGCGCGGTGG GGTTGGGAGC CGGGCCGACT 600 

45 ATGACACACT CTCCCTGCGC TCGCTGCGGC TGGGGCCCGG GGGCCTGGAC GACCGCTACA 660 

GCCTGGTGTC TGAGCAGCTG GAGCCCGCGG CCACCTCCAC CTACAGGGCC TTTGCGTACG 720 

AGCGCCAGGC CAGCTCCAGC TCCAGCCGGG CAGGGGGGCT GGACTGGCCC GAGGCCACTG 780 

AGGTTTCCCC GAGCCGGACC ATCCGTGCCC CTGCCGTGCG GACCCTGCAG CGATTCCAGA 840 

GCAGCCACCG GAGCCGCGGG GTAGGCGGGG CAGTGCCGGG GGCCGTCCTG GAG CCAGTGG 900 

50 CTCGAGCGCC ATCTGTGCGC AGCCTCAGCC TCAGCCTGGC TGACTCGGGC CACCTGCCGG 960 

ACGTGCATGG GTTCAACAGC TACGGTAGCC ACCGAACCCT GCAGAGACTC AGCAGCGGTT 102 0 

TTGATGACAT TGACCTGCCC TCAGCAGTCA AGTACCTCAT GGCTTCAGAC CCCAACCTGC 1080 

AGGTGCTGGG AGCGGCCTAC ATCCAGCACA AGTGCTACAG CGATGCAGCC GCCAAGAAGC 1140 

AGGCCCGCAG CCTTCAGGCC GTGCCTAGGC TGGTGAAGCT CTTCAACCAC GCCAACCAGG 1200 

55 AAGTGCAGCG CCATGCCACA GGTGCCATGC GCAACCTCAT CTACGACAAC GCTGACAACA 12 60 

AGCTGGCCCT GGTGGAGGAG AACGGGATCT TCGAGCTGCT GCGGACACTG CGGGAGCAGG 1320 

ATGATGAGCT TCGCAAAAAT GTCACAGGGA TCCTGTGGAA CCTTTCATCC AGCGACCACC 13 80 

TGAAGGACCG CCTGGCCAGA GACACGCTGG AGCAGCTCAC GGACCTGGTG TTGAGCCCCC 1440 

TGTCGGGGGC TGGGGGTCCC CCCCTCATCC AGCAGAACGC CTCGGAGGCG GAGATCTTCT 1500 

60 ACAACGCCAC CGGCTTCCTC AGGAACCTCA GCTCAGCCTC TCAGGCCACT CGCCAGAAGA 1560 

TGCGGGAGTG CCACGGGCTG GTGGACGCCC TGGTCACCTC TATCAACCAC GCCCTGGACG 1620 

CGGGCAAATG CGAGGACAAG AGCGTGGAGA ACGCGGTGTG CGTCCTGCGG AACCTGTCCT 1680 

ACCGCCTCTA CGACGAGATG CCGCCGTCCG CGCTGCAGCG GCTGGAGGGT CGCGGCCGCA 1740 

GGGACCTGGC GGGGGCGCCG CCGGGAGAGG TCGTGGGCTG CTTCACGCCG CAGAGCCGGC 1800 

65 GGCTGCGCGA GCTGCCCCTC GCCGCCGATG CGCTCACCTT CGCGGAGGTG TCCAAGGACC 1860 

CCAAGGGCCT CGAGTGGCTG TGGAGCCCCC AGATCGTGGG GCTGTACAAC CGGCTGCTGC 1920 

AGCGCTGCGA GCTCAACCGG CACACGACGG AGGCGGCCGC CGGGGCGCTG CAGAACATCA 1980 

CGGCAGGCGA CCGCAGGTGG GCGGGGGTGC TGAGCCGCCT GGCCCTGGAG CAGGAGCGTA 2040 

TTCTGAACCC CCTGCTAGAC CGTGT CAGG A CCGCCGACCA CCACCAGCTG CGCTCACTGA 2100 

70 CTGGCCTCAT CCGAAACCTG TCTCGGAACG CTAGGAACAA GGACGAGATG TCCACGAAGG 2160 

TGGTGAGCCA CCTGATCGAG AAGCTGCCAG GCAGCGTGGG TGAGAAGTCG CCCCCAGCCG 222 0 

AGGTGCTGGT CAACATCATA GCTGTGCTCA ACAACCTGGT GGTGGCCAGC CCCATCGCTG 2280 

CCCGAGACCT GCTGTATTTT GACGGACTCC GAAAGCTCAT CTTCAT CAAG AAGAAGCGGG 2340 

ACAGCCCCGA CAGTGAGAAG TCCTCCCGGG CAGCATCCAG CCTCCTGGCC AACCTGTGGC 2400 

75 AGTACAACAA GCTCCACCGT GACTTTCGGG CGAAGGGCTA TCGGAAGGAG GACTTCCTGG 2460 

GCCCATAGGT GAAGCCTTCT GGAGGAGAAG GTGACGTGGC CCAGCGTCCA AGGGACAGAC 2520 

T CAGCTCC AG GCTGCTTGGC AGCCCAGCCT GGAGGAGAAG GCTAATGACG GAGGGGCCCC 258 0 

TCGCTGGGGC CCCTGTGTGC ATCTTTGAGG GTCCTGGGCC ACCAGGAGGG GCAGGGTCTT 2 640 

ATAGCTGGGG ACTTGGCTTC CGCAGGGCAG GGGGTGGGGC AGGGCTCAAG GCTGCTCTGG 2700 

80 TGTATGGGGT GGTGACCCAG TCACATTGGC AGAGGTGGGG GTTGGCTGTG GCCTGGCAGT 2760 

ATCTTGGGAT AGCCAGCACT GGGAATAAAG ATGGCCATGA ACAGTCACAA AAAAAAAAAA 2820 
AAAAGGAATT C 

Seq ID NO: 351 Protein sequence 
85 Protein Accession #: NP_00 9114.1 

1 ii 21 31 41 51 



318 



WO 02/086443 

I I I I I I 

MQDGNFLLSA LQPEAGVCSL ALPSDLQLDR RGAEGPEAER LRAARVQEQV RARLLQLGQQ 60 

PRHNGAAEPE PEAETARGTS RGQYHTLQAG FSSRSQGLSG DKTSGFRPIA KPAYSPASWS 12 0 

SRSAVDLSCS RRLSSAHNGG SAFGAAGYGG AQPTPPMPTR PVSFHERGGV GSRADYDTLS 180 

5 LRSLRLGPGG LDDRYSLVSE QLEPAATSTY RAFAYERQAS SSSSRAGGLD WPEATEVSPS 240 

RTIRAPAVRT LQRFQSSHRS RGVGGAVPGA VLEPVARAPS VRSLSLSLAD SGHLPDVHGF 300 

NSYGSHRTLQ RLSSGFDDID LP S AVKYLMA SDPNLQVLGA AYIQHKCYSD AAAKKQARSL 3 60 

QAVPRLVKLF NHANQEVQRH ATGAMRNLIY DNADNKLALV EENGIFELLR TLREQDDELR 420 

KNVTGILWNL SSSDHLKDRL ARDTLEQLTD LVLSPLSGAG GPPLIQQNAS EAEIFYNATG 480 

10 FLRNLSSASQ ATRQKMRECH GLVDALVTSI NHALDAGKCE DKSVENAVCV LRNLSYRLYD 540 

EMPPSALQRL EGRGRRDLAG APPGEWGCF TPQSRRLREL PL AADALT FA EVSKDPKGLE 600 

WLWSPQIVGL YNRLLQRCEL NRHTTEAAAG ALQNITAGDR RWAGVLSRLA LEQERILNPL 660 

LDRVRTADHH QLRSLTGLIR NLSRNARNKD EMSTKWSHL IEKLPGSVGE KSPPAEVLVN 72 0 

I I AVLNNLW ASPIAARDLL YFDGLRKLIF IKKKRDSPDS EKSSRAASSL LANLWQYNKL 780 

15 HRDFRAKGYR KEDFLGP 



PCT/US02/12476 



20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



Seq ID NO: 3 52 DNA sequence 
Nucleic Acid Accession #: M31469 
Coding sequence: 1-651 



ATGGCTGCGC 
ACTGGAAAAA 
GCCACCTTGG 
TTCAATGTAT 
ATCCAAGCCC 
GTGCCTAACT 
GG C AACAAAG 
AAGAAGAATC 
TTCCTCTGGC 
GCTCTCGCCC 
TTAGAGGTTG 



11 
I 

AGGGAGAGCC 
CGACCTTCGT 
GTGTTGAGGT 
GGGACACAGC 
AGTGTGCCAT 
GGCATAGAGA 
TGGATATTAA 
TTCAGTACTA 
TTGCTAGGAA 
CACCAGAAGT 
CTCAGACAAC 



21 
I 

CCAGGT CCAG 
G AAACGT CAT 
TCATCCCCTA 
CGGCCAGGAG 
CATAATGTTT 
TCTGGTACGA 
GGACAGGAAA 
CGACATTTCT 
GCTCAT TGGA 
TGTCATGGAC 
TGCTCTCCCG 



31 
I 

TTCAAACTTG 
TTGACTGGTG 
GTGTT CCACA 
AAATTCGGTG 
GATGTAACAT 
GTGTGTGAAA 
GTGAAGGCGA 
GCCAAAAGTA 
GACCCTAACT 
CCAGCTTTGG 
GATGAGGATG 



41 
I 

TATTGGTTGG 
AATT TGAGAA 
CCAACAGAGG 
GACTGAGAGA 
CGAGAGTTAC 
ACATCCCCAT 
AATCCATTGT 
ACTACAACTT 
TGGAATTTGT 
CAGCACAGTA 
ATGACCTGTG 



51 

I 

TGATGGTGGT 
GAAGTATGTA 
ACCTATTAAG 
TGGCTATTAT 
TTACAAGAAT 
TGTGTTGTGT 
CTTCCACCGA 
TGAAAAGCCC 
TGCCATGCCT 
TGAGCACGAC 
A 



Seq ID NO: 353 Protein sequence 
Protein Accession #: AAA36546 



11 



21 



31 



41 



51 



MAAQGEPQVQ FKLVLVGDGG TGKTTFVKRH LTGEFEKKYV ATLGVEVHPL VFHTNRGP I K 
FNVWDTAGQE KFGGLRDGYY IQAQCAIIMF DVTSRVTYKN VPNWHRDLVR VCENIPIVLC 
GNKVD I KDRK VKAKS I VFHR KKNLQYYDIS AKSNYNFEKP FLWLARKLIG DPNLEFVAMP 
ALAPPEWMD PALAAQYEHD LEVAQTTALP DEDDDL 



Seq ID NO: 354 DNA sequence 
Nucleic Acid Accession #: NM_002820 
Coding sequence: 304-831 



CCGGTTCGCA 
CCCTGTTCCA 
CGTGTAAACA 
TTCAGAGGAA 
GTTTGGAGAA 
ACGATGCAGC 
GTGCCCTCCT 
GAACATCAGC 
CTTCACCATC 
CCTAACTCCA 
GAGGGCAGAT 
AAGACACCTG 
AAACGGCGAA 
GACCACCTGT 
CTGGCCCGTA 
GCTTGGACAA 
CAGAGAATAA 
TGTCCTCCAG 
CATCAATCCT 
ATCTTCATAA 
TTCTTCAGTG 
GATATTATCT 
ACTTTTTATT 
TAAATTATGT 
CCAGCTCATA 
GGTTTTTCTC 
CCGTAGGAAA 



11 
I 

AAGAAGCTGA 
CGAACCCAGG 
CACTACTTAT 
GCGCCTCTGA 
AGCACAGTTG 
GGAGACTGGT 
GCGGGCGCTC 
TCCTCCATGA 
TGAT CGCAGA 
AGCCCTCTCC 
ACCTAACTCA 
GGAAGAAAAA 
CTCGCTCTGC 
CTGACACCTC 
GCCTCAGCGG 
ACCTAGAATT 
CTCAGAATAT 
CACCATAGAG 
TTACCACTCT 
TTTGCTGGAG 
TTTTTCATTT 
ACAAACACTG 
TAATTAAATG 
TTTAAACACA 
CAAAATAAAT 
ATGTATCTTT 
AATAAAACTT 



21 
I 

CTTCAGAGGG 
AGAACTGCTG 
CATTGATGCA 
TTTGTTTCTT 
GAGTAG CCGG 
TCAGCAGTGG 
GGTGGAGGGT 
CAAGGGGAAG 
AATCCACACA 
CAACACAAAG 
GGAAACTAAC 
GAAAGGCAAG 
CTGGTTAGAC 
CACAACGTCG 
GGTGCTCTCA 
TTCTCCCTTT 
TGTCTGCCTT 
AGGCGCTAGA 
ACCAAATAAT 
AAGTGTATTT 
CTTACGTTCT 
CAGAACAGCA 
TATTTAATTA 
TGCCTTAAAT 
GGTTTCTGAA 
TTGTTCATTG 
CACATTTAAA 



31 
I 

GGAAACTTTC 
GCCAGATTAA 
TATATAAAAC 
TTTTCCCTTT 
TTGCTAAATA 
AGCGTCGCGG 
CTCAGCCGCC 
TCCATCCAAG 
GCTGAAATCA 
AACCACCCCG 
AAGGTGGAGA 
CCCGGGAAAC 
TCTGGAGTGA 
CTGGAGCTCG 
GCTGGGTTTT 
ATGTATCTCT 
AAAGCAGTAC 
GCCCATTCCT 
TTCATATTCA 
CTTCCCCTTA 
TTCACTTCAA 
T CATGT CAT A 
AATCTCAAAT 
TTGTTTAATT 
AATGTTTAAG 
GCAAGATGAA 
AAAAA 



41 

i 

TTCTTTTAGG 
TTAGACATTG 
CATTTTATTT 
TTGCTCTTTC 
AGTCCCGAGC 
TGTTCCTGCT 
GCCTCAAAAG 
ATTTACGGCG 
GAGCTACCTC 
TCCGATTTGG 
CGTACAAAGA 
GCAAGGAGCA 
CTGGGAGTGG 
ATTCACGGTA 
GGAGCCTCCC 
ATCGATTGTG 
CCCCCTACCA 
CTTTCTCCAC 
AGCTTCAGAA 
CTCTCACACC 
GGGAGAATAT 
AACGATTCTG 
TTATTTTAAT 
AAATTTAACT 
TATTAACTTA 
ATAATTTTTC 



51 
I 

AGGCGGTTAG 
CTATGGGAGA 
TGGCTATTAT 
TGGCTGTGTG 
GCGAGCGGAG 
GAGCTACGCG 
AGCTGTGTCT 
ACGATTCTTC 
GGAGGTGTCC 
GTCTGATGAT 
GCAGCCGCTC 
GGAAAAGAAA 
GCTAGAAGGG 
ACAGGCTTCT 
TTCTGCCTTG 
TAGCAATTGA 
CACACACCCC 
CGTCACCCAA 
GCTAGTGACC 
TGGGCAAACT 
AGAAGCATTT 
AGCCATT CAC 
GTAAAGAACT 
CTGGTTTCTA 
CAAGGATATA 
TAGGGTAATG 



Seq ID NO: 355 Protein sequence 
Protein Accession #: NM_002820 

1 11 21 31 41 51 

I I I i I I 

MQRRLVQQWS VAVFLLSYAV PSCGRSVEGL SRRLKRAVSE HQLLHDKGKS IQDLRRRFFL 
HHLIAEIHTA EIRATSEVSP NSKPSPNTKN HPVRFGSDDE GRYLTQETNK VETYKEQPLK 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



60 
120 



319 



WO 02/086443 

TPGKKKKGKP GKRKEQEKKK RRTRSAWLDS GVTGSGLEGD HLSDTSTTSL ELDSR 



PCT/US02/12476 



Seq ID NO: 356 DNA sequence 
Nucleic Acid Accession #: NM_017522 
5 Coding sequence: 1-2100 

1 11 21 31 41 51 

I I I I I I 

ATGGGCCTCC CCGAGCCGGG CCCTCTCCGG CTTCTGGCGC TGCTGCTGCT GCTGCTGCTG 60 

10 CTGCTGCTGC TGCGGCTCCA GCATCTTGCG GCGGCAGCGG CTGATCCGCT GCTCGGCGGC 120 

CAAGGGCCGG CCAAGGAGTG CGAAAAGGAC CAATTCCAGT GCCGGAACGA GCGCTGCATC 180 

CCCTCTGTGT GGAGATGCGA CGAGGACGAT GACTGCTTAG ACCACAGCGA CGAGGACGAC 240 

TGCCCCAAGA AGACCTGTGC AGACAGTGAC TTCACCTGTG ACAACGGCCA CTGCATCCAC 300 

GAACGGTGGA AGTGTGACGG CGAGGAGGAG TGTCCTGATG GCT CCGATGA GTCCGAGGCC 360 

15 ACTTGCACCA AGCAGGTGTG TCCTGCAGAG AAGCTGAGCT GTGGACCCAC CAGCCACAAG 420 

TGTGTACCTG CCTCGTGGCG CTGCGACGGG GAGAAGGACT GCGAGGGTGG AGCGGATGAG 480 

GCCGGCTGTG CTACCTCACT GGGCACCTGC CGTGGGGACG AGTTCCAGTG TGGGGATGGG 540 

ACATGTGTCC TTGCAATCAA GCACTGCAAC CAGGAGCAGG ACTGTCCAGA TGGGAGTGAT 600 

GAAGCTGGCT GCCTACAGGG GCTGAACGAG TGT CTGCACA ACAATGGCGG CTGCTCACAC 660 

20 ATCTGCACTG ACCTCAAGAT TGGCTTTGAA TGCACGTGCC CAGCAGGCTT CCAGCTCCTG 720 

GACCAGAAGA CTTGTGGCGA CATTGATGAG TGCAAGGACC CAGATGCCTG CAGCCAGATC 780 

TGTGTCAATT ACAAGGGCTA TTTTAAGTGT GAGTGCTACC CTGGCTGCGA GATGGACCTA 840 

CTGACCAAGA ACTGCAAGGC TGCTGCTGGC AAGAGCCCAT CCCTAATCTT CACCAACCGC 900 

ACGAGTGCGG AGGATCGACC TGTGAAGCGG AACTATTCAC GCCTCATCCC CATGCTCAAG 960 

25 AATGTCGTGG CACTAGATGT GGAAGTTGCC ACCAATCGCA TCTACTGGTG TGACCTCTCC 1020 

TACCGTAAGA TCTAT AG CGC CTACATGGAC AAGGCCAGTG ACCCGAAAGA GCGGGAGGTC 1080 

CTCATTGACG AGCAGTTGCA CTCTCCAGAG GGCCTGGCAG TGGACTGGGT CCACAAGCAC 1140 

AT CTACTGG A CTGACTCGGG CAATAAGACC AT CTCAGTGG CCACAGTTGA TGGTGGCCGC 1200 

CGACGCACTC TCTTCAGCCG TAACCTCAGT GAACCCCGGG CCATCGCTGT TGACCCCCTG 1260 

30 CGAGGGTTCA TGTATTGGTC TGACTGGGGG GACCAGGCCA AGATTGAGAA ATCTGGGCTC 1320 

AACGGTGTGG ACCGGCAAAC ACTGGTGTCA GACAATATTG AATGGCCCAA CGGAATCACC 1380 

CTGGATCTGC TGAGCCAGCG CTTGTACTGG GTAGACTCCA AGCTACACCA ACTGTCCAGC 1440 

ATTGACTTCA GTGGAGGCAA CAGAAAGACG CTGATCTCCT CCACTGACTT CCTGAGCCAC 1500 

CCTTTTGGGA TAGCTGTGTT TGAGGACAAG GTGTTCTGGA CAGACCTGGA GAACGAGGCC 1560 

35 ATTTTCAGTG CAAATCGGCT CAATGGCCTG GAAATCTCCA TCCTGGCTGA GAACCTCAAC 162 0 

AACCCACATG ACATTGT CAT CTTCCATGAG CTGAAGCAGC CAAGAGCTCC AGATGCCTGT 1680 

GAGCTGAGTG TCCAGCCTAA TGGAGGCTGT GAATACCTGT GCCTTCCTGC TCCTCAGATC 1740 

TCCAGCCACT CTCCCAAGTA CACATGTGCC TGTCCTGACA CAATGTGGCT GGGTCCAGAC 1800 

ATGAAGAGGT GCTACCGAGA TGCAAATGAA GACAGTAAGA TGGGCTCAAC AGTCACTGCC 1860 

40 GCTGTTATCG GGAT CATCGT GCCCATAGTG GTGATAGCCC TCCTGTGCAT GAGTGGATAC 192 0 

CTGATCTGGA GAAACTGGAA GCGGAAGAAC ACCAAAAGCA TGAATTTTGA CAACCCAGTC 1980 

TACAGGAAAA CAACAGAAGA AGAAGATGAA GATGAGCTCC ATATAGGGAG AACTGCTCAG 2040 

ATTGGCCATG TCTATCCTGC ACGAGTGGCA TTAAGCCTTG AAGATGATGG ACTACCCTGA 2100 

GGATGGGATC ACCCCCTTCG TGCCTCATGG AATTCAGTCC CATGCACTAC ACTCCGGATG 2160 

45 GTGTATGACT GGATGAATGG GTTTCTATAT ATGGGTCTGT GTGAGTGTAT GTGTGTGTGT 2220 

GATTTTTTTT TTTAAATTTA TGTTGCGGAA AGGTAACCAC AAAGTTATGA TGAACTGCAA 2280 

ACATCCAAAG GATGTGAGAG TTTTTCTATG TATAATGTTT TATACACTTT TTAACTGGTT 2340 

GCACTACCCA TGAGGAATTC GTGGAATGGC TACTGCTGAC TAACATGATG CACATAACCA 2400 

AATGGGGGCC AATGGCACAG TACCTTACTC ATCATTTAAA AACTATATTT ACAGAAGATG 24 60 

50 TTTGGTTGCT GGGGGGCTTT TTTAGGTTTT GGGCATTTGT TTTTTGTAAA TAAGATGATT 2520 
ATGCTTTGTG GCTATCCATC AACATAAGT 

Seq ID NO: 357 Protein sequence 

NP_059992 

21 31 41 51 

I I I I 

LLLLRLQHLA AAAADPLLGG QGPAKECEKD QFQCRNERCI 60 

CPKKTCADSD FTCDNGHCIH ERWKCDGEEE CPDGSDESEA 12 0 

CVPASWRCDG EKDCEGGADE AGCATSLGTC RGDEFQCGDG 180 

EAGCLQGLNE CLHNNGGCSH ICTDLKIGFE CTCPAGFQLL 240 

CVNYKGYFKC ECYPGCEMDL LTKNCKAAAG KSPSLIFTNR 300 

NWALDVEVA TNRIYWCDLS YRKIYSAYMD KASDPKEREV 360 

IYWTDSGNKT I SVATVDGGR RRTLFSRNLS EPRAIAVDPL* 420 

NGVDRQTLVS DNIEWPNGIT LDLLSQRLYW VDSKLHQLSS 480 

PFGIAVFEDK VFWTDLENEA IFSANRLNGL EI S ILAENLN 540 

ELSVQPNGGC EYLCLPAPQI SSHSPKYTCA CPDTMWLGPD 600 

AVIGIIVPIV VIALLCMSGY LIWRNWKRKN TKSMNFDNPV 660 
I GHVYPARVA LSLEDDGLP 

Seq ID NO: 358 DNA sequence 
Nucleic Acid Accession #: M27826 
Coding sequence: <l-503 

75 1 11 21 31 41 51 

I I 1 I 1 I 

AGCCCAAGAA ACATCTCACC AATTTCAAAT CTGATCTATT CGGCTTAGCG ACTGAAGATT 60 

GACGCTGCCC GATCGCCTCG GAAGTCCCCT GGACCATCAC AGAAGCCGAG CTT CGGGTAA 120 

CTCT CACAGT GGAGGGTAAG TCCATCCCCT GTTTAATCGA TACGGGGGCT ACCCACTCCA 180 

80 CGTTGCCTTC TTTTCAAGGG CCTGTTTCCC TTGCCCCCAT AACTGTTGTG GGTATTGACG 240 

GCCAAGCTTC AAAACCCCTG AAAACTCCCC CACTCTGGTG CCAACTTGGA CAACACTCTT 300 

TTATGCACTC TTTTTTAGTT ATCCCCACCT GCCCACTTCC CTTATTAGGC CGAAATATTT 3 60 

TAACCAAATT ATCTGCTTCC CTGACTATTC CTGGAGTACA GCTACATCTC ATTGCTGCCC 420 

TTCTTCCCAA TCCAAAGCCT CCTTTGTGTC CTCTAACATC CCCACAATAT CAGCCCTTAC 480 

85 CACAAGACCT CCCTTCAGCT TAATCTCTCC CACT CTAGGT TCCCACGCCG CCCCTAATCC 540 

CACTTGAAGC AGCCCTGAGA AACATCGCCC ATTCTCTCTC CATACCACCC CCCAAAAATT 600 

TTCGCCGCTC CAACACTTCA ACACTATTTT GTTTTATTTG TCTTATTAAT ATCAGAAGGC 660 



55 



Protein Accession #: 



1 11 

I I 
MGLPEPGPLR LLALLLLLLL 
PSVWRCDEDD DCLDHSDEDD 

60 TCTKQVCPAE KLSCGPTSHK 

TCVLAIKHCN QEQDCPDGSD 
DQKTCGDIDE CKDPDACSQI 
TSAEDRPVKR NYSRLIPMLK 
LIDEQLHSPE GLAVDWVHKH 

65 RGFMYWSDWG DQAKIEKSGL 

IDFSGGNRKT LISSTDFLSH 
NPHDIVIFHE LKQPRAPDAC 
MKRCYRDANE DSKMGSTVTA 
YRKTTEEEDE DELHI GRTAQ 
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AGGAATGTCA GGCCTCTGAG CCCAGGCCAG GCCAT CGCAT 
ACATCCAGAT GGCCTGAAGT AACTGAAGAT CCACAAAAGA 
ATGACATTCC ACCATTGTGA TTTGTTCCTG CCCCACCCTA 
ATCTCCCCCA CCCTTAAGAA GGTTCTTTGT AATTCTCCCC 
TGAGATCCAC CCCTGCCCAC CAGAGAACAA CCCCCTTTGA 
CCCAAATCCT ATAAAACAGC CCCACCCCTA TCTTCCTTCA 
AGCCACGGGC ACCCAGGTGA AATAAACAGC TTTATTGCTC 



PCT/US02/12476 



LLLL J. U 1 VJrtL 


X J. Vjl_.rtV_.V3 lni 


72 0 


AG T AAAAAC A 


GCCTTAACTG 


780 


ACTGATCAAT 


GTACTTTGTA 


840 


ACCCTTGAGA 


ATGTACTTTG 


900 


TTGTAATTTT 


TTATTACCTT 


960 


CTGACTCTCT 


TTTCGGACT C 


1020 


AC 







10 



Seq ID NO: 359 Protein sequence 
Protein Accession #: AAA65999 



1 11 21 31 41 51 

I 1 I I I I 

PKKHLTNFKS DLFGLATEDW RCPIASEVPW TITEAELRVT LTVEGKSIPC LIDTGATHST 

15 LPSFQGPVSL APITWGIDG QASKPLKTPP LWCQLGQHSF MHSFLVIPTC PLPLLGRNIL 

TKLSASLTIP GVQLHLIAAL LPNPKPPLCP LTSPQYQPLP QDLPSA 



60 
120 



Seq ID NO: 360 DNA sequence 
Nucleic Acid Accession #: NM_001854 
20 Coding sequence: 162-5582 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



80 



85 



1 
I 

AAC CAT C AAA 
TGTGTAGCAA 
TTCGTGGGTT 
GGTGGAAAAC 
TCTTCCAAGC 
TTCACAATTC 
CTAAAGGCTC 
AACAGTT AT T 
CAAAAAAAGG 
TTGGTGTTGA 
CCCCAGAAGA 
TAGCAATCAG 
CGAAACCACT 
GAACAAGGAT 
GTGATCCCAA 
CACCCAAGGC 
TCGAATATGA 
GACCCACTGT 
AAGAATACAA 
GGACAAATGA 
ATTATGATTC 
GCAGGGATTC 
AAGAATATGA 
CAGAAACTGA 
AGAAAGGAGA 
CAGGACCTGC 
ACCCTGGCGA 
GTCCTCCTGG 
CAACCATCTC 
TGAGAGGCCC 
GTTCATCTGG 
AGGGTCCCCC 
GAAGAGGAAT 
GTCTGCCAGG 
CTGGTGATGA 
AAGCTGGCCC 
GTATGGCAGG 
CTGGGCCTCC 
CAATTGGTCC 
GTGCTGATGG 
TGGGTCCCCC 
CAGATGGTGT 
GATTCAAAGG 
GAGGGNAAGA 
CTTCAGGTCA 
GAAGACAAGG 
AAGGTGCACG 
CTCGAGGTTC 
GTGGCGATGG 
TTGGATTCCC 
ACCCTGGGCA 
GAGTGGTTGG 
CTGGTCCTCC 
CAAAGGGTGA 
GTTTCCCAGG 
AAGGTCCCCA 
CAGCTGGCCC 
GAGAGAAAGG 
AAGGTCCTGT 
ACAAGGGTGA 
GCCCTCCCGG 
ATGGTGAACC 
CCAGAGGCTT 
GTGAAAAAGG 



11 
I 

TTTAGAAGAA 
ACATCCCTGG 
GAGTTCACAG 
GAAACGGTGG 
TAGAGAGGTC 
TCCAGAGGGA 
AGATACTGCT 
TCCAGGTGGA 
AATTCAGTCT 
GGTTGGGAGA 
CTATCCCCTC 
CGTGGAGAAG 
TGATAGAAGT 
TTTGGATGAA 
GGCAGCATAT 
TGCTCAAGCT 
CTATGAGTAT 
AACTGAGGAG 
CTATGGAACA 
GCCAAATCCA 
CCAGAGGAAA 
TGATCTTCTG 
AGATAAACCA 
TATTACAGAA 
ACCAGCAGTG 
AGGTATTATG 
TAGGGGCCCC 
TACTATGTTG 
TGCTCAGGAA 
ACCTGGCCCA 
GGCCAAAGGT 
TGGTCCAACG 
GCCAGGAGAA 
TGACAAAGGT 
TGGAATGAGG 
ACGAGGTTTG 
TGTAGATGGC 
AGGTCAACAA 
TCCTGGTGAA 
GCCTCCTGGT 
TGGTCCACAA 
CAGAGGTCTC 
TGACATGGGT 
TGGCCCTGAA 
AG CAGGAGAA 
TCCAAAGGGT 
GGGAGTAGCT 
AAGAGGTGCA 
CCCTCCTGGC 
TGGACCAAAA 
ACGTGGGGAG 
ACCACAGGGA 
TGGCCCTCCT 
TCCAGGT CCT 
GGAAAGAGGT 
GGGCCCACCA 
AATTGGTTTA 
TGCTCCTGGA 
TGGTCTCCCA 
AATTGGTGAG 
TCCCCCAGGT 
AGGTCCTAGA 
CCCTGGACCT 
TGAAAATGGG 



21 

I 

AAAGCCCTTT 
CGATACCTTG 
TTGTGAGTGC 
CTCTGGGATT 
AGAGGAGCTG 
AT AT C AAAAA 
TACAGAGTTT 
ACTTTCCCAG 
TTCCTTTTAT 
TCACCTGTTT 
TTCAGAACTG 
AAAACTGTGA 
GAGAGAGCAA 
GAAGTTTTTG 
GACTACTGTG 
CAGGAACCTC 
GGGGAAGCAG 
ACAATAGCAC 
ATGGAAAGTT 
GTTGAAGAAA 
AAT TCTGAGG 
GTAGATGGAG 
ACAAGCCCCC 
ACAAGCATAA 
GTTGAGCCTG 
GGTCCTCCAG 
CCAGGACGTC 
ATGTTACCGT 
GCTCAGGCTC 
ATGGGTCTAA 
GAGAGTGGTG 
GGAAAACCTG 
CCTGGGGCAA 
CACAGGGGTG 
GGAGAAGATG 
CTGGGTCCAA 
CCCCCAGGAC 
GGGAATCCAG 
AAAGGACCAC 
CATCCTGGGA 
GGTCCTATTG 
AAGGGATCTA 
CTAAAAGGTG 
GGACCCAAAG 
AAGGGAAAAC 
TCCACTGGAT 
GGCAAACCAG 
AGAGGTCCCA 
CCTCCAGGTG 
GGCCCTCCTG 
ACTGGATTTC 
CCAACCGGTG 
GGTGAGCAAG 
CAAGGTATCT 
CTTCCTGGAG 
GGTCCAGTTG 
CGAGGGCGCC 
GAAAAAGGTC 
GGGCCAGCTG 
CCGGGACAAA 
CTTCAAGGAC 
GGACAGCAGG 
CCTGGTCCAA 
GATGTTGGTC 



31 
I 

GACTTTTTCC 
GAAAGGACGA 
GGGGCTCGGA 
TCACCGTAAC 
CTCCAGTTGA 
CAACGGGATT 
CAAAGCAAGC 
AAGACTTTTC 
CTATATATAA 
TTCTGTTTGA 
TTAACATCGC 
CAATGATTGT 
TTGTTGATAC 
AGGGGGACAT 
AGCATTATAG 
AGATAGATGA 
AGTATAAAGA 
AGACGGAGGC 
ACCAGACAGA 
TATTTACTGA 
ATACACTATA 
ATTTAGGCGA 
CTAATGAAGA 
ATGGCCATGG 
GTATGCTTGT 
GTCTACAAGG 
CTGGCTTACC 
TCCGTTATGG 
AAGCTATTCT 
CTGGAAGACC 
ATCCAGGTCC 
GAAAAAGGGG 
AGGGAG AT CG 
AACGAGGTCC 
GAGAAATTGG 
GGGGAACTCC 
CAAAAGGGAA 
GACCTCAGGG 
AAGGAAAACC 
AAGAAGGCCA 
GATNNCCGGG 
AAGGTGAAAA 
ACAGAGGAGA 
GTCGAGCAGG 
TTGGAGTTCC 
TCCCTGGGTT 
GCCCTCGGGG 
CTGGGAAACC 
AAAGAGGTCC 
GACCACCAGG 
AAGGCAAGAC 
AGACTGGTCC 
GTCTTCCTGG 
CAGGGAAAGA 
CTCAGGGTGC 
GCTCACCAGG 
CGGGACCTCA 
CCCAAGGGCC 
GTCCTGCCGG 
AAGGCAGCAA 
CAGTTGGTGC 
GGATGTTTGG 
TAGGTCTTCA 
CATGGGGGCC 



41 
I 

CCCTCTCCCT 
AGTTGGTCTG 
GATGGAGCCG 
AACCCTCGCA 
TGTACTAAAA 
TTGCACAAAC 
ACAACTCAGT 
AATACTATTT 
TGAGCATGGT 
AGACCACACT 
TGACGGGAAG 
TGATTGTAAG 
CAATGGAATC 
TCAGCAGTTT 
TCCAGACTGT 
GTATGCACCA 
GGCTGAAAGT 
AAACATCGTT 
AGCTCCTAGG 
AGAATATCTA 
TGAAAACAAA 
ATATGATTTT 
ATTTGGTCCA 
TGCATATGGA 
CGAAGGACCA 
CCCCACTGGA 
AGGGGCTGAT 
TGGTG AT GGT 
TCAGCAGGCT 
AGGTCCTGTG 
TCAGGGCCCT 
TCGTCCAGGT 
AGGGTTTGAT 
TCAAGGTCCT 
ACCAAGAGGT 
AGGAGCTCCA 
CATGGGTCCC 
TCTTCCTGGT 
AGGACTTGCT 
GTCTGGAGAA 
CCCCCGGGGA 
GGGTGAAGAT 
AGTTGGT CAA 
CCCAACTGGA 
AGGATTACCA 
TCCAGGTGCC 
TCAGCGTGGT 
TGGGCCAAAG 
TCAAGGACCT 
AAGGATGGGC 
CGGCCCTCCT 
AATAGGGGAA 
TGCTGCAGGA 
TGGACCAGCA 
ACCTGGACTG 
AGAACGTGGG 
GGGTCCTCCT 
TGCAGGGAGA 
CTCCCCTGGG 
GGGTGG CAAG 
CCCTGGAATT 
GCAAAAAGGT 
GGGTCTGCCA 
ACCTGGTCCT 



51 

I 

CCCCAATGGC 
CAGTCGCAAT 
TGGTCCTCTA 
TTGACCTTCC 
GCACTAGATT 
AGAAAGAATT 
GCCCCAACAA 
ACAGTAAAAC 
ATTCAGCAAA 
GGAAAACCTG 
TGGCATCGGG 
AAGAAAACCA 
ACGGTTTTTG 
TTGATCACAG 
GACTCTTCAG 
GAGGATATAA 
GTAACAGAGG 
GATGATTTTC 
CATGTTTCTG 
ACGGGAGAGG 
GAAATAGACG 
TATGAATATA 
GGTGTACCAG 
GAGAAAGGAC 
CCAGGACCAG 
CCCCCTGGTG 
GGTCTACCTG 
TCCAAAGGAC 
CGGATTGCTC 
GGGGGGCCTG 
CGAGGCGTCC 
GCAGATGGAG 
GGACTTCCGG 
CCAGGTCCTC 
CTTCCAGGTG 
GGGCAGCCTG 
CAAGGGGAGC 
CCACAAGGTC 
GGACTTCCTG 
AAGGGGGCTC 
GTAAAGGGAG 
GGTTTTCCAG 
ATTGGCCCAA 
GACCCAGGTC 
GGATATCCAG 
AATGGAGAGA 
CCAACGGGTC 
GGCACTTCAG 
CAGGGTCCAG 
TGCCCAGGAC 
GGGCCAGGGG 
CGTGGGTATC 
AAAGAAGGTG 
GGATTACGTG 
AAAGGAGGGG 
TCAG CAGGT A 
GGTCCAGCTG 
GATGGAGTTC 
GAAGACGGAG 
GGAGAAAATG 
GCTGGAGGTG 
GATGAGGGTG 
GGCCCACCTG 
CCAGGCCCAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
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GAGGCCCTCA AGGTCCCAAT GGAGCTGATG GACCACAAGG ACCCCCAGGT TCTGTTGGTT 3900 

CAGTTGGTGG TGTTGGAGAA AAGGGTGAAC CTGGAGAAGC AGGAAACCCA GGGCCTCCTG 3 960 

GGGAAGCAGG TGTAGGCGGT CCCAAAGGAG AAAGAGGAGA GAAAGGGGAA GCTGGTCCAC 4020 

CTGGAGCTGC TGGACCTCCA GGTGCCAAGG GGCCGCCAGG TGATGATGGC CCTAAGGGTA 4080 

ACCCGGGTCC TGTTGGTTTT CCTGGAGATC CTGGTCCTCC TGGGGAACTT GGCCCTGCAG 4140 

GTCAAGATGG TGTTGGTGGT GACAAGGGTG AAGATGGAGA TCCTGGTCAA CCGGGTCCTC 42 0 0 

CTGGCCCATC TGGTGAGGCT GGCCCACCAG GTCCTCCTGG AAAACGAGGT CCTCCTGGAG 42 60 

CTGCAGGTGC AGAGGGAAGA CAAGGTGAAA AAGGTGCTAA GGGGGAAGCA GGTGCAGAAG 4320 

GTCCTCCTGG AAAAACCGGC CCAGTCGGTC CTCAGGGACC TGCAGGAAAG CCTGGTCCAG 4380 

AAGGTCTTCG GGGCATCCCT GGTCCTGTGG GAGAACAAGG TCTCCCTGGA GCTGCAGGCC 4440 

AAGATGGACC ACCTGGTCCT ATGGGACCTC CTGGCTTACC TGGTCTCAAA GGTGACCCTG 4500 

GCTCCAAGGG TGAAAAGGGA CATCCTGGTT TAATTGGCCT GATTGGTCCT CCAGGAGAAC 4560 

AAGGGGAAAA AGGTGACCGA GGGCTCCCTG GAACTCAAGG ATCTCCAGGA GCAAAAGGGG 4620 

ATGGGGGAAT TCCTGGTCCT GCTGGTCCCT TAGGTCCACC TGGTCCTCCA GGCTTACCAG 4 680 

GTCCTCAAGG CCCAAAGGGT AACAAAGGCT CTACTGGACC CGCTGGCCAG AAAGGTGACA 4740 

GTGGTCTTCC AGGGCCTCCT GGGCCTCCAG GTCCACCTGG TGAAGTCATT CAGCCTT TAC 4800 

CAATCTTGTC CTCCAAAAAA ACGAGAAGAC ATACTGAAGG CATGCAAGCA GATGCAGATG 4860 

ATAATATTCT TGATTACTCG GATGGAATGG AAGAAATATT TGGTTCCCTC AATT CCCTGA 4920 

AACAAGACAT CGAGCATATG AAATTTCCAA TGGGTACTCA GACCAATCCA GCCCGAACTT 4980 

GTAAAGACCT GCAACTCAGC CATCCTGACT TCCCAGATGG TGAATATTGG ATTGATCCTA 5040 

ACCAAGGTTG CTCAGGAGAT TCCTTCAAAG TTTACTGTAA TTTCACATCT GGTGGTGAGA 5100 

CTTGCATTTA TCCAGACAAA AAATCTGAGG GAGTAAGAAT TTCATCATGG CCAAAGGAGA 5160 

AACCAGGAAG TTGGTTTAGT GAATTTAAGA GGGGAAAACT GCTTTCATAC TTAGATGTTG 5220 

AAGGAAATTC CATCAATATG GTGCAAATGA CATTCCTGAA ACTTCTGACT GCCTCTGCTC 52 80 

GGCAAAATTT CACCTACCAC TGTCAT CAGT CAGCAGCCTG GTATGATGTG TCATCAGGAA 5340 

GTTATGACAA AGCACTTCGC TTCCTGGGAT CAAATGATGA GGAGATGTCC TATGACAATA 5400 

ATCCTTTTAT CAAAACACTG TATGATGGTT GTACGTCCAG AAAAGGCTAT GAAAAAACTG 5460 

TCATTGAAAT CAATACACCA AAAATTGATC AAGTACCTAT TGTTGATGTC ATGATCAGTG 5520 

ACTTTGGTGA TCAGAATCAG AAGTTCGGAT TTGAAGTTGG TCCTGTTTGT TTTCTTGGCT 5580 

AAGATTAAGA CAAAGAACAT ATCAAATCAA CAGAAAATGT ACCTTGGTGC CACCAACCCA 5640 

TTTTGTGCCA CATGCAAGTT TTGAATAAGG ATGTATGGAA AACAACGCTG CATATACAGG 5700 

TACCATTTAG GAAATAC CGA TGCCTTTGTG GGGGCAGAAT CACAGACAAA AGCTTTGAAA 5760 

ATCATAAAGA TATAAGTTGG TGTGGCTAAG ATGGAAACAG GGCTGATTCT TGATTCCCAA 582 0 

TTCTCAACTC TCCTTTTCCT ATTTGAATTT CTTTGGTGCT GTAGAAAACA AAAAAAGAAA 5880 

AATATATATT CATAAAAAAT ATGGTGCTCA TTCTCATCCA TCCAGGATGT ACTAAAACAG 5940 

TGTGTTTAAT AAATTGTAAT TATTTTGTGT ACAGTTCTAT ACTGTTATCT GTGTCCATTT 6000 

CCAAAACTTG CACGTGTCCC TGAATTCCGC TGACT CTAAT TTATGAGGAT GCCGAACTCT 6060 

GATGG CAAT A ATATATGTAT TATGAAAATG AAGTTATGAT TTCCGATGAC CCTAAGTCCC 6120 
TTTCTTTGGT TAATGATGAA ATTCCTTTGT GTGTGTTT 

Seq ID NO: 361 Protein sequence 
Protein Accession #: NP_001845 

1 11 21 31 41 51 

1 I 1 I I I 

MEPWSSRWKT KRWLWDFTVT TLALTFLFQA REVRGAAPVD VLKALDFHNS PEGISKTTGF 60 

CTNRKNSKGS DTAYRVSKQA QLSAPTKQLF PGGTFPEDFS ILFTVKPKKG IQSFDLSIYN 120 

EHGIQQIGVE VGRSPVFLFE DHTGKPAPED YPLFRTVNIA DGKWHRVAIS VEKKT VTM I V 180 

DCKKKTTKPL DRSERAIVDT NGITVFGTRI LDEEVFEGDI QQFLITGDPK AAYDYCEHYS 240 

PDCDSSAPKA AQAQEPQIDE YAPEDIIEYD YEYGEAEYKE AESVTEGPTV TEETIAQTEA 3 00 

NIVDDFQEYN YGTMESYQTE APRHVSGTNE PNPVEEIFTE EYLTGEDYDS QRKNSEDTLY 360 

ENKEIDGRDS DLLVDGDLGE YDFYEYKEYE DKPTSPPNEE FGPGVPAETD ITETSINGHG 420 

AYGEKGQKGE PAWEPGMLV EGPPGPAGPA GIMGPPGLQG PTGPPGDPGD RGPPGRPGLP 480 

GADGLPGPPG TMLMLPFRYG GDGSKGPTIS AQEAQAQAIL QQARIALRGP PGPMGLTGRP 540 

GPVGGPGSSG AKGESGDPGP QGPRGVQGPP GPTGKPGKRG RPGADGGRGM PGEPGAKGDR 600 

GFDGLPGLPG DKGHRGERGP QGPPGPPGDD GMRGEDGEIG PRGLPGEAGP RGLLGPRGTP 660 

GAPGQPGMAG VDGPPGPKGN MGPQGEPGPP GQQGNPGPQG LPGPQGPIGP PGEKGPQGKP 720 

GLAGLPGADG PPGHPGKEGQ SGEKGALGPP GPQGPIGXPG PRGVKGADGV RGLKGSKGEK 780 

GEDGFPGFKG DMGLKGDRGE VGQIGPRGXD GPEGPKGRAG PTGDPGPSGQ AGEKGKLGVP 840 

GLPGYPGRQG PKGSTGFPGF PGANGEKGAR GVAGKPGPRG QRGPTGPRGS RGARGPTGKP 90 0 

GPKGTSGGDG PPGPPGERGP QGPQGPVGFP GPKGPPGPPG RMGCPGHPGQ RGETGFQGKT 960 

GPPGPGGWG PQGPTGETGP IGERGYPGPP GPPGEQGLPG AAGKEGAKGD PGPQGISGKD 102 0 

GPAGLRGFPG ERGLPGAQGA PGLKGGEGPQ GPPGPVGSPG ERGSAGTAGP IGLRGRPGPQ 1080 

GPPGPAGEKG APGEKGPQGP AGRDGVQGPV GLPGPAGPAG SPGEDGDKGE IGEPGQKGSK 1140 

GGKGENGPPG PPGLQGPVGA PGIAGGDGEP GPRGQQGMFG QKGDEGARGF PGPPGPIGLQ 12 00 

GLPGPPGEKG ENGDVGPWGP PGPPGPRGPQ GPNGADGPQG PPGSVGSVGG VGEKGEPGEA 12 60 

GNPGPPGEAG VGGPKGERGE KGEAGPPGAA GPPGAKGPPG DDGPKGNPGP VGFPGDPGPP 132 0 

GELGPAGQDG VGGDKGEDGD PGQPGPPGPS GEAGPPGPPG KRGPPGAAGA EGRQGEKGAK 13 80 

GEAGAEGPPG KTGPVGPQGP AGKPGPEGLR GIPGPVGEQG LPGAAGQDGP PGPMGPPGLP 1440 

GLKGDPGSKG EKGHPGLIGL IGPPGEQGEK GDRGLPGTQG SPGAKGDGGI PGPAGPLGPP 1500 

GPPGLPGPQG PKGNKGSTGP AGQKGDSGLP GPPGPPGPPG EVIQPLPILS SKKTRRHTEG 1560 

MQADADDNIL DYSDGMEEIF GSLNSLKQDI EHMKFPMGTQ TNPARTCKDL QLSHPDFPDG 162 0 

EYWIDPNQGC SGDSFKVYCN FTSGGETCIY PDKKSEGVRI SSWPKEKPGS WFSEFKRGKL 1680 

LSYLDVEGNS INMVQMTFLK LLTASARQNF TYHCHQSAAW YDVSSGSYDK ALRFLGSNDE 1740 

EMS YDNNP F I KTLYDGCTSR KGYEKTVIEI NTPKIDQVPI VDVMISDFGD QNQKFGFEVG 1800 
PVCFLG 



Seq ID NO: 3 62 DNA sequence 
Nucleic Acid Accession #: NM_003107 
Coding sequence: 351-1775 

1 11 21 31 41 51 

1 | I I I 1 

TTCCCCAGCA TTCGAGAAAC TCCTCTCTAC TTTAGCACGG TCTCCAGACT CAGCCGAGAG 60 
ACAGCAAACT GCAGCGCGGT GAGAGAGCGA GAGAGAGGGA GAGAGAGACT CTCCAGCCTG 120 
GGAACTATAA CTCCTCTGCG AGAGGCGGAG AACTCCTTCC CCAAATCTTT TGGGGACTTT 180 
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TCTCTCTTTA CCCACCTCCG CCCCTGCGAG GAGTTGAGGG 
CGTCTTCCCG TTCGGCGTGT GCTTGGCCCG GGGAACCGGG 
CGGCCGCCGC GAGGGTGTGA GCGCGCGTGG GCGCCCGCCG 
AAACCAACAA TGCCGAGAAC ACGGAAGCGC TGCTGGCCGG 
CCGGCCTCGA GCTGGGAATC GCCTCCTCCC CCACGCCCGG 
GCAAGGCCGA CGACCCGAGC TGGTGCAAGA CCCCGAGTGG 
ACGCCTTCAT GGTGTGGTCG CAGATCGAGC GGCGCAAGAT 
TGCACAACGC CGAGATCTCC AAGCGGCTGG GCAAACGCTG 
ACAAG AT CCC TTTCATTCGA GAGGCGGAGC GGCTGCGCCT 
CCGACTACAA GTACCGGCCC AGGAAGAAGG TGAAGTCCGG 
CGGCCGCCGC CTCCTCCAAG CCGGGGGAGA AGGGAGACAA 
GCGGCCATGG GGGCGGCGGC GGCGGCGGGA GCAGCAACGC 
CGAGTGGCGG CGGCGCCAAC TCCAAACCGG CGCAGAAAAA 
CGGGCGGCGC GGGCGGTGGG GTTAGCAAAC CGCACGCCAA 
GCGGCGGCGG GAAAGCAGCG GCTGCCGCCG CCGCCTCCTT 
CCGCCGCCCT GCTGCCCCTG GGCGCCGCCG CCGACCACCA 
CTCCCAGCGC CTCGGCCTCC GCCTCCTCGG CAGCCTCGGC 
CGGGCAAGCA CCTGGCGGAG AAGAAGGTGA AGCGCGTCTA 
CGTCGTCGTC GCCCGTGGGC GGCGTGGGCG CGGGAGCCGA 
TGTACGAGGA GGAGGGCGCG GGCTGCTCGC CCGACGCGCC 
GCGCCGCCTC GTCCCCCGCC GCCGGCCGCT CGCCCGCCGA 
TGCGCGCCGC CTCGCCCGCC CCGTCCAGCG CGCCCTCGCA 
CCCACTCCTC CTCTTCCTCC TCCTCGGGCT CCTCGTCCTC 
ACCTGCTCGA CCTGAACCCC AGCTCAAACT TTGAGAGCAT 
CGTCGTCGGC GCTCGACCGG GACCTGGATT TTAACTTCGA 
TCGAGTTCCC GGACTACTGC ACGCCCGAGG TGAGCGAGAT 
AGTCCAGCAT CTCCAACCTG GTTTTCACCT ACTGAAGGGC 
CGGGGGGGGT AGGAGAGGAG AAAAAAAAAG TGAAAAAAAG 
GAGTTTAAAG AGAAAAGGGA AAAAAGAAAG AAAAAGTAAG 
TCTCGTCGTC GGAT CAAGGA GCGCGGCGGC GTTTTGGACC 
TCCCGGGCCG GGGACCCACT CTGCCCAGCC GGAGGGACGC 
AGGGGCGACC TGTGATTGTT GTTATTGATG TTGTTGTTGA 
TTCGAGTTTG CTCCCCTTTG CTTGAAGAGA CCCCCTCCCC 
TTGTCTGCAC CCCCAGCAAG AAGGCGAGTT AGTTTT CTAG 
CTTCCTGCAT CACCACCTTG GTTTTGTTTT ATTTTGCTTC 
GAACCCAGCG CACCCCTCCC CCCCTTTTTT TAAACGCGTG 
GGTGACGAAT TTGGCCGATG GCAGATGTTT TGGGGGAACG 
GCAGGCGAAT TCCCGTTTGG GGCCTTTTTT TCCTCCCTCT 
AGCCGGAGGA GGAGATGTTG AGGGGAGGAG GCCAGCCAGT 
ACCCGAGAAC CCCGTTGGAA GCGCAGCAGC GGGAGCTAGG 
CGAACTGGAA GGGGGTTCAC GGTCAAACTG AAATGGATTT 
CGGCGGCTGC TGGGCCTCCG CCTTCTTTTC TACGTGAAAT 
GACCCCGGAG GCGTGGAGGA GAGGAGACTG TTTGATGTGG 
GGCGAGTGGT TTCGGAAAAA AAAAAAGAAA AAAAGGG 

Seq ID NO: 363 Protein sequence 
Protein Accession ft: NP_003098 



PCT/US02/12476 



MVQQTNNAEN 
RPMNAFMVWS 
ADYPDYKYRP 
GGGASGGGAN 
QAGAAALLPL 
GLGTSSSPVG 
YASLRAASPA 
SFSSSSALDR 



11 

I 

TE ALL AGES S 
QIERRKIMEQ 
RKKVKSGNAN 
SKPAQKKSCG 
GAAADHH SLY 
GVG AG AD PSD 
PSSAPSHASS 
DLDFNFEPGS 



21 
I ' 

DSGAGLELGI 
SPDMHNAEIS 
SSSSAAASSK 
SKVAGGAGGG 
KARTPSASAS 
PLGLYEEEGA 
SASSHSSSSS 
GSHFEFPDYC 



31 

1 

ASSPTPGSTA 
KRLGKRWKLL 
PGEKGDKVGG 
VSKPHAKLIL 
ASSAASASAA 
GCSPDAPSLS 
SSGSSSSDDE 
TPEVSEMISG 



Seq ID NO: 3 64 DNA sequence 
Nucleic Acid Accession #: U10860 
Coding sequence: 123-2204 



TGCCGGCTGC 
GGCACCCTCC 
CGATGGCTCT 
GCCACCACCA 
TCATAGACCG 
CAGCATTTGC 
CTGTGTATGC 
TTCTTGGAAT 
AAAAAAGTGT 
TCAGGGGCCT 
TAGCTGATGG 
AATCTAAAAA 
AAGTAATACT 
AGAACAGAGA 
TGGTTTTACT 
TGAACCAAGA 
GCCAGTCTGT 
CTCATTCTTT 
GGAAAAGAAT 
TTGGGGATAC 
AGGAGGTTTT 



11 
I 

TCCTCGACCA 
CGCCCCGTCT 
GTGCAACGGA 
CTATGAAGGA 
AAGAGTGAGG 
TATAAAGGAA 
TGAAGATGCT 
TTGCTATGGT 
CAGAGAAGAT 
TCAGAAGGAA 
ATTCAAGGTT 
GTTATATGGA 
GAAGAATTT C 
ACTTGAGTGT 
CAGTGGTGGA 
ACAAGTCATT 
TGAAGAGGCC 
CTACAATGGA 
TAGCAAAACG 
TTTTGTTAAG 
CCTTGCCCAA 



21 
1 

GGCCTCCTTC 
CGTACTGTCG 
GACTCCAAGC 
GCTGTTGTCA 
GAACTGTTCG 
CAAGGAT TCC 
CCCTGGTTTG 
ATGCAGATGA 
GGAGTTTTCA 
GAAGTTGTTT 
GTGGCACGTT 
GCACAGTTCC 
CTTTATGATA 
ATTCGAGAGA 
GTAGACTCAA 
GCTGTG CACA 
CTCAAAAAGC 
ACAACAACCC 
TTAAATATGA 
ATTGCCAATG 
GGTACTTTAC 



31 

1 

T CAACCT CAG 
CCGTCACCGC 
TGGAGAATGC 
TTCTGGATGC 
TGCAGTCTGA 
GTGCTATTAT 
ATCCAGCAAT 
TGAATAAGGT 
ACATTAGTGT 
TGCTTACACA 
CTGGAAACAT 
ACCCTGAAGT 
TAGCTGGATG 
T C AAAG AG AG 
CAGTTTGTAC 
TTGATAATGG 
TTGGAATTCA 
TACCAATATC 
CCACAAGTCC 
AAGTAATTGG 
GGCCTGATCT 



GCCAGTTCGG 
AGGGCCCGGC 
AGCCGAGGCC 
CGAGAGCTCG 
CTCCACCGCC 
GCACAT CAAG 
CATGGAGCAG 
GAAGCTGCTC 
CAAG CA CAT G 
CAACGCCAAC 
GGTCGGTGGC 
GGGGGGAGGA 
GAGCTGCGGC 
GCTCATCCTG 
CGCCGCCGAA 
CTCGCTGTAC 
CTCCGCAGCG 
CCTGTTCGGC 
CCCCAGCGAC 
CAGCCTGAGC 
CCACCGCGGC 
CGCGTCCTCC 
CGACGACGAG 
GTCCCTGGGC 
GCCCGGCTCC 
GATCTCGGGA 
GCGCAGGCAG 
AAACGAAAAG 
CAGGGCTCGT 
CGCGCTCCCA 
GGAGGAGGAA 
TGGCAAAAAA 
CTTCCAACGA 
AGACTTGAAG 
TTGGTCAAGA 
ATGAAGACAG 
CCGGGACTGA 
TTTCCCCTTG 
GTGACCGGCG 
GGCGGGGGCG 
GCACGTTGGG 
CAGTGAGGTG 
TACAGGGGCA 



41 
I 

STGGKADDPS 
KDSDKIPFIR 
SGGGGHGGGG 
AGGGGGGKAA 
LAAPGKHLAE 
GRSSAASSPA 
FEDDLLDLNP 
DWLESSISNL 



41 

I 

CCCGCGGCGC 
CGCGGCTCCG 
TGGAGGAGAC 
TGGTGCTCAG 
AATTTTCCCC 
CATCTCTGGA 
ATTCACTATT 
ATTTGGAGGT 
GGATAATACA 
TGGAGATAGT 
AGTAGCAGGC 
TGGCCTTACA 
CAGTGGAACC 
AGTAGGCACG 
AGCTTTGCTA 
CTTTATGAGA 
GGTCAAAGTG 
AGATGAAGAT 
TGAAGAGAAA 
AGAAATGAAC 
AATTGAAAGT 



CCGCCGCGCG 
GATCGCGCGG 
ATGGTGCAGC 
GACTCGGGCG 
TCCACGGGCG 
CGACCCATGA 
TCGCCCGACA 
AAAGACAGCG 
GCTGACTACC 
TCCAGCTCCT 
AGTGGCGGGG 
GGCGGCGGTG 
TCCAAAGTGG 
GCAGGCGGCG 
CAGGCGGGGG 
AAGGCGCGGA 
CTCGCGGCCC 
GGCCTGGGCA 
CCCCTGGGCC 
GGCCGCAGCA 
TACGCCAGCC 
TCGGCCTCGT 
TTCGAAGACG 
AGCTTCAGTT 
GGCTCGCACT 
GACTGGCTCG 
GGAGAAGGGC 
GACAGACGAA 
TCGCCCGCGT 
TCCCCCACCT 
GAGGGTAGAC 
AAAAAGCGAC 
GCTTCCGGAC 
GAGTCTCCCC 
AAGGAGGGGA 
AAGGCTCCGG 
GAGACTCCAC 
CCCCCTCTGC 
CTAGGAAATG 
GAGGAGGACA 
GAGCTGGCGG 
AGACTTCCCA 
GTCAGTGGAG 



51 
I 

WCKTPSGHIK 
EAERLRLKHM 
GGGSSNAGGG 
AAAAASFAAE 
KKVKRVYLFG 
AGRSPADHRG 
SSNFESMSLG 
VFTY 



51 
I 

CGACCCTTCC 
GCCCTGGCCC 
CTTAAGGATG 
TACGGGAAAG 
TTGGAAACAC 
GGACCTAATT 
GGCAAGCCTG 
ACTGTGCACA 
TGTTCATTAT 
GTAGACAAAG 
ATAGCAAATG 
GAAAATGGAA 
TTCACCGTGC 
TCAAAAGTTT 
AATCGTGCTT 
AAACGAGAAA 
ATAAATGCTG 
AGAACCCCAC 
AGAAAAATCA 
TTGAAACCAG 
GCATCCCTTG 



240 
300 
360 
420 
430 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



323 



10 



15 



WO 02/086443 

TTGCAAGTGG CAAAGCTGAA CTCATCAAAA CCCATCACAA TGACACAGAG CT CAT CAG AA 1320 

AGT T GAG AG A GGAGGGAAAA GTAATAGAAC GTCTGAAAGA TTTT CATAAA GATGAAGTGA 13 80 

GAATTTTGGG CAGAGAACTT GGACTTCCAG AAGAGTTAGT TTCCAGGCAT CCATTTCCAG 1440 

GTCCTGGCCT GGCAATCAGA GTAATATGTG CTGAAGAACC TTATATTTGT AAGGACTTTC 1500 

CTGAAACCAA CAATATTTTG AAAATAGTAG CTGATTTTTC TGCAAGTGTT AAAAAGCCAC 1560 

ATACCCTATT AC AG AG AGT C AAAGCCTGCA CAACAGAAGA GGATCAGGAG AAGCTGATGC 1620 

AAATTACCAG TCTGCATTCA CTGAATGCCT TCTTGCTGCC AATTAAAACT GTAGGTGTGC 1680 

AGGGTGACTG TCGTTCCTAC AGTTACGTGT GTGGAATCTC CAGTAAAGAT GAACCTGACT 1740 

GGGAATCACT TATTTTT CTG GCTAGGCTTA TACCTCGCAT GTGTCACAAC GTTAACAGAG 180 0 

TTGTT TATAT ATTTGGCCCA CCAGTTAAAG AACCTCCTAC AGATGTTACT CCCACTTTCT 1860 

TGACAACAGG GGTGCTCAGT ACTTTACGCC AAGCTGATTT TGAGGCCCAT AACATTCTCA 192 0 

GGGAGTCTGG GTATGCTGGG AAAATCAGCC AGATGCCGGT GATTTTGACA CCATTACATT 1980 

TTGATCGGGA CCCACTTCAA AAGCAGCCTT CATGCCAGAG ATCTGTGGTT ATTCGAACCT 2040 

TTATTACTAG TGACTTCATG ACTGGTATAC CTGCAACACC TGGCAATGAG ATCCCTGTAG 2100 

AGGTGGTATT AAAGATGGTC ACTGAGATTA AGAAGATTCC TGGTATTTCT CGAATTATGT 2160 
ATGACTTAAC ATCAAAGCCC CCAGGAACTA CTGAGTGGGA GTAATAAACT TC 
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20 



25 



30 



35 



Seq ID NO: 365 Protein sequence 
Protein Accession #: AAA60331 



MALCNGDSKL 
AFAIKEQGFR 
KSVREDGVFN 
SKKLYGAQFH 
VLLSGGVDST 
HSFYNGTTTL 
EVFLAQGTLR 
ILGRELGLPE 
TLLQRVKACT 
ESIiIFLARLI 
ESGYAGKISQ 
WLKMVTEIK 



11 
I 

ENAGGDLKDG 
AIIISGGPNS 
ISVDNTCSLF 
PEVGLTENGK 
VCTALLNRAL 
PI SDEDRTPR 
PDLIESASLV 
ELVSRHPFPG 
TEEDQEKLMQ 
PRMCHNVNRV 
MPVILTPLHF 
KIPGISRIMY 



21 
I 

HHHYEGAWI 
VYAEDAPWFD 
RGLQKEEWL 
VILKNFLYDI 
NQEQVIAVHI 
KR I SKTLNMT 
ASGKAELIKT 
PGLAIRVICA 
ITSLHSLNAF 
VYIFGPPVKE 
DRDPLQKQPS 
DLTSKPPGTT 



31 

I 

LDAGAQYGKV 
PAIFTIGKPV 
LTHGDSVDKV 
AGCSGTFTVQ 
DNGFMRKRES 
TSPEEKRKII 
HHNDTEL I RK 
EEPYICKDFP 
LLPIKTVGVQ 
PPTDVTPTFL 
CQRSWIRTF 



41 
I 

IDRRVRELFV 
LGICYGMQMM 
ADGFKWARS 
NRELECIREI 
QSVEEALKKL 
GDTFVKIANE 
LREEGKVIEP 
ETNNILKIVA 
GDCRSYSYVC 
TTGVLSTLRQ 
ITSDFMTGIP 



51 
I 

QSEIFPLETP 
NKVFGGTVHK 
GNIVAGIANE 
KERVGTSKVL 
GIQVKVINAA 
VIGEMNLKPE 
LKDFHKDEVR 
DFSASVKKPH 
GISSKDEPDW 
ADFEAHNILR 
ATPGNEIPVE 



60 
120 

ieo 

240 
300 
360 
420 
480 
540 
600 
660 



40 
45 
50 
55 



Seq ID NO: 366 DNA sequence 
Nucleic Acid Accession #: NM_004219 
Coding sequence: 46-654 



GCGGCCTCAG 
TATGTTGATA 
CTGGGGTCTG 
TTTGGCAAAA 
ACTGTCAACA 
CCAAGCTTTT 
GCCTCAGATG 
GAGAGTTTTG 
CTCATGATCC 
CCTGTGAAGA 
CTGTCGACCC 
TAGTGCTTCA 
AAAAAAAA 



11 
I 

ATGAATG CGG 
AGGAAAATGG 
GACCTTCAAT 
CGTTCGATGC 
GAGCTACAGA 
CTGCCAAAAA 
ATGCCTATCC 
ACCTGCCTGA 
TTGACGAGGA 
TGCCCTCTCC 
TGGATGT TG A 
GAGTTTGTGT 



21 

I 

CTGTTAAGAC 
AGAACCAGGC 
CAAAGCCTTA 
CCCACCAGCC 
AAAGTCTGTA 
GATGACTGAG 
AGAAATAGAA 
AGAGCACCAG 
GAGAGAGCTT 
ACCATGGGAA 
ATTGCCACCT 
GTATTTGTAT 



31 

i 

CTGCAATAAT 
ACCCGTGTGG 
GATGGGAGAT 
TTACCTAAAG 
AAGACCAAGG 
AAGACTGTTA 
AAATTCTTTC 
ATTGCGCACC 
GAAAAGCTGT 
TCCAATCTGT 
GTTTGCTGTG 
TAATAAAGCA 



41 
1 

CCAGAATGGC 
TTGCTAAGGA 
CTCAAGTTTC 
CTACTAGAAA 
GACCCCTCAA 
AAGCAAAAAG 
CCTTCAATCC 
TCCCCTTGAG 
TTCAGCTGGG 
TGCAGTCTCC 
ACATAGATAT 
TTCTTCAACA 



51 

1 

TACTCTGATC 
TGGGCTGAAG 
AACACCACGT 
GGCTTTGGGA 
ACAAAAACAG 
CTCTGTTCCT 
TCTAGACTTT 
TGGAGTGCCT 
CCCCCCTTCA 
TTCAAGCATT 
TTAAATTTCT 
GAAAAAAAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
-600 
660 
720 



60 



65 



Seq ID NO: 367 Protein sequence 
Protein Accession #: NP_004210 

1 11 21 31 41 51 

I I 1 i I I 

MATLIYVDKE NGEPGTRWA KDGLKLGSGP SIKALDGRSQ VSTPRFGKTF DAPPALPKAT 

RKALGTVNRA TEKSVKTKGP LKQKQPSFSA KKMTEKTVKA KSSVPASDDA YPEIEKFFPF 

NPLDFESFDL PEEHQIAHLP LSGVPLMILD EERELEKLFQ LGPPSPVKMP SPPWESNLLQ 
SPSSILSTLD VELPPVCCDI DI 



60 
120 
180 



70 
75 
80 
85 



Seq ID NO: 368 DNA sequence 
Nucleic Acid Accession #: NM_000597 
Coding sequence: 118-1104 



ATTCGGGGCG 
CCTGCCCGCC 
CTGCCGAGAG 
CCGCTGCTGC 
CTGTTCCGCT 
GCGCCGCCCG 
GTCCGGGAGC 
GGCGTCTACA 
CTGCCCCTGC 
TATGGCGCCA 
GTGGAGAACC 
AAGCCCCTCA 
CACCGGCAGA 
CGACCACCCC 



11 

1 

AGGGAGGAGG 
CGCCCGCTCG 
TGGGCTGCCC 
TGCTGCTACT 
GCCCGCCCTG 
CCGCGGTGGC 
CGGGCTGCGG 
CCCCGCGCTG 
AGGCGCTGGT 
GCCCGGAGCA 
ACGTGGACAG 
AGT CGGGT AT 
TGGGCAAGGG 
CTGCCAGGAC 



21 

1 

AAGAAGCGGA 
CTCGCTCGCC 
CGCGCTGCCG 
GGGCGCGAGT 
CACACCCGAG 
CGCAGTGGCC 
CTGCTGCTCG 
CGGCCAGGGG 
CATGGGCGAG 
GGTTGCAGAC 
CACCATGAAC 
GAAGGAGCTG 
TGGCAAGCAT 
TCCCTGCCAA 



31 
I 

GGAGGCGGCT 
CGCCGCGCCG 
CTGCCGCCGC 
GGCGGCGGCG 
CGCCTGGCCG 
GGAGGCGCCC 
GTGTGCGCCC 
CTGCGCTGCT 
GGCACTTGTG 
AATGGCGATG 
ATGTTGGGCG 
GCCGTGTTCC 
CACCTTGGCC 
CAGGAACTGG 



41 

I 

CCCGCTCGCA 
CGCTGCCGAC 
CGCCGCTGCT 
GCGGGGCGCG 
CCTGCGGGCC 
GCATGCCATG 
GGCTGGAGGG 
ATCCCCACCC 
AGAAGCGCCG 
ACCACTCAGA 
GGGGAGGCAG 
GGGAGAAGGT 
TGGAGGAGCC 
ACCAGGTCCT 



51 

I 

GGGCCGTGCA 
CGCCAGCATG 
GCCGCTGCTG 
CGCGGAGGTG 
CCCGCCGGTT 
CGCGGAGCTC 
CGAGGCGTGC 
GGGCTCCGAG 
GGACGCCGAG 
AGGAGGCCTG 
TGCTGGCCGG 
CACTGAGCAG 
CAAGAAGCTG 
GGAGCGGATC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



324 



10 



WO 02/086443 

TCCACCATGC GCCTTCCGGA TGAGCGGGGC CCTCTGGAGC ACCTCTACTC CCTGCACATC 900 

CCCAACTGTG ACAAGCATGG CCTGTACAAC CTCAAACAGT GCAAGATGTC TCTGAACGGG 960 

CAGCGTGGGG AGTGCTGGTG TGTGAACCCC AACACCGGGA AGCTGATCCA GGGAGCCCCC 1020 

ACCAT CCGGG GGGACCCCGA GTGTCATCTC TTCTACAATG AGCAGCAGGA GGCTTGCGGG 10 BO 

GTGCACACCC AGCGGATGCA GTAGACCGCA GCCAGCCGGT GCCTGGCGCC CCTGCCCCCC 1140 

GCCCCTCTCC AAACACCGGC AGAAAACGGA GAGTGCTTGG GTGGTGGGTG CTGGAGGATT 1200 

TTCCAGTTCT GACACACGTA TTTATATTTG GAAAGAGACC AGCACCGAGC TCGGCACCTC 1260 

CCCGGCCTCT CTCTTCCCAG CTGCAGATGC CACACCTGCT CCTTCTTGCT TTCCCCGGGG 1320 

GAGGAAGGGG GTTGTGGTCG GGGAGCTGGG GTACAGGTTT GGGGAGGGGG AAGAGAAATT 1380 
TTTATTTTTG AACCCCTGTG TCCCTTTTGC ATAAGATTAA AGGAAGGAAA AGT 
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Seq ID NO: 3 69 Protein sequence 
Protein Accession #: NP_000588 

15 1 11 21 31 41 51 

I I I I I ! 

MLPRVGCPAL PLPPPPLLPL LPLLLLLLGA SGGGGGARAE VLFRCPPCTP ERLAACGPPP 60 
VAPPAAVAAV AGGARMPCAE LVREPGCGCC SVCARLEGEA CGVYTPRCGQ GLRCYPHPGS 120 
ELPLQALiVMG EGTCEKRRDA EYGASPEQVA DNGDDHSEGG LVENHVDSTM NMLGGGGSAG 180 
20 RKPLKSGMKE LAVFREKVTE QHRQMGKGGK HHLGIjEEPKK DRPPPARTPC QQELDQVLER 240 

ISTMRLPDER GPLEHLYSLH IPNCDKHGLY NLKQCKMSLN GQRGECWCVN PNTGKLIQGA 3 00 
PTIRGDPECH LFYNEQQEAC GVHTQRMQ 



Seq ID NO: 370 DNA sequence 
25 Nucleic Acid Accession #: NM_004264 
Coding sequence: 6-440 

1 .11 21 31 41 51 

„ n I I I I I I 

3U GGAACATGGC GGAT CGGCTC ACGCAGCTTC AGGACGCTGT GAATTCGCTT GCAGATCAGT 60 

TTTGTAATGC CATTGGAGTA TTGCAGCAAT GTGGT CCTCC TGCCTCTTTC AATAATATTC 120 
AGACAGCAAT TAACAAAGAC CAGCCAGCTA ACCCTACAGA AGAGTATGCC CAGCTTTTTG 180 
CAGCACTGAT TGCACGAACA GCAAAAGACA TTGATGTTTT GATAGATTCC TTACCCAGTG 240 
AAGAATCTAC AGCTGCTTTA CAGGCTGCTA GCTTGTATAA GCTAGAAGAA GAAAACCATG 3 00 

35 AAGCTGCTAC ATGTGTGGAG GATGTTGTTT ATCGAGGAGA CATGCTTCTG GAGAAGATAC 360 

AAAGCGCACT TGCTGATATT GCACAGTCAC AGCTGAAGAC AAGAAGTGGT ACCCATAGCC 420 
AGTCTCTTCC AGACTCATAG CATCAGTGGA TACCATGTGG CTGAGAAAAG AACTGTTTGA 480 
GTGCCATTAA GAATTCTGCA TCAGACTTAG ATACAAGCCT TACCAACAAT TACAGAAACA 540 
TTAAACACTA TGACACATTA CCTTTTTAGC TATTTTTAAT AGTCTTCTAT TTTCACTCTT 600 

40 GATAAGCTTA TAAATCATGA TTGAATCAGC TTTAAAGCAT CATACCATCA TTTTTTAACT 660 

GAGTGAAATT ATTAAGG CAT GTAATACATT AATGAACATA ATATAAGGAA ACATATGTAA 720 
AATTCTGTTA TGACATAATT TATGTCTCCA TTTTGTTGTA TTGGCCAGTA CTTTTACAAT 7 80 
C 

45 Seq ID NO: 3 71 Protein sequence 

Protein Accession # : NP_0 04255 

1 11 21 31 41 51 

- n I I I I I I 

DU MADRLTQLQD AVNSLADQFC naigvlqqcg PPASFNNIQT AINKDQPANP TEEYAQLFAA 60 
LIARTAKDID VLIDSLPSEE STAALQAASL YKLEEENHEA ATCVEDWYR GDMLLEKIQS 120 
ALADIAQSQL KTRSGTHSQS LPDS 

Seq ID NO: 372 DNA sequence 
55 Nucleic Acid Accession #: AJ271091 
Coding sequence: 1-1113 

1 11 21 31 41 51 

^ n I I I I I I 

OU ATGGAGAATC AGGTGTTGAC GCCGCATGTC TACTGGGCTC AGCGACACCG CGAGCTATAT 60 

CTGCGCGTGG AGCTGAGTGA CGTACAGAAC CCTGCCATCA GCAT CACTGA AAACGTGCTG 120 

CATTTCAAAG CTCAAGGACA TGGTGCCAAA GGAGACAATG TCTATGAATT TCACCTGGAG 180 

TTCTTAGACC TTGTGAAACC AGAGCCTGTT TACAAACTGA CCCAGAGGCA GGTAAACATT 240 

ACAGTACAGA AGAAAGTGAG TCAGTGGTGG GAGAGACTCA CAAAGCAGGA AAAGCGACCA 300 

65 CTGTTTTTGG CTCCTGACTT TGAT CGTTGG CTGGATGAAT CTGATGCGGA AATGGAGCTC 360 

AGAGCTAAGG AAGAAGAGCG CCTAAATAAA CTCCGACTGG AAAGCGAAGG CTCTCCTGAA 42 0 

ACTCTTACAA ACTTAAGGAA AGGATACCTG TTTATGTATA ATCTTGTGCA ATTCTTGGGA 480 

TTCTCCTGGA TCTTTGTCAA CCTGACTGTG CGATTCTGTA TCTTGGGAAA AGAGTCCTTT 540 

TATGACACAT TCCATACTGT GGCTGACATG ATGTATTTCT GCCAGATGCT GGCAGTTGTG 600 

70 GAAACTATCA ATGCAG CAAT TGGAGTCACT ACGTCACCGG TGCTGCCTTC TCTGATCCAG 660 

CTTCTTGGAA GAAAT TTTAT TTTGTTTATC ATCTTTGGCA CCATGGAAGA AATGCAGAAC 72 0 

AAAGCTGTGG TTTTCTTTGT GTTTTATTTG TGGAGTGCAA TTGAAATTTT CAGGTACTCT 780 

TTCTACATGC TGACGTGCAT TGACATGGAT TGGAAGGTGC TCACATGGCT TCGTTACACT 840 

CTGTGGATTC CCTTATATCC ACTGGGATGT TTGGCGGAAG CTGTCTCAGT GATTCAGTCC 900 

75 ATTCCAATAT TCAATGAGAC CGGACG ATT C AGTTTCACAT TGCCATATCC AGTGAAAATC 960 

AAAGTTAGAT TTTCCTTTTT TCTTCAGATT TATCTTATAA TGATATTTTT AGGTTTATAC 102 0 

ATAAATTTTC GTCACCTTTA TAAACAG CGC AGACTGAAAA TGAGGG CAGG CGCAGTGGCT 108 0 
CATGCCTGTG ATCCCAGCGC TTTGGGAGGC TGA 

80 Seq ID NO: 3 73 Protein sequence 
Protein Accession #: CAB69070 

1 11 21 31 41 51 

or I I 1 ! 1 i 

OO MENQVLTPHV YWAQRHRELY LRVELSDVQN PAISITENVL HPKAQGHGAK GDNVYEFHLE 60 

PLDDVKPEPV YKLTQRQVNI TVQKKVSQWW ERLTKQEKRP LFDAPDFDRW LDESDAEMEL 120 
RAKEEERLNK LRLESEGSPE TLTNLRKGYL FMYNLVQFLG FSWIFVNLTV RFCILGKESF 180 



325 



10 



WO 02/086443 PCT/US02/12476 

YDTFHTVADM MYFCQMLAW ETINAAIGVT TSPVLPSLIQ LLGRNFILFI IFGTMEEMQN 240 
KAWFFVFYL WSAIEIFRYS FYMLTCIDMD WKVLTWLRYT LWIPLYPLGC LAEAVSVIQS 3 00 
IPIFNETGRF SFTLPYPVKI KVRFSFFLQI YLIMIFLGLY INFRHLYKQR RLKMRAGAVA 360 
HACDPSALGG 

Seq ID NO: 3 74 DNA sequence 
Nucleic Acid Accession ft: NM_016395 
Coding sequence: 1-1113 



1 11 21 31 41 51 

111)11 

ATGGAGAATC AGGTGTTGAC GCCGCATGTC TACTGGGCTC AGCGACACCG CGAGCTATAT 60 

CTGCGCGTGG AG CTGAGTGA CGTACAGAAC CCTGCCATCA GCATCACTGA AAACGTGCTG 120 

15 CATTTCAAAG CTCAAGGACA TGGTGCCAAA GGAGACAATG TCTATGAATT TCACCTGGAG 180 

TTCTTAGACC TTGTGAAACC AGAGCCTGTT TACAAACTGA CCCAGAGGCA GGTAAACATT 240 

ACAGTACAGA AGAAAGTGAG TCAGTGGTGG GAGAGACTCA CAAAGCAGGA AAAGCGACCA 300 

CTGTTTTTGG CTCCTGACTT TGATCGTTGG CTGGATGAAT CTGATGCGGA AATGGAGCTC 3 60 

AGAGCTAAGG AAGAAGAG CG CCTAAATAAA CTCCGACTGG AAAGCGAAGG CTCTCCTGAA 420 

20 ACTCTTACAA ACTTAAGGAA AGGATACCTG TTTATGTATA ATCTTGTGCA ATTCTTGGGA 4 80 

TTCTCCTGGA TCTTTGTCAA CCTGACTGTG CGATTCTGTA TCTTGGGAAA AGAGTCCTTT 540 

TATGACACAT TCCATACTGT GGCTGACATG ATGTATTTCT GCCAGATGCT GGCAGTTGTG 60 0 

GAAACTATCA ATGCAGCAAT TGGAGTCACT ACGT CACCGG TGCTGCCTTC TCTGATCCAG 660 

CTTCTTGGAA GAAATTTTAT TTTGTTTATC ATCTTTGGCA CCATGGAAGA AATGCAGAAC 72 0 

25 AAAGCTGTGG TTTTCTTTGT GTTTTATTTG TGGAGTGCAA TTGAAATTTT CAGGTACTCT 7 80 

TTCTACATGC TGACGTGCAT TGACATGGAT TGGAAGGTGC TCACATGGCT TCGTTACACT 840 

CTGTGGATTC CCTTATATCC ACTGGGATGT TTGGCGGAAG CTGTCTCAGT GATTCAGTCC 900 

ATTCCAATAT TCAATGAGAC CGGACGATTC AGTTTCACAT TGCCATATCC AGTGAAAATC 960 

AAAGTTAGAT TTTCCTTTTT TCTTCAGATT TATCTTATAA TGATATTTTT AGGTTTATAC 102 0 

30 ATAAATTTTC GTCACCTTTA TAAACAGCGC AGACTGAAAA TGAGGGCAGG CGCAGTGGCT 1080 
CATGCCTGTG ATCCCAGCGC TTTGGGAGGC TGA 

Seq ID NO: 375 Protein sequence 
35 Protein Accession #: NP_057479 

1 11 21 31 41 51 

I I i ! I I 

MENQVLTPHV YWAQRHRELY LRVELSDVQN PAISITENVL HFKAQGHGAK GDNVYEFHLE 60 

40 FLDLVKPEPV YKLTQRQVNI TVQKKVSQWW ERLTKQEKRP LFLAPDFDRW LDESDAEMEL 120 

RAKEEERLNK LRLESEGSPE TLTNLRKGYL FMYNLVQFLG FSWIFVNLTV RFCILGKESF 180 

YDTFHTVADM MYFCQMLAW ETINAAIGVT TSPVLPSLIQ LLGRNFILFI IFGTMEEMQN 240 

KAWFFVFYL WSAIEIFRYS FYMLTCIDMD WKVLTWLRYT LWIPLYPLGC LVEAVSVIQS 300 

IPIFNETGRF SFTLPYPVKI KVRFSFFLQI YLIMIFLGLY INFRHLYKQR RRRYGKKRKR 360 
45 STKKKDLDGF LPV 

Seq ID NO: 376 DNA sequence 
Nucleic Acid Accession #: NM_005987 
Coding sequence: 1-270 

1 11 21 31 41 51 

I I I I I 1 

ATGAATTCTC AGCAGCAGAA GCAGCCTTGC ACCCCACCCC CTCAGCCTCA GCAGCAGCAG 60 

GTGAAACAAC CTTGCCAGCC TCCACCCCAG GAACCATGCA TCCCCAAAAC CAAGGAGCCC 120 

55 TGCCAACCCA AGGTGCCTGA GCCCTGCCAC CCCAAAGTGC CTGAGCCCTG CCAGCCCAAG 180 

ATTCCAGAGC CCTGCCAGCC CAAGGTGCCT GAGCCCTGCC CTTCAACGGT CACTCCAGCA 240 
CCAGCCCAGC AGAAGACCAA GCAGAAGTAA 

Seq ID NO: 3 77 Protein sequence 
60 Protein Accession #: NP_005978 

1 11 21 31 41 51 

I I I I I I 

MNSQQQKQPC TPPPQPQQQQ VKQPCQPPPQ EPCIPKTKEP CQPKVPEPCH PKVPEPCQPK 60 
65 IPEPCQPKVP EPCPSTVTPA PAQQKTKQK 

Seq ID NO: 378 DNA sequence 
Nucleic Acid Accession #: NM_002105 
70 Coding sequence: 74-505 

1 11 21 31 41 51 

I I I I I I 

ACAGCAGTTA CACTGCGGCG GGCGTCTGTT CTAGTGTTTG AGCCGTCGTG CTTCACCGGT 60 

75 CTACCTCGCT AGCATGTCGG GCCGCGGCAA GACTGGCGGC AAGGCCCGCG CCAAGGCCAA 120 

GTCGCGCTCG TCGCGCGCCG GCCTCCAGTT CCCAGTGGGC CGTGTACACC GGCTGCTGCG 180 

GAAGGGCCAC TACGCCGAGC GCGTTGGCGC CGGCGCGCCA GTGTACCTGG CGGCAGTGCT 240 

GGAGTACCTC ACCGCTGAGA TCCTGGAGCT GGCGGGCAAT GCGGCCCGCG ACAACAAGAA 30 0 

GACGCGAATC ATCCCCCGCC ACCTGCAGCT GGCCATCCGC AACGACGAGG AGCTCAACAA 360 

80 GCTGCTGGGC GGCGTGACGA TCGCCCAGGG AGGCGTCCTG CCCAACATCC AGGCCGTGCT 420 

GCTGCCCAAG AAGACCAGCG CCACCGTGGG GCCGAAGGCG CCCTCGGGCG GCAAGAAGGC 480 

CACCCAGGCC TCCCAGGAGT ACTAAGAGGG CCCGCGCCGC GGCCGGCCGC CCCAGCTCCC 540 

CATGCCACCA CAAAGGCCCT TTTAAGGGCC ACCACCGCCC TCATGGAAAG AGCTGAGCCG 600 

CTTCAGACTG CGGGGCAAGC GGGCCGCGGC TCCCTTCCCC TCCCCTCCCC TCGCCCGCCT 660 

85 TCGCCGCCCG GCCTCGAGTC CCCGCCCGCC CCCGCTCCCG TCCCGCACCG CCTGCCGCGT 720 

CGGCCTCGGG CCTGCCCTGT CCGCCGTCCG CCCTCCGGTA GGGTTCGGGC CTTCCGGATG 780 

CGGCTTGGGC GCTCTTCGGG GACCTCCGTG GCGCGGAAGA CCCGAGCCTG CCGGGGGGAG 840 
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GCCGGCGGCG CCGCACCTGC CCGCCTCGGC GTTCGTGACT CAGCCGCCCC ATCCCGAGTC 900 

GCTAAGGGGC TGCGGGGAGG CCGCAGCACC TTCTGGAAGA CTTGGCCTTC CGCTCTGACG 960 

CAGGGCCGAG GTGGGCAGTC CAGGCCGAGA GCCGGCGGCC CTGAAGGTGA GTGAGGCCCT 102 0 

CGGCAGCTGC AGCCGGGGTG TCTGGTACCC CCCCGGCGTG GTGCTTAGCC CAGGACTTTC 1080 

AGACGGCCGC TGGCCGGGAG GCTTTGGTGG GAGAGACGCG ATCGCCGATT TCGGTCTGGC 1140 

GCCCCTTCTG CGGCCGGGAC CCAGGCCTTT CACATCAGCT CTCCCTCCAT CTTCATTCAT 12 00 

AGGTCTGCGC TGGGGCCGGG ACGAAGCACT TGGTAACAGG CACATCTTCC TCCCGAGTGA 12 60 

CTGCCTCCTA GGAGGACATT TAGGGGAGGG CAGAGGCCTG CAGTTTGGCT TCACGGCTGG 1320 

CTATGTGGAC AGCAAGAGTC GTTTTGCGGA ACGCGACTGG CAGCCAGGCC TGTCGGGCCC 13 80 

CCGACGCCGC CCCATTTCCC TTCCAGCAAA CT CAACTCGG CAAT CCAAGC ACCTAGATAC 1440 

CAGCACAAGT CGGTTAATCC CTGTCTGGAC TGAGCCTCCG TTGGCTTCTG AACTGGAATT 150 0 

CTGCAGCTAA CCCTTCCACG ACTAGAACCT TAGGCATTGG GGAGTTTTAG ATGGACTAAT 1560 
TTTATTAAAG GATTGTTTTT TTTTT 

Seq ID NO: 37 9 Protein sequence 
Protein Accession #: NP_002096 

1 ll 21 31 41 51 

I I I I 1 I 

MSGRGKTGGK ARAKAKSRSS RAGLQPPVGR VHRLLRKGHY AERVGAGAPV YLAAVLEYLT 60 

AEILELAGNA ARDNKKTRI I PRHLQLAIRN DEELNKLLGG VTIAQGGVLP NIQAVLLPKK 120 
TSATVGPKAP SGGKKATQAS QEY 



Seq ID NO: 3 80 DNA sequence 
Nucleic Acid Accession #: AL136942 
Coding sequence: 184-864 

1 11 21 31 41 51 

I I I I I 1 

ACGCGTCCGG CAGAAGCTCG GAGCTCTCGG GGT AT CGAGG AGGCAGGCCC GCGGGCGCAC 60 

GGGCGAGCGG GCCGGGAGCC GGAGCGGCGG AGGAGCCGGC AGCAGCGGCG CGGCGGGCTC 120 

CAGGCGAGGC GGTCGACGCT CCTGAAAACT TGCGCGCGCG CTCGCGCCAC TGCGCCCGGA 180 

GCGATGAAGA TGGTCGCGCC CTGGACGCGG TTCTACTCCA ACAGCTGCTG CTTGTGCTGC 240 

CATGTCCGCA CCGGCACCAT CCTGCTCGGC GTCTGGTATC TGATCATCAA TGCTGTGGTA 300 

CTGTTGATTT TATTGAGTGC CCTGGCTGAT CCGGATCAGT ATAACTTTTC AAGTTCTGAA 3 60 

CTGGGAGGTG ACTTTGAGTT CATGGATGAT GCCAACATGT GCATTGCCAT TGCGATTTCT 42 0 

CTTCTCATGA TCCTGATATG TGCTATGGCT ACTTACGGAG CGTACAAGCA ACGCGCAGCC 480 

TGGATCATCC CATTCTTCTG TTACCAGATC TTTGACTTTG CCCTGAACAT GTTGGTTGCA 540 

ATCACTGTGG TTATTTATCC AAACTCCATT CAGGAATACA TACGGCAACT GCCTCCTAAT 60 0 

TTTCCCTACA GAGATGATGT CATGT CAGTG AATCCTACCT GTTTGGTCCT TATTATTCTT 660 

CTGTTTATTA GCATTATCTT GACTTT TAAG GGTTACTTGA TTAGCTGTGT TTGGAACTGC 720 

TACCGATACA TCAATGGTAG GAACTCCTCT GATGTCCTGG TTTATGTTAC CAGCAATGAC 780 

ACTACGGTGC TGCTACCCCC GTATGATGAT GCCACTGTGA ATGGTGCTGC CAAGGAGCCA 840 

CCGCCACCTT ACGTGTCTGC CTAAGCCTTC AAGTGGGCGG AGCTGAGGGC AGCAGCTTGA 900 

CTTTGCAGAC ATCTGAGCAA TAGTTCTGTT ATTTCACTTT TGCCATGAGC CTCTCTGAGC 960 

TTGTTTGTTG CTGAAATGCT ACTTTTTAAA ATTTAGATGT TAGATTGAAA ACTGTAGTTT 102 0 

TCAACATATG CTTTGCTAGA ACACTGTGAT AGATTAACTG TAGAATTCTT CCTGTACGAT 1080 

TGGGGATATA ACGGGCTTCA CTAACCTTCC CTAGGCATTG AAACTTCCCC CAAATCTGAT 1140 

GGACCTAGAA GTCTGCTTTT GTACCTGCTG GGCCCCAAAG TTGGGCATTT TTCTCTCTGT 1200 

TCCCTCTCTT TTGAAAATGT AAAATAAAAC CAAAAATAGA CAACTTTTTC TTCAGCCATT 12 60 

CCAGCATAGA GAACAAAACC TTATGGAAAC AGGAATGTCA ATTGTGTAAT CATTGTTCTA 132 0 

ATTAGGTAAA TAGAAGTCCT TATGTATGTG TTACAAGAAT TTCCCCCACA ACATCCTTTA 1380 

TGACTGAAGT T C AATG AC AG TTTGTGTTTG GTGGTAAAGG ATTTTCTCCA TGGCCTGAAT 1440 

TAAGACCATT AGAAAGCACC AGGCCGTGGG AGCAGTGACC ATCTACTGAC TGTTCTTGTG 1500 

GATCTTGTGT CCAGGGACAT GGGGTGACAT GCCTCGTATG TGTTAGAGGG TGGAATGGAT 1560 

GTGTTTGGCG CTGCATGGGA TCTGGTGCCC CTCTTCTCCT GGATTCACAT CCCCACCCAG 1620 

GGCCCGCTTT TACTAAGTGT TCTGCCCTAG ATTGGTTCAA GGAGGTCATC CAACTGACTT 1680 

TATCAAGTGG AATTGGGATA TATTTGATAT ACTTCTGCCT AACAACATGG AAAAGGGTTT 1740 

TCTTTTCCCT GCAAGCTACA TCCTACTGCT TTGAACTTCC AAGTATGTCT AGTCACCTTT 1800 

TAAAATGTAA ACATT TTC AG AAAAATGAGG ATTGCCTTCC TTGTATGCGC TTTTTACCTT 1860 

GACTACCTGA ATTGCAAGGG ATTTTTATAT ATT CATATGT TACAAAGTCA GCAACTCTCC 1920 

TGTTGGTTCA TTATTGAATG TGCTGTAAAT TAAGTCGTTT GCAATTAAAA CAAGGTTTGC 198 0 
CCACATCCAA AAAAAAAAAA AAAAA 

Seq ID NO: 381 Protein sequence 
Protein Accession #: CAB66876 

1 11 21 31 41 51 

I I I ! I 1 

MKMVAPWTRF YSNSCCLCCH VRTGTILLGV WYLIINAWL L ILL SALAD P DQYNFSSSEL 60 

GGDFEFMDDA NMCIAIAISL LMILICAMAT YGAYKQRAAW IIPFFCYQIF D FALNMLV A I 120 

TVLIYPNSIQ EYIRQLPPNF PYRDDVMSVN PTCLVLIILL FISIILTFKG YLISCVWNCY 180 
RYINGRNSSD VLVYVT SNDT TVLLPPYDDA TVNGAAKEPP PPYVSA 

Seq ID NO: 382 DNA sequence 
Nucleic Acid Accession #: NM_002510 
Coding sequence: 92-1774 

1 11 21 31 41 51 

111)11 

CAGATGCCAG AAGAACACTG TTGCTCTTGG TGGACGGGCC CAGAGGAATT CAGAGTTAAA 60 

CCTTGAGTGC CTGCGTCCGT GAGAATTCAG CATGGAATGT CTCTACTATT TCCTGGGATT 120 

TCTGCTCCTG GCTGCAAGAT TGCCACTTGA TGCCGCCAAA CGATTTCATG ATGTGCTGGG 180 

CAATGAAAGA CCTTCTGCTT ACATGAGGGA GCACAATCAA TTAAATGGCT GGTCTTCTGA 240 

TGAAAATGAC TGGAATGAAA AACTCTACCC AGTGTGGAAG CGGGGAGACA TGAGGTGGAA 300 

AAACTCCTGG AAGGGAGGCC GTGTGCAGGC GGTCCTGACC AGTGACTCAC CAGCCCTCGT 360 
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GGGCT CAAAT ATAACATTTG CGGTGAACCT GATATTCCCT AGATGCCAAA AGGAAGATGC 42 0 

CAATGGCAAC ATAGTCTATG AGAAGAACTG CAGAAATGAG GCTGGTTTAT CTGCTGATCC 480 

ATATGTTTAC AACTGGACAG CATGGTCAGA GGACAGTGAC GGGGAAAATG GCACCGGCCA 540 

AAG CCAT CAT AACGTCTTCC CTGATGGGAA ACCTTTTCCT CACCACCCCG GATGGAGAAG 600 

ATGGAATTTC ATCTACGTCT TCCACACACT TGGTCAGTAT TTCCAGAAAT TGGGACGATG 660 

TTCAGTGAGA GTTTCTGTGA ACACAGCCAA TGTGACACTT GGGCCTCAAC TCATGGAAGT 72 0 

GACTGTCTAC AGAAGACATG GACGGGCATA TGTTCCCATC GCACAAGTGA AAGATGTGTA 78 0 

CGTGGTAACA GATCAGATTC CTGTGTTTGT GACTATGTTC CAGAAGAACG ATCGAAATTC 840 

ATCCGACGAA ACCTTCCTCA AAGAT CTCCC CATTATGTTT GATGTCCTGA TTCATGATCC 90 0 

TAGCCACTTC CTCAATTATT CTACCATTAA CTACAAGTGG AGCTTCGGGG ATAATACTGG 960 

CCTGTTTGTT TCCACCAATC ATACTGTGAA T CACACGT AT GTGCTCAATG GAACCTTCAG 102 0 

CCTTAACCTC ACTGTGAAAG CTGCAGCACC AGGACCTTGT CCGCCACCGC CACCACCACC 10 80 

CAGACCTTCA AAACCCACCC CTTCTTTAGG ACCTGCTGGT GACAACCCCC TGGAGCTGAG 1140 

TAGGATTCCT GATGAAAACT GCCAGATTAA CAGATATGGC CACTTTCAAG CCACCATCAC 12 00 

AATTGTAGAG GGAATCTTAG AGGTTAACAT CATCCAGATG ACAGACGTCC TGATGCCGGT 12 60 

GCCATGGCCT GAAAGCTCCC TAATAGACTT TGTCGTGACC TGCCAAGGGA GCATTCCCAC 132 0 

GGAGGTCTGT ACCATCATTT CTGACCCCAC CTGCGAGATC ACCCAGAACA CAGTCTGCAG 13 8 0 

CCCTGTGGAT GTGGATGAGA TGTGT CTGCT GACTGTGAGA CGAACCTTCA ATGGGTCTGG 1440 

GACGTACTGT GTGAACCTCA CCCTGGGGGA TGACACAAGC CTGGCTCTCA CGAGCACCCT 150 0 

GATTTCTGTT CCTGACAGAG ACCCAGCCTC GCCTTTAAGG ATGGCAAACA GTGCCCTGAT 1560 

CTCCGTTGGC TGCTTGGCCA TATTTGTCAC TGTGATCTCC CTCTTGGTGT ACAAAAAACA 162 0 

CAAGGAATAC AACCCAATAG AAAATAGTCC TGGGAATGTG GTCAGAAGCA AAGGCCTGAG 168 0 

TGTCTTTCTC AACCGTGCAA AAGCCGTGTT CTTCCCGGGA AACCAGGAAA AGGATCCGCT 1740 

ACTCAAAAAC CAAGAATTTA AAGGAGTTTC TTAAATTTCG ACCTTGTTTC TGAAGCTCAC 18 0 0 

TTTTCAGTGC CATTGATGTG AGATGTGCTG GAGTGGCTAT TAACCTTTTT TTCCTAAAGA 1860 

TTATTGTTAA ATAGATATTG TGGTTTGGGG AAGTTGAATT TTTTATAGGT TAAATGT CAT 192 0 

TTTAGAGATG GGGAGAGGGA TTATACTGCA GGCAGCTTCA GCCATGTTGT GAAACTGATA 198 0 

AAAGCAACTT AGCAAGGCTT CTTTTCATTA TTTTTTATGT TTCACTTATA AAGTCTTAGG 2 040 

TAACTAGTAG GATAGAAACA CTGTGTCCCG AGAGTAAGGA GAGAAGCTAC TATTGATTAG 210 0 

AGCCTAACCC AGGTTAACTG CAAGAAGAGG CGGGATACTT TCAGCTTTCC ATGTAACTGT 2160 

ATGCATAAAG CCAATGTAGT CCAGTTTCTA AGATCATGTT CCAAGCTAAC TGAATCCCAC 222 0 

TTCAATACAC ACTCATGAAC TCCTGATGGA ACAATAACAG GCCCAAGCCT GTGGTATGAT 22 80 

GTGCACACTT GCTAGACTCA GAAAAAATAC TACTCTCATA AATGGGTGGG AGTATTTTGG 2 34 0 

TGACAACCTA CTTTGCTTGG CTGAGTGAAG GAATGATATT CATATATTCA TTTATTCCAT 2 40 0 

GGACATTTAG TTAGTGCTTT TTATATACCA GGCATGATGC TGAGTGACAC TCTTGTGTAT 24 60 

ATTTCCAAAT TTTTGTATAG TCGCTGCACA TATTTGAAAT CATATATTAA GACTTTCCAA 2520 

AGATGAGGT C CCTGGTTTTT CATGGCAACT TGATCAGTAA GGATTTCACC TCTGTTTGTA 2580 

ACTAAAACCA TCTACTATAT GTTAGACATG ACATTCTTTT TCTCTCCTTC CTGAAAAATA 2 640 
AAGTGTGGGA AGAGACAAAA AAAAAAAAA 

Seq ID NO: 3 83 Protein sequence 
Protein Accession #: NP_002501 

1 ll 21 31 41 51 

I I I I 1 I 

MECLYYFLGF LLLAARLPLD AAKRFHDVLG NERPSAYMRE HNQLNGWSSD ENDWNEKLYP 60 

VWKRGDMRWK NSWKGGRVQA VLTSDSPALV GSNITFAVNL IFPRCQKEDA NGNIVYEKNC 120 

RNEAGLSADP YVYNWTAWSE DSDGENGTGQ SHHNVFPDGK PFPHHPGWRR WNFIYVFHTL 180 

GQYFQKLGRC SVRVSVNTAN VTLGPQLMEV TVYRRHGRAY VPIAQVKDVY WTDQIPVFV 240 

TMFQKNDRNS SDETFLKDLP IMFDVLIHDP SHFLNYSTIN YKWSFGDNTG LFVSTNHTVN 3 00 

HTYVLNGTFS LNLTVKAAAP GPCPPPPPPP RPSKPTPSLG PAGDNPLELS RIPDENCQIN 3 60 

RYGHFQATIT IVEGILEVNI IQMTDVLMPV PWPESSLIDF WTCQGSIPT EVCTIISDPT 420 

CEITQNTVCS PVDVDEMCLL TVRRTFNGSG TYCVNLTLGD DTSLALTSTL ISVPDRDPAS 480 

PLRMANSALI SVGCLAIFVT VISLIiVYKKH KEYNPIENSP GNWRSKGLS VFLNRAKAVF 540 
FPGNQEKDPL LKNQEFKGVS 

Seq ID NO: 384 DNA sequence 
Nucleic Acid Accession #: NM_001134 
Coding sequence: 48-1877 

1 11 21 31 41 51 

I I I 1 I I 

TCCATATTGT GCTTCCACCA CTGCCAATAA CAAAATAACT AG CAACCATG AAGTGGGTGG 60 

AATCAATTTT TTTAATTTTC CTACTAAATT TTACTGAATC CAGAACACTG CATAGAAATG 120 

AATATGGAAT AGCTTCCATA TTGGATTCTT ACCAATGTAC TGCAGAGATA AGTTTAGCTG 180 

ACCTGGCTAC CATATTTTTT GCCCAGTTTG TTCAAGAAGC CACTTACAAG GAAGTAAGCA 240 

AAATGGTGAA AGATGCATTG ACTGCAATTG AGAAACCCAC TGGAGATGAA CAGTCTTCAG 3 00 

GGTGTTTAGA AAACCAG CTA CCTGCCTTTC TGGAAGAACT TTGCCATGAG AAAGAAATTT 3 60 

TGGAGAAGTA CGGACATTCA GACTGCTGCA GCCAAAGTGA AGAGGGAAGA CATAACTGTT 420 

TTCTTGCACA CAAAAAGCCC ACTCCAGCAT CGATCCCACT TTTCCAAGTT CCAGAACCTG 480 

TCACAAGCTG TGAAGCATAT GAAGAAGACA GGGAGACATT CATGAACAAA TTCATTTATG 540 

AGATAGCAAG AAGGCATCCC TTCCTGTATG CACCTACAAT TCTTCTTTGG GCTGCTCGCT 600 

ATGACAAAAT AATTCCATCT TGCTGCAAAG CTGAAAATGC AGTTGAATGC TTCCAAACAA 660 

AGGCAGCAAC AGTTACAAAA GAATTAAGAG AAAGCAGCTT GTTAAATCAA CATGCATGTG 720 

CAGTAATGAA AAATTTTGGG ACCCGAACTT TCCAAGCCAT AACTGTTACT AAACTGAGTC 780 

AGAAGTTTAC CAAAGTTAAT TTTACTGAAA TCCAGAAACT AGTCCTGGAT GTGGCCCATG 840 

TACATGAGCA CTGTTGCAGA GGAGATGTGC TGGATTGTCT GCAGGATGGG GAAAAAATCA 90 0 

TGTCCTACAT ATGTTCTCAA CAAGACACTC TGTCAAACAA AATAACAGAA TGCTGCAAAC 960 

TGACCACGCT GGAACGTGGT CAATGTATAA TTCATGCAGA AAATGATGAA AAACCTGAAG 1020 

GTCTATCTCC AAATCTAAAC AGGTTTTTAG GAGATAGAGA TTTTAACCAA TTTTCTTCAG 1080 

GGGAAAAAAA TATCTTCTTG GCAAGTTTTG TTCATGAATA TTCAAGAAGA CATCCTCAGC 1140 

TTGCTGTCTC AGTAATTCTA AGAGTTGCTA AAGGATACCA GGAGTTATTG GAGAAGTGTT 12 00 

TCCAGACTGA AAACCCTCTT GAATGCCAAG ATAAAGGAGA AGAAGAATTA CAGAAATACA 12 60 

TCCAGGAGAG CCAAGCATTG GCAAAGCGAA GCTGCGGCCT CTTCCAGAAA CTAGGAGAAT 1320 

ATTACTTACA AAATGCGTTT CTCGTTGCTT ACACAAAGAA AGCCCCCCAG CTGACCTCGT 1380 

CGGAGCTGAT GGCCATCACC AGAAAAATGG CAGCCACAGC AGCCACTTGT TGCCAACTCA 1440 

GTGAGGACAA ACTATTGGCC TGTGGCGAGG GAGCGGCTGA CATTATTATC GGACACTTAT 1500 
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GTATCAGACA TGAAATGACT CCAGTAAACC CTGGTGTTGG CCAGTGCTGC ACTTCTTCAT 1560 

ATGCCAACAG GAGGCCATGC TTCAGCAGCT TGGTGGTGGA TGAAACATAT GTCCCTCCTG 1620 

CATTCTCTGA TGACAAGTTC ATTTTCCATA AGGATCTGTG CCAAGCTCAG GGTGTAGCGC 1680 

TGCAAACGAT GAAGCAAGAG TTTCTCATTA ACCTTGTGAA GCAAAAGCCA CAAATAACAG 1740 

AGGAACAACT TGAGGCTGTC ATTGCAGATT TCTCAGGCCT GTTGGAGAAA TGCTGCCAAG 1800 

GCCAGGAACA GGAAGTCTGC TTTGCTGAAG AGGGACAAAA ACTGATTTCA AAAACT CGTG 1860 

CTGCTTTGGG AGTTTAAATT ACTTCAGGGG AAGAGAAGAC AAAACGAGTC TTTCATT CGG 192 0 

TGTGAACTTT TCTCTTTAAT TTTAACTGAT TTAACACTTT TTGTGAATTA ATGAAATGAT 1980 
AAAG ACT T T T ATGTGAGATT TCCTTATCAC AGAAATAAAA TAT CTCCAAA TG 

Seq ID NO: 385 Protein sequence 
Protein Accession #: NP_001125 

1 11 21 31 41 51 

I I I I I I 

MKWVESIFLI FLLNFTESRT LHRNEYGIAS ILDSYQCTAE ISLADLATIF FAQFVQEATY 60 

KEVS KMVKD A LTAIEKPTGD' EQSSGCLENQ LPAFLEELCH EKEILEKYGH SDCCSQSEEG 120 

RHNCFLAHKK PTPASIPLFQ VPEPVTSCEA YEEDRETFMN KFIYEIARRH PFLYAPTILIi 180 

WAARYDKIIP SCCKAENAVE CFQTKAATVT KELRESSLLN QHACAVMKNF GTRTFQAITV 240 

TKLSQKFTKV NFTEIQKLVL DVAHVHEHCC RGDVLDCLQD GEKIMSYICS QQDTLSNKIT 3 00 

ECCKLTTLER GQCIIHAEND EKPEGLSPNL NRFLGDRDFN QFSSGEKNIF LASFVHEYSR 3 60 

RHPQLAVSVI LRVAKGYQEL LEKCFQTENP LECQDKGEEE LQKYIQESQA LAKRSCGLFQ 420 

KLGEYYLQNA FLVAYTKKAP QLTSSELMAI TRKMAATAAT CCQLSEDKLiL ACGEGAAD 1 1 480 

IGHLCIRHEM TPVNPGVGQC CTSSYANRRP CFSSLWDET YVPPAFSDDK FIFHKDLCQA 540 

QGVALQTMKQ EFLINLVKQK PQITEEQLEA VIADFSGLLE KCCQGQEQEV CFAEEGQKLI 600 
SKTRAALGV 

Seq ID NO: 386 DNA sequence 

Nucleic Acid Accession #: NM_002205.1 

Coding sequence: 1..3149 

1 11 21 31 41 51 

I I . I I I I 

ATGGGGAGCC GGACGCCAGA GTCCCCTCTC CACGCCGTGC AGCTGCGCTG GGGCCCCCGG 60 

CGCCGACCCC CGCTSSTGCC GCTGCTGTTG CTGCTSSTGC CGCCGCCACC CAGGGTCGGG 120 

GGCTTCAACT TAGACGCGGA GGCCCCAGCA GTACTCTCGG GGCCCCCGGG CTCCTTCTTC 180 

GGATTCTCAG TGGAGTTTTA CCGGCCGGGA ACAGACGGGG TCAGTGTGCT GGTGGGAGCA 240 

CCCAAGGCTA ATACC AG CCA GCCAGGAGTG CTGCAGGGTG GTGCTGTCTA CCTCTGTCCT 300 

TGGGGTGCCA GCCCCACACA GTGCACCCCC ATTGAATTTG ACAGCAAAGG CTCTCGGCTC 3 60 

CTGGAGTCCT CACTGTCCAG CTCAGAGGGA GAGGAGCCTG TGGAGTACAA GTCCTTGCAG 420 

TGGTTCGGGG CAACAGTTCG AGCCCATGGC TCCTCCATCT TGGCATGCGC TCCACTGTAC 480 

AGCTGGCGCA CAGAGAAGGA GCCACTGAGC GACCCCGTGG GCACCTGCTA CCTCTCCACA 540 

GATAACTTCA CCCGAATTCT GGAGTATGCA CCCTGCCGCT CAGATTTCAG CTGGGCAGCA 600 

GGACAGGGTT ACTGCCAAGG AGGCTTCAGT GCCGAGTTCA CCAAGACTGG CCGTGTGGTT 660 

TTAGGTGGAC CAGGAAGCTA TTTCTGGCAA GGCCAGATCC TGTCTGCCAC TCAGGAGCAG 720 

ATTGCAGAAT CTTATTACCC CGAGTACCTG ATCAACCTGG TTCAGGGGCA GCTGCAGACT 780 

CGCCAGGCCA GTTCCATCTA TGATGACAGC TACCTAGGAT ACTCTGTGGC TGTTGGTGAA 840 

TTCAGTGGTG ATGACACAGA AGACTTTGTT GCTGGTGTGC CCAAAGGGAA CCTCACTTAC 900 

GGCTATGTCA CCATCCTTAA TGGCTCAGAC ATTCGATCCC TCTACAACTT CTCAGGGGAA 960 

CAGATGGCCT CCTACTTTGG CTATGCAGTG GCCGCCACAG ACGTCAATGG GGACGGGCTG 1020 

GATGACTTGC TGGTGGGGGC ACCCCTGCTC ATGGATCGGA CCCCTGACGG GCGGCCTCAG 1080 

GAGGTGGGCA GGGTCTACGT CTACCTGCAG CACCCAGCCG GCATAGAGCC CACGCCCACC 1140 

CTTACCCTCA CTGGCCATGA TGAGTTTGGC CGATTTGGCA GCTCCTTGAC CCCCCTGGGG 1200 

GACCTGGACC AGGATGGCTA CAATGATGTG GCCAT CGGGG CTCCCTTTGG TGGGGAGACC 12 60 

CAGCAGGGAG TAGTGTTTGT ATTTCCTGGG GGCCCAGGAG GGCTGGGCTC TAAGCCTTCC 132 0 

CAGGTTCTGC AGCCCCTGTG GGCAGCCAGC CACACCCCAG ACTTCTTTGG CTCTGCCCTT 13 80 

CGAGGAGGCC GAGACCTGGA TGGCAATGGA TATCCTGATC TGATTGTGGG GTCCTTTGGT 1440 

GTGGACAAGG CTGTGGTATA CAGGGGCCGC CCCATCGTGT CCGCTAGTGC CTCCCTCACC 1500 

ATCTTCCCCG CCATGTTCAA CCCAGAGGAG CGGAGCTGCA GCTTAGAGGG GAACCCTGTG 1560 

GCCTGCATCA ACCTTAGCTT CTGCCTCAAT GCTTCTGGAA AACACGTTGC TGACTCCATT 162 0 

GGTTTCACAG TGGAACTTCA GCTGGACTGG CAGAAGCAGA AGGGAGGGGT ACGGCGGGCA 1680 

CTGTTCCTGG CCTCCAGGCA GGCAACCCTG ACCCAGACCC TGCTCATCCA GAATGGGGCT 1740 

CGAGAGGATT GCAGAGAGAT GAAGATCTAC CTCAGGAACG AGTCAGAATT TCGAGACAAA 18 00 

CTCTCGCCGA TTCACATCGC TCTCAACTTC TCCTTGGACC CCCAAGCCCC AGTGGACAGC 1860 

CACGGCCTCA GGCCAGCCCT ACATTATCAG AGCAAGAGCC GGATAGAGGA CAAGGCTCAG 192 0 

ATCTTGCTGG ACTGTGGAGA AGACAACATC TGTGTGCCTG ACCTGCAGCT GGAAGTGTTT 1980 

GGGGAGCAGA ACCATGTGTA CCTGGGTGAC AAGAATGCCC TGAACCTCAC TTTCCATGCC 2040 

CAGAATGTGG GTGAGGGTGG CGCCTATGAG GCTGAGCTTC GGGTCACCGC CCCTCCAGAG 2100 

GCTGAGTACT CAGGACTCGT CAGACACCCA GGGAACTTCT CCAGCCTGAG CTGTGACTAC 2160 

TTTGCCGTGA ACCAGAGCCG CCTGCTGGTG TGTGACCTGG GCAACCCCAT GAAGGCAGGA 2220 

GCCAGTCTGT GGGGTGGCCT TCGGTTTACA GTCCCTCATC TCCGGGACAC TAAGAAAACC 2280 

ATCCAGTTTG ACTTCCAGAT CCTCAGCAAG AATCTCAACA ACTCGCAA^G CGACGTGGTT 2 340 

TCCTTTCGGC TCTCCGTGGA GGCTCAGGCC CAGGTCACCC TGAACGGTGT CTCCAAGCCT 2400 

GAGGCAGTGC T ATT CCCAGT AAGCGACTGG CATCCCCGAG ACCAGCCTCA GAAGGAGGAG 2460 

GACCTGGGAC CTGCTGTCCA CCATGTCTAT GAG CTCATCA ACCAAGGCCC CAGCTCCATT 2520 

AGCCAGGGTG TGCTGGAACT CAGCTGTCCC CAGGCTCTGG AAGGT CAGCA GCTCCTATAT 2580 

GTGACCAGAG TTACGGGACT CAACTGCACC ACCAATCACC CCATTAACCC AAAGGGCCTG 2 640 

GAGTTGGATC CCGAGGGTTC CCTGCACCAC CAGCAAAAAC GGGAAGCTCC AAGCCGCAGC 2700 

TCTGCTTCCT CGGGACCTCA GAT CCTGAAA TGCCCGGAGG CTGAGTGTTT CAGGCTGCGC 2760 

TGTGAGCTCG GGCCCCTGCA CCAACAAGAG AGCCAAAGTC TGCAGTTGCA TTTCCGAGTC 2820 

TGGG CCAAGA CTTTCTTGCA GCGGGAGCAC CAGCCATTTA GCCTGCAGTG TGAGGCTGTG 2880 

TACAAAGCCC TGAAGATGCC CTACCGAATC CTGCCTCGGC AGCTGCCCCA AAAAGAGCGT 2 940 

CAGGTGGCCA CAGCTGTGCA ATGGACCAAG GCAGAAGGCA GCTATGGCGT CCCACTGTGG 3000 

ATCATCATCC TAGCCAT CCT GTTTGGCCTC CTGCTCCTAG GTCTACTCAT CTACATCCTC 3060 

TACAAGCTTG GATTCTTCAA ACGCTCCCTC CCATATGGCA CCGCCATGGA AAAAGCTCAG 312 0 
CTCAAGCCTC CAGCCACCTC TGATGCCTGA 
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Seq ID NOs 387 Protein sequence 
Protein Accession #: NP_002 196.1 

5 1 11 21 31 41 51 

I i I I 1 I 

MGSRTPESPL HAVQLRWGPR RRPPLLPLLL LLLPPPPRVG GFNLDAEAPA VLSGPPGSFF 60 

GFSVEFYRPG TDGVSVLVGA PKANTSQPGV LQGGAVYLCP WGASPTQCTP IEFDSKGSRL 12 0 

LESSLSSSEG EEPVEYKSLQ WFGATVRAHG SSILACAPLY SWRTEKEPLS DPVGTCYLST 180 

10 DNFTRILEYA PCRSDFSWAA GQGYCQGGFS AEFTKTGRW LGGPGSYFWQ GQIIiSATQEQ 240 

IAESYYPEYL 'INLVQGQLQT RQASSIYDDS YLGYSVAVGE FSGDDTEDFV AGVPKGNLTY 300 

GYVTILNGSD IRSLYNFSGE QMASYFGYAV AATDVNGDGL DDLLVGAPLL MDRTPDGRPQ 360 

EVGRVYVYLQ HPAGIEPTPT LTLTGHDEFG RFGSSLTPLG DLDQDGYNDV AIGAPFGGET 420 

QQGWFVFPG GPGGLGSKPS QVLQPLWAAS HTPDFFGSAL RGGRDLDGNG YPDLIVGSFG 4 80 

15 VDKAWYRGR PIVSASASLT IFPAMFNPEE RSCSLEGNPV ACINLSFCLN ASGKHVAD S I 540 

GFTVELQLDW QKQKGGVRRA LFLASRQATL TQTLLIQNGA REDCREMKIY LRNESEFRDK 600 

LSPIHIALNF SLDPQAPVDS HGLRPALHYQ SKSRIEDKAQ ILLDCGEDNI CVPDLQLEVF 660 

GEQNHVYLGD KNALNLTFHA QNVGEGGAYE AELRVTAPPE AEYSGLVRHP GNFSSLSCDY 720 

FAVNQSRLLV CDLGNPMKAG AS LWGGLRFT VPHLRDTKKT IQFDFQILSK NLNNSQSDW 780 

20 SFRLSVEAQA QVTLNGVSKP EAVLFPVSDW HPRDQPQKEE DLGPAVHHVY ELINQGPSSI 840 

SQGVLELSCP QALEGQQLLY VTRVTGLNCT TNHPINPKGL ELDPEGSLHH QQKREAPSRS 900 

SASSGPQILK CPEAECFRLR CELGPLHQQE SQSLQLHFRV WAKTFLQREH QPFSLQCEAV 960 

YKALKMPYRI LPRQLPQKER QVATAVQWTK AEGSYGVPLW IIILAILFGL LLLGLLIYIL 102 0 
YKLGFFKRSL PYGTAMEKAQ LKPPATSDA 

Seq' ID NO: 388 DNA sequence 

Nucleic Acid Accession #: NM_002425 

Coding sequence: 26.. 1453 

30 1 11 21 31 41 51 

I I I I I I 

AAAGAAGGTA AGGGCAGTGA GAATGATGCA TCTTGCATTC CTTGTGCTGT TGTGTCTGCC 60 

AGTCTGCTCT GCCTATCCTC TGAGTGGGGC AGCAAAAGAG GAGGACTCCA ACAAGGATCT 120 

TGCCCAGCAA TACCTAGAAA AGTACTACAA CCT CGAAAAG GATGTGAAAC AGTT TAGAAG 18 0 

35 AAAGGACAGT AATCTCATTG TTAAAAAAAT CCAAGGAATG CAGAAGTTCC TTGGGTTGGA 240 

GGTGACAGGG AAGCTAGACA CTGACACTCT GGAGGTGATG CGCAAGCCCA GGTGTGGAGT 300 

TCCTGACGTT GGTCACTTCA GCTCCTTTCC TGGCATGCCG AAGTGGAGGA AAACCCACCT 360 

TACATACAGG ATTGTGAATT ATACACCAGA TTTGCCAAGA GATGCTGTTG ATTCTGCCAT 42 0 

TGAGAAAGCT CTGAAAGTCT GGGAAGAGGT GACTCCACTC ACATTCTCCA GGCTGTATGA 480 

40 AGGAGAGGCT GATATAATGA TCTCTTTCGC AGTTAAAGAA CATGGAGACT TTTACTCTTT 540 

TGATGGCCCA GGACACAGTT TGGCTCATGC CTACCCACCT GGACCTGGGC TTTATGGAGA 600 

TATTCACTTT GATGATGATG AAAAATGGAC AGAAGATGCA TCAGGCACCA ATTTATTCCT 660 

CGTTGCTGCT CATGAACTTG GCCACTCCCT GGGGCTCTTT CACTCAGCCA ACACTGAAGC 72 0 

TTTGATGTAC CCACTCTACA ACTCATTCAC AGAGCTCGCC CAGTTCCGCC TTTCGCAAGA 780 

45 TGATGTGAAT GGCATTCAGT CTCTCTACGG ACCTCCCCCT GCCTCTACTG AGGAACCCCT 840 

GGTGCCCACA AAATCTGTTC CTTCGGGATC TGAGATGCCA GCCAAGTGTG ATCCTGCTTT 900 

GTCCTTCGAT GCCATCAGCA CTCTGAGGGG AGAATATCTG TTCTTTAAAG ACAG AT AT T T 960 

TTGGCGAAGA TCCCACTGGA ACCCTGAACC TGAATTTCAT TTGATTTCTG CATTTTGGCC 1020 

CTCTCTTCCA TCATATTTGG ATGCTGCATA TGAAGTTAAC AGCAGGGACA CCGTTTTTAT 1080 

50 TTTTAAAGGA AATGAGT TCT GGGCCATCAG AGGAAATGAG GTACAAGCAG GTTATCCAAG 114 0 

AGGCATCCAT ACCCTGGGTT TTCCTCCAAC CATAAGGAAA ATTGATGCAG CTGTTTCTGA 1200 

CAAGGAAAAG AAGAAAACAT ACTTCTTTGC AGCGGACAAA TACTGGAGAT TTGATGAAAA 12 60 

TAGCCAGTCC ATGGAGCAAG GCTTCCCTAG ACTAATAGCT GATGACTTTC CAGGAGTTGA 132 0 

GCCTAAGGTT GATGCTGTAT TACAGGCATT TGGATTTTTC TACTTCTTCA GTGGATCATC 1380 

55 ACAGTTTGAG TTTGACCCCA ATGCCAGGAT GGTGACACAC ATATTAAAGA GTAACAGCTG 1440 

GTTACATTGC TAGG CGAGAT AGGGGGAAGA CAGATATGGG TGTTTTTAAT AAATCTAATA 1500 

ATTATTCATC TAATGTATTA TGAGCCAAAA TGGTTAATTT TTCCTGCATG TTCTGTGACT 1560 

GAAGAAGATG AGCCTTGCAG ATATCTGCAT GTGTCATGAA GAATGTTTCT GGAATTCTTC 1620 

ACTTGCTTTT GAATTGCACT GAACAGAATT AAGAAATACT CATGTGCAAT AGGTGAGAGA 1680 

60 ATGTATTTTC ATAGATGTGT TATTACTTCC TCAATAAAAA GTTTTATTTT GGGCCTGTTC 1740 
CTT 

Seq ID NO: 389 Protein sequence 
Protein Accession #: NP_002416 

65 

1 11 21 31 41 51 

I I I 1 I I 

MHLAFLVLLC LPVCSAYPLS GAAKEED SNK DLAQQYLEKY YNLEKDVKQF RRKDSNLIVK 60 

KIQGMQKFLG LEVTGKLDTD TLEVMRKPRC GVPDVGHFSS FPGMPKWRKT HLTYRIVNYT 120 

70 PDLPRDAVDS AIEKALKVWE EVTPLTFSRL YEGEADIMIS FAVKEHGDFY SFDGPGHSLA 180 

HAYPPGPGLY GDIHFDDDEK WTEDASGTNL FLVAAHELGH SLGLFHSANT EALMYPLYNS 240 

FTELAQFRLS QDDVNGIQSIi YGPPPASTEE PLVPTKSVPS GSEMPAKCDP ALSFDAISTL 300 

RGEYLFFKDR YFWRRSHWNP EPEFHLISAF WPSLPSYLDA AYEVNSRDTV FIFKGNEFWA 360 

IRGNEVQAGY PRGIHTLGFP PTIRKIDAAV SDKEKKKTYF FAADKYWRFD ENSQSMEQGF 420 

75 PRLIADDFPG VEPKVDAVLQ AFGFFYFFSG SSQFEFDPNA RMVTHILKSN SWLHC 

Seq ID NO: 390 DNA sequence 

Nucleic Acid Accession #: NM_002421.2 

Coding sequence: 1..1409 

1 11 21 31 41 51 

I l I l I l 

ATGCACAGCT TTCCTCCACT GCTGCTGCTG CTGTTCTGGG GTGTGGTGTC ACACAGCTTC 60 

CCAGCGACTC TAGAAACACA AGAGCAAGAT GTGGACTTAG TCCAGAAATA CCTGGAAAAA 120 

85 TACTACAACC TGAAGAATGA TGGGAGGCAA GTTGAAAAGC GGAGAAATAG TGGCCCAGTG 180 

GTTGAAAAAT TGAAGCAAAT GCAGGAATTC TTTGGGCTGA AAGTGACTGG GAAACCAGAT 240 

GCTGAAACCC TGAAGGTGAT GAAGCAGCCC AGATGTGGAG TGCCTGATGT GGCTCAGTTT 300 
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GTCCTCACTG AGGGGAACCC TCGCTGGGAG CAAACAC AT C 
TACACGCCAG ATTTGCCAAG AGCAGATGTG GACCATGCCA 
TGGAGTAATG TCACACCTCT GACATTCACC AAGGTCTCTG 
ATATCTTTTG TCAGGGGAGA T CATCGGG AC AACTCTCCTT 
CTTGCTCATG CTTTTCAACC AGGCCCAGGT ATTGGAGGGG 
GAAAGGTGGA CCAACAATTT CAGAGAGTAC AACTTACATC 
GGCCATTCTC TTGGACTCTC CCATT CTACT GATATCGGGG 
ACCTTCAGTG GTGATGTTCA GCTAGCTCAG GATGACATTG 
GGACGTTCCC AAAATCCTGT CCAGCCCATC GGCCCACAAA 
AAGCTAACCT TTGATGCTAT AACTACGATT CGGGGAGAAG 
TTCTACATGC GCACAAATCC CTTCTACCCG GAAGTTGAGC 
TGGCCACAAC TGCCAAATGG GCTTGAAGCT GCTTACGAAT 
CGGTTTTTCA AAGGGAATAA GTACTGGGCT GTTCAGGGAC 
CCCAAGGACA TCTACAGCTC CTTTGGCTTC CCTAGAACTG 
CTTTCTGAGG AAAACACTGG AAAAACCTAC TTCTTTGTTG 
GATGAATATA AACGATCTAT GGATCCAGGT TATCCCAAAA 
GGAATTGGCC ACAAAGTTGA TGCAGTTTTC ATGAAAGATG 
GGAACAAGAC AATACAAATT TGATCCTAAA ACGAAGAGAA 
AATAGCTGGT TCAACTGCAG GAAAAATTAG 

Seq ID NO: 3 91 Protein sequence 
Protein Accession #: NP 002412.1 



PCT/US02/12476 



MHSFPPLLLL 
VEKLKQMQEF 
YTPDLPRADV 
LAHAFQPGPG 
TFSGDVQLAQ 
FYMRTNPFYP 
PKDIYSSFGF 
GIGHKVDAVF 



11 

I 

LFWGWSH S F 
FGLKVTGKPD 
DHAIEKAFQL 
IGGDAHFDED 
DDIDGIQAIY 
EVELNFISVF 
PRTVKHIDAA 
MKDGFFYFFH 



21 
I 

PATLETQEQD 
AETLKVMKQP 
WSNVTPLTFT 
ERWTNNFREY 
GRSQNPVQPI 
WPQLPNGLEA 
LSEENTGKTY 
GTRQYKFDPK 



31 

I 

VDLVQKYLEK 
RCGVPDVAQF 
KVSEGQADIM 
NLHRVAAHEL 
GPQTPKACDS 
AYEFADRDEV 
FFVANKYWRY 
TKRILTLQKA 



Seq ID NO: 392 DNA sequence 

Nucleic Acid Accession #: NM_002421.2 

Coding sequence: 1..1409 



ATGCACAGCT 
CCAGCGACTC 
TACTACAACC 
GTTGAAAAAT 
GCTGAAACCC 
GTCCTCACTG 
TACACGCCAG 
TGGAGTAATG 
ATATCTTTTG 
CTTGCTCATG 
GAAAGGTGGA 
GGCCATTCTC 
ACCTTCAGTG 
GGACGTTCCC 
AAGCTAACCT 
TTCTACATGC 
TGGCCACAAC 
CGGTTTTTCA 
CCCAAGGACA 
CTTTCTGAGG 
GATGAATATA 
GGAATTGGCC 
GGAACAAGAC 
AATAGCTGGT 



11 

I 

TTCCTCCACT 
TAGAAACACA 
TGAAGAATGA 
TGAAGCAAAT 
TGAAGGTGAT 
AGGGGAACCC 
ATTTGCCAAG 
TCACACCTCT 
TCAGGGGAGA 
CTTTTCAACC 
CCAACAATTT 
TTGGACTCTC 
GTGATGTTCA 
AAAATCCTGT 
TTGATGCTAT 
GCACAAATCC 
TGCCAAATGG 
AAGGGAATAA 
TCTACAGCTC 
AAAACACTGG 
AACGATCTAT 
ACAAAGTTGA 
AATACAAATT 
TCAACTGCAG 



21 
I 

GCTGCTGCTG 
AGAGCAAGAT 
TGGGAGGCAA 
GCAGGAATTC 
GAAGCAGCCC 
TCGCTGGGAG 
AGCAGATGTG 
GACATTCACC 
T CAT CGGGAC 
AGGCCCAGGT 
CAGAGAGTAC 
CCATT CTACT 
GCTAGCTCAG 
CCAGCCCATC 
AACTACGATT 
CTTCTACCCG 
GCTTGAAGCT 
GTACTGGGCT 
CTTTGGCTTC 
AAAAACCTAC 
GGATCCAGGT 
TGCAGTTTTC 
TGATCCTAAA 
GAAAAATTAG 



31 

CTGTTCTGGG 
GTGGACTTAG 
GTTGAAAAGC 
TTTGGGCTGA 
AGATGTGGAG 
CAAACAC AT C 
GACCATGCCA 
AAGGTCTCTG 
AACTCTCCTT 
ATTGGAGGGG 
AACTTACATC 
GATATCGGGG 
GATGACATTG 
GGCCCACAAA 
CGGGGAGAAG 
GAAGTTGAGC 
GCTTACGAAT 
GTTCAGGGAC 
CCTAGAACTG 
TTCTTTGTTG 
TATCCCAAAA 
ATGAAAGATG 
ACGAAGAGAA 



Seq ID NO: 393 Protein sequence 
Protein Accession #: NP__002412.1 



1 

I 

MHSFPPLLLL 
VEKLKQMQEF 
YTPDLPRADV 
LAHAFQPGPG 
TFSGDVQLAQ 
FYMRTNPFYP 
PKDIYSSFGF 
GIGHKVDAVF 



11 

I 

LFWGWSH SF 
FGLKVTGKPD 
DHAIEKAFQL 
IGGDAHFDED 
DDIDGIQAIY 
EVELNFISVF 
PRTVKHIDAA 
MKDGFFYFFH 



21 
I 

PATLETQEQD 
AETLKVMKQP 
WSNVTPLTFT 
ERWTNNFREY 
GRSQNPVQPI 
WPQLPNGLEA 
LSEENTGKTY 
GTRQYKFDPK 



31 

I 

VDLVQKYLEK 
RCGVPDVAQF 
KVSEGQADIM 
NLHRVAAHAL 
GPQTPKACDS 
AYEFADRDEV 
FFVANKYWRY 
TKRILTLQKA 
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GHSLGLSHST 
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240 


KLTFDAITTI 


RGEVMFFKDR 


300 


RFFKGNKYWA 


VQGQNVLHGY 


360 


DEYKRSMDPG 


YPKMIAHDFP 


420 


NSWFNCRKN 







Seq ID NO: 3 94 DNA sequence 

Nucleic Acid Accession #: NM_014331. 

Coding sequence: 1..1506 
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ATGGTCAGAA AGCCTGTTGT GTCCACCATC TCCAAAGGAG GTTACCTGCA GGGAAATGTT 60 

AACGGGAGGC TGCCTTCCCT GGGCAACAAG GAGCCACCTG GGCAGGAGAA AGTGCAGCTG 120 

AAGAGGAAAG TCACTTTACT GAGGGGAGTC TCCATTATCA TTGGCACCAT CATTGGAGCA 180 

GGAATCTTCA TCTCTCCTAA GGGCGTGCTC CAGAACACGG GCAG CGTGGG CATGTCTCTG 240 

5 ACCATCTGGA CGGTGTGTGG GGTCCTGTCA CTATTTGGAG CTTTGTCTTA TGCTGAATTG 300 

GGAACAACTA TAAAGAAATC TGGAGGTCAT TACACATATA TTTTGGAAGT CTTTGGTCCA 360 

TTACCAGCTT TTGTACGAGT CTGGGTGGAA CTCCTCATAA TACGCCCTGC AGCTACTGCT 42 0 

GTGATATCCC TGGCATTTGG ACGCTACATT CTGGAACCAT TTTTTATTCA ATGTGAAATC 480 

CCTGAACTTG CGATCAAGCT CATTACAGCT GTGGGCATAA CTGTAGTGAT GGTCCTAAAT 540 

10 AGCATGAGTG TCAGCTGGAG CGCCCGGATC CAGATTTTCT TAACCTTTTG CAAGCTCACA 60 0 

GCAATTCTGA TAATTATAGT CCCTGGAGTT ATGCAGCTAA TTAAAGGTCA AACGCAGAAC 660 

TTTAAAGACG CGTTTTCAGG AAGAGATTCA AGTATTACGC GGTTGCCACT GGCTTTTTAT 720 

TATGGAATGT ATGCATATGC TGGCTGGTTT TACCTCAACT TTGTTACTGA AGAAGTAGAA 780 

AACCCTGAAA AAACCATTCC CCTTGCAATA TGTATATCCA TGGCCATTGT CACCATTGGC 84 0 

15 TATGTGCTGA CAAATGTGGC CTACTTTACG ACCATTAATG CTGAGGAGCT GCTGCTTTCA 900 

AATGCAGTGG CAGTGACCTT TTCTGAGCGG CTACTGGGAA ATTTCTCATT AGCAGTTCCG 960 

ATCTTTGTTG CCCTCTCCTG CTTTGGCTCC ATGAACGGTG GTGTGTTTGC TGTCTCCAGG 1020 

TTATTCTATG TTGCGTCTCG AGAGGGTCAC CTT CCAGAAA TCCTCTCCAT GATTCATGTC 1080 

CGCAAGCACA CTCCTCTACC AGCTGTTATT GTTTTGCACC CTTTGACAAT GATAATGCTC 1140 

20 TTCTCTGGAG ACCTCGACAG TCTTTTGAAT TTCCTCAGTT TTGCCAGGTG GCTTTTTATT 12 00 

GGGCTGGCAG TTGCTGGGCT GATTTATCTT CGATACAAAT GCCCAGATAT GCATCGTCCT 1260 

TTCAAGGTGC CACTGTTCAT CCCAGCTTTG TTTTCCTTCA CATGCCTCTT CATGGTTGCC 1320 

CTTTCCCTCT ATTCGGACCC ATTTAGTACA GGGATTGGCT TCGTCATCAC TCTGACTGGA 1380 

GTCCCTGCGT ATTATCTCTT TAT TATATGG GACAAGAAAC CCAGGTGGTT TAGAATAATG 1440 

25 TCAGAGAAAA TAACCAGAAC ATTACAAATA ATACTGGAAG TTGTACCAGA AGAAGATAAG 1500 

TTATGAACTA ATGGACTTGA GATCTTGGCA ATCTGCCCAA GGGGAGACAC AAAATAGGGA 1560 

TTTTTACTTC ATTTTCTGAA AGTCTAGAGA ATTACAACTT TGGTGATAAA CAAAAGGAGT 162 0 

CAGTTATTTT TATTCATATA TTTTAGCATA TTCGAACTAA TTTCTAAGAA ATTTAGTTAT 1680 

AACTCTATGT AGTTATAGAA AGTGAATATG CAGTTATTCT ATGAGTCGCA CAATTCTTGA 1740 

30 GTCTCTGATA CCTACCTATT GGGGTTAGGA GAAAAGACTA GACAATTACT ATGTGGT CAT 180 0 

TCTCTACAAC ATATGTTAGC ACGGCAAAGA ACCTTCAAAT TGAAGACTGA GATTTTTCTG 1860 

TATATATGGG TTTTGTAAAG ATGGTTTTAC ACACTACAGA TGTCTATACT GTGAAAAGTG 1920 

TTTTCAATTC TGAAAAAAAG CATACATCAT GATTATGGCA AAGAGGAGAG AAAGAAATTT 1980 

ATTTTACATT GACATTGCAT TGCTTCCCCT TAGATACCAA TTTAGATAAC AAACACTCAT 2 04 0 

35 GCTTTAATGG ATTATACCCA GAGCACTTTG AACAAAGGTC AGTGGGGATT GTTGAATACA 2100 

TTAAAGAAGA GTTTCTAGGG GCTACTGTTT ATGAGACACA TCCAGGAGTT ATGTTTAAGT 2160 

AAAAATCCTT GAGAATTTAT TATGTCAGAT GTTTTTTCAT TCATTATCAG GAAGTTTTAG 2220 

TTATCTGTCA TTTTTTTTTT TCACATCAGT TTGATCAGGA AAGTGTATAA CACATCTTAG 22 8 0 

AGCAAGAGTT AGTTTGGTAT TAAATCCTCA TTAGAACAAC CACCTGTTTC ACTAATAACT 2340 

40 TACCCCTGAT GAGTCTATCT AAACATATGC ATTTTAAGCC TTCAAATTAC ATTATCAACA 2400 

TGAGAGAAAT AACCAACAAA GAAGATGTTC AAAATAATAG TCCCATATCT GTAATCATAT 2460 

CTACATGCAA TGTTAGTAAT TCTGAAGTTT TTTAAATTTA TGGCTATTTT TACACGATGA 2520 

TGAATTTTGA CAGTTTGTGC ATTTTCTTTA TACATTTTAT ATTCTTCTGT TAAAATATCT 258 0 

CTTCAGATGA AACTGTCCAG ATTAATTAGG AAAAGGCATA TATTAACATA AAAATTGCAA 2 640 

45 AAGAAATGTC GCTGTAAATA AGATT TACAA CTGATGTTTC TAGAAAATTT CCACTTCTAT 2700 

AT CTAGGCTT TGTCAGTAAT TTCCACACCT TAATTATCAT TCAACTTGCA AAAGAGACAA 2760 

CTGATAAGAA GAAAATTGAA ATGAGAATCT GTGGATAAGT GTTTGTGTTC AGAAGATGTT 2820 

GTTTTGCCAG TATTAGAAAA TACTGTGAGC CGGGCATGGT GGCTTACATC TGTAATCCCA 288 0 

GCACTTTGGG AGGCTGAGGG GGTGGATCAC CTGAGGTCGG GAGTTCTAGA CCAGCCTGAC 2 940 

50 CAACATGGAG AAACCCCATC TCTACTAAAA ATACAAAATT AGCTGGGCAT GGTGGCACAT 3000 

GCTGGTAATC TCAGCTATTG AGGAGGCTGA GGCAGGAGAA TTGCTTGAAC CCGGGAGGCG 3060 

GAGGTTGCAG TGAGCCAAGA TTGCACCACT GTACTCCAGC CTGGGTGACA AAGTCAGACT 3120 
CCATCTCCAA AAAAAAAAAA AAAA 

55 Seq ID NO: 3 95 Protein sequence 
Protein Accession #: NP_055146.1 



PCT/US02/12476 



60 
65 
70 
75 
80 
85 



MVRKPWSTI 
GIFISPKGVL 
LPAFVRVWVE 
SMSVSWSARI 
YGMYAYAGWF 
NAVAVTFSER 
RKHTPLPAVI 
FKVPLFIPAL 
SEKITRTLQI 



11 

I 

SKGGYLQGNV 
QNTGSVGMSL 
LLIIRPAATA 
QIFLTFCKLT 
YLNFVTEEVE 
LLGNFSLAVP 
VLHPLTMIML 
FSFTCLFMVA 
ILEWPEEDK 



21 
I 

NGRIiPSLGNK 
TIWTVCGVLS 
VISLAFGRYI 
AILIIIVPGV 
NPEKTIPLAI 
IFVALSCFGS 
FSGDLDSLLN 
LSLYSDPFST 
L 



31 
I 

EPPGQEKVQIi 
LFGALSYAEL 
LEPFFIQCEI 
MQLIKGQTQN 
CISMAITIGV 
MNGGVFAVSR 
FLSFARWLFI 
GIGFVITLTG 



41 
I 

KRKVTLIiRGV 
GTTIKKSGGH 
PELAIKLITA 
FKDAFSGRDS 
YVLTNVAYFT 
LFYVASREGH 
GLAVAGLIYL 
VPAYYLFIIW 



51 

I 

SIIIGTIIGA 
YTYILEVFGP 
VG I TWMVLN 
SITRLPLAFY 
TINAEELLLS 
LPEILSMIHV 
RYKCPDMHRP 
DKKPRWFRIM 



Seq ID NO: 396 DNA sequence 

Nucleic Acid Accession #: NM_006528 

Coding sequence: 57.. 764 



GCCGCCAGCG 
ACCCCGCTCG 
GCGATGCTGC 
ACGGACCCTG 
GCCAGTTCCT 
GCGACGATGC 
TGGACGACCA 
GTGAAAAATT 
AAGCTACTTG 
AAGATGAGGG 
CCTGTGATGC 



11 
I 

GCTTTCTCGG 
CCCCCTGGGG 
TCAGGAGCCA 
CCGGGCCCTA 
GTACGGGGGC 
TTGCTGGAGG 
GTGTGAGGGG 
CTTTTCCGGT 
TATGGGCTTC 
ACTGTGCTCT 
TTTCACCTAT 



21 
I 

ACGCCTTGCC 
CTGTCGATTC 
ACAGGAAATA 
CTTCTCCGTT 
TGCGAGGGCA 
ATAGAAAAAG 
TCCACAGAAA 
GGGTGTCACC 
TGCGCACCAA 
GCCAATGTGA 
ACTGGCTGTG 



31 
I 

CAGCGGGCCG 
TGCTGCTTTT 
ACGCGGAGAT 
ACTACTACGA 
ACGCCAACAA 
TTCCCAAAGT 
AGTATTTCTT 
GGAACCGGAT 
AGAAAATTCC 
CTCGCTATTA 
GAGGGAATGA 



41 
I 

CCCGACCCCC 
CCTGACGGAG 
CTGTCTCCTG 
CAGGTACACG 
TTTCTACACC 
TTGCCGGCTG 
TAATCTAAGT 
TGAGAACAGG 
ATCATTTTGC 
TTTTAATCCA 
CAATAACTTT 



51 

I 

TGCACCATGG 
GCTGCACTGG 
CCCCTAGACT 
CAGAGCTGCC 
TGGGAGGCTT 
CAAGTGAGTG 
TCCATGACAT 
TTTCCAGATG 
TACAGTCCAA 
AGATACAGAA 
GTTAGCAGGG 



60 
120 
180 
240 
300 
360 
420 
480 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



332 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

AGGATTGCAA ACGTGCATGT GCAAAAGCTT 
GCTTTGCCAG TAGAATCCGG AAAATTCGGA 
ATCTTGTTTG TCTTTATGGC TTATTTGCCT 
GCATGAGGAA ACAAATCATT GGTGATTTAT 
TTCAAAAATT TGGATTTTTT TATATATAAC 
TTTAATTTAT GGTTCAACTG TTTGTGAGAC 
AAATATGACT CACTCATTTC TTGGGGTCGT 
AAACAACATA AGACAATATA ATCATGTGCT 
CC 

Seq ID NO: 3 97 Protein sequence 
Protein Accession #: NP_006519 

21 



TGAAAAAGAA 
AGAAGCAATT 
TTATGGTTGT 
TCACCAGTTT 
TAGCTGCTAT 
GAATTCTTGC 
ATTCCTGATT 
TTTAACATAT 



AAAGAAGATG 
TTAAACATTC 
ATCTGAAGAA 
TTATTAATAC 
TCAAATGTGA 
AATGCATAAG 
TCAGAAGAGG 
TTGAGAATAA 



CCAAAGCTTC 
TTAATATGTC 
TAATATGACA 
AAGTCACTTT 
GTCTACCATT 
ATATAAAAGC 
ATCATAACTG 
AAAGGACTAG 



51 



720 
780 
840 
900 
960 
1020 
1080 
1140 



PCT/US02/12476 



11 21 31 41 

I I I I I I 

MDPARPLGLS ILLLFLTEAA LGDAAQEPTG NNAEICLLPL DYGPCRALLL RYYYDRYTQS 
CRQFLYGGCE GNANNFYTWE ACDDACWRIE KVPKVCRLQV SVDDQCEGST EKYFFNLSSM 
TCEKFFSGGC HRNRIENRFP DEATCMGFCA PKKIPSFCYS PKDEGLCSAN VTRYYFNPRY 
RTCDAFTYTG CGGNDNNFVS REDCKRACAK ALKKKKKMPK LRFASRIRKI RKKQF 

Seq ID NO: 398 DNA sequence 

Nucleic Acid Accession #: NM_001508.1 

Coding sequence: 1.-1361 



ATGGCTTCAC 
CCCGAGTTTG 
TTCGTGATGG 
AAAGGATACT 
TTGGTGTTCC 
ACGTCCAGCT 
GCTACGCTGC 
TTCAGGTACA 
GTCACCTCCG 
GTGAACGTGC 
CAGCCCGAGA 
CAGTCCAGCA 
ATGTGCTGGA 
ACGCGGCCTC 
ACCATCATCT 
ATT CGGAGG A 
GCGTACATGA 
CCGCTCCTGT 
TGCCGCCTGT 
ACCACCGACA 
TCTGCAAGGA 
TCTAAGTCCC 
AATTCTGCTG 



11 
I 

CCAGCCTCCC 
AGGTGGCCAC 
GCCTTCTGGG 
TGCAGAAGGA 
TCATCGGCAT 
ACACCCTGTC 
TGCACGTGCT 
AGGCTGTGTC 
CCCTGGTGGC 
CCAGCCAGCG 
CCTCCAATAT 
TCTTCGGCGC 
ACATGATGCA 
CGCAGCTGAG 
TCCTGAGGCT 
TCATGGCTGC 
TCCTCCTCCC 
AC ACGGTGT C 
CGCTGCAGCA 
GCGCCCGCTT 
GAACTGAGAA 
AGTCATTGAG 
CAGAGAATGG 



21 

I 

GGGCAGTGAC 
CTGGATCAAA 
GAACAGCGTC 
GGTGACAGAC 
GCCCATGGAG 
CTGCAAGCTG 
GACGCTCAGC 
GGGACCTTGC 
ACTGCCCTTG 
GGGTCTCACT 
GTCCATCTGT 
CTTCGTGGTC 
GGTGCTCATG 
GAAGTCCGAG 
GATTGTTGTG 
GGCCAAACCC 
CTTCTCGGAG 
CTCGCAGCAG 
CGCCAACCAC 
TGTGCAGCGC 
GATTTTCTTA 
TCTCGAGTCA 
TTTT CAGGAG 



31 
I 

TGCTCCCAAA 
AT CACCCTTA 
ACCATTCGGG 
CACATGGTGA 
TTCTACAGCA 
CACACTTTCC 
TTTGAGCGCT 
CAGGTGAAGC 
CTGTTTGCCA 
TGCAACCGCT 
ACCAACCTCT 
TACCTCGTGG 
AAAAGCCAGA 
AGCGAAGAGA 
ACATTGGCCG 
AAGCACGACT 
ACGTTTTTCT 
TTTCGGCGGG 
GAGAAGCGCC 
CCGTTGCTCT 
AGCACTTTTC 
CTAGAGCCCA 
CATGAAGTTT 



41 
I 

TCATTGATCA 
TTCTGGTGTA 
TCACCCAGGT 
GTTTGGCTTG 
TCATCTGGAA 
TCTTCGAGGC 
ACATCGCCAT 
TGCTGATTGG 
TGGGTACTGA 
CCAGCACCCG 
CCAGCCGCTG 
TCCTGCTCTC 
AGGGCTCGCT 
GCAGGACCGC 
TATGCTGGAT 
GGACGAGGTC 
ACCTCAGCTC 
TGTTCGTGCA 
TGCGCGTACA 
TCGCGTCCCG 
AGAGCGAGGC 
ACTCAGGCGC 
GA 



51 
I 

CAGTCATGT C 
CCTGATCATC 
GCTGCAGAAG 
CTCGGACATC 
TCCCCTGACC 
CTGCAGCTAC 
CTGTCACCCC 
CTTCGTCTGG 
GTACCCCCTG 
CCACCACGAG 
GACCGTGTTC 
CGTAGCCTTC 
GGCCGGGGGC 
CAGGAGGCAG 
GCCCAACCAG 
CTACTTCCGG 
GGT CAT C AAC 
GGTGCTGTGC 
TGCGCACTCC 
GCGCCAGTCC 
CGAGCCCCAG 
GAAACCAGCC 



Seq ID NO: 3 99 Protein sequence 
Protein Accession #: NPJD01499.1 



MASPSLPGSD 
KGYLQKEVTD 
ATLLHVLTLS 
VNVPSHRGLT 
MCWNMMQVLM 
IRRIMAAAKP 
CRLSLQHANH 
SKSQSLSLES 



11 
I 

CSQIIDHSHV 
HMVSLACSDI 
FERYIAICHP 
CNRSSTRHHE 
KSQKGSLAGG 
KHDWTRSYFR 
EKRIiRVHAH S 
LEPNSGAKPA 



21 
i 

PEFEVATWIK 
LVFLIGMPME 
FRYKAVSGPC 
QPETSNMSIC 
TRPPQLRKSE 
AYMILLPFSE 
TTDSARFVQR 
NSAAENGFQE 



31 

I 

ITLILVYLII 
FYSIIWNPLT 
QVKLLIGFVW 
TNLSSRWTVF 
SEESRTARRQ 
TFFYLSSVIN 
PLLFASRRQS 
HEV 



41 

I 

FVMGIiLGNSV 
TSSYTLSCKL 
VTSAbVALPL 
QSSIFGAFW 
TIIFLRLIW 
PLLYTVSSQQ 
SARRTEKIFL 



51 

I 

TIRVTQVLQK 
HTFLFEACSY 
LFAMGTEYPL 
YLWLLSVAF 
TLAVCWMPNQ 
FRRVFVQVLC 
STFQSEAEPQ 



Seq ID NO: 400 DNA sequence 

Nucleic Acid Accession #: NM_006475.1 

Coding sequence: 28.. 2538 



AACAGAACTG 
TTGCTGCTTA 
AGTCGTATCA 
ACCAAAAAGA 
AAAACGACTG 
TGCCCAGCAG 
ACAACGCAGC 
TTCACTTACT 
GGTTTGGAGA 
AAGAGAATGT 
TTGGGGCTTT 
ATCCATGGGA 
CAAATTGGTA 
GCAGCTGCCA 
TTTGCTCCCA 
GGAGACAAAG 
TCTGAGTCTA 



11 

i 

CAACGGAGAG 
TTGTTAACCC 
GGGGTCGGGA 
AATACTTCAG 
TTTTATATGA 
TTTTGCCCAT 
GCTATTCTGA 
TTGCACCGAG 
GCAACGTGAA 
TGACCAAGGA 
TCATTAACCA 
ACCAGATTGC 
CCT CAATTCA 
TCACATCGGA 
CCAATGAGGC 
TGGCTTCCGA 
TTATGGGAGG 



21 
I 

ACT CAAGATG 
TATAAACGCC 
CCAAGGCCCA 
CACTTGTAAG 
ATGTTGCCCT 
TGACCATGTT 
CGCCTCAAAA 
TAATGAGGCT 
TGTTGAATTA 
CTTAAAAAAT 
TTATCCTAAT 
AACAAATGGT 
AGACTTCATT 
CATATTGGAG 
TTTTGAGAAA 
AGCTCTTATG 
AGCAGTCTTT 



31 

I 

ATTCCCTTTT 
AACAATCATT 
AATGTCTGTG 
AACTGGTATA 
GGTTATATGA 
TATGGCACTC 
CTGAGGGAGG 
TGGGACAACT 
CTGAATGCTT 
GGCATGATTA 
GGGGTTGTCA 
GTTGTCCATG 
GAAGCAGAAG 
GCCCTTGGAA 
CTTCCACGAG 
AAGTACCACA 
GAGACGCTGG 



41 

I" 

TACCCATGTT 
ATGACAAGAT 
CCCTTCAACA 
AAAAGT C CAT 
GAATGGAAGG 
TGGGCAT CGT 
AGATCGAGGG 
TGGATTCTGA 
TACATAGTCA 
TTCCTTCAAT 
CTGTTAATTG 
TCATTGACCG 
ATGACCTTTC 
GAGACGGTCA 
GTGT CCTAGA 
TCTTAAATAC 
AAGGAAATAC 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



60 
120 
180 
240 
300 
360 
420 



51 
I 

TTCTCTACTA 
CTTGGCTCAT 
GATTTTGGGC 
CTGTGGACAG 
AATGAAAGGC 
GGGAGCCACC 
AAAGGGATCC 
TATCCGTAGA 
CATGATTAAT 
GTATAACAAT 
TGCTCGAATC 
TGTGCTTACA 
ATCTTTTAGA 
CTTCACACTC 
AAGGTTCATG 
TCTCCAGTGT 
AATTGAGATA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



333 



WO 02/086443 

GGATGTGACG GTGACAGTAT AACAGTAAAT GGAAT CAAAA TGGTGAACAA AAAGGATATT 1080 

GTGACAAATA ATGGTGTGAT CCATTTGATT GATCAGGTCC TAATTCCTGA TTCTGCCAAA 1140 

CAAGTTATTG AGCTGGCTGG AAAACAGCAA ACCACCTTCA CGGATCTTGT GGCCCAATTA 12 00 

GGCTTGGCAT CTGCTCTGAG GCCAGATGGA GAATACACTT TGCTGGCACC TGTGAATAAT 1260 

5 GCATTTT CTG ATGATACTCT CAGCATGGTT CAGCGCCTCC TTAAATTAAT TCTGCAGAAT 132 0 

CACATATTGA AAGTAAAAGT TGGCCTTAAT GAGCTTTACA ACGGGCAAAT ACTGGAAACC 13 80 

ATCGGAGGCA AACAGCTCAG AGTCTTCGTA TAT CGTACAG CTGTCTGCAT TGAAAATTCA 1440 

TGCATGGAGA AAGGGAGTAA G CAAGGG AG A AACGGTGCGA TTCACATATT CCGCGAGATC 1500 

ATCAAGCCAG CAGAGAAATC CCT CCATGAA AAGTTAAAAC AAGATAAGCG CTTTAGCACC 1560 

10 TTCCTCAGCC TACTTGAAGC TGCAGACTTG AAAGAGCTCC TGACACAACC TGGAGACTGG 1620 

ACATTATTTG TGCCAACCAA TGATGCTTTT AAGGGAATGA CTAGTGAAGA AAAAGAAATT 1680 

CTGATACGGG ACAAAAATGC TCTTCAAAAC ATCATTCTTT ATCACCTGAC ACCAGGAGTT 1740 

TTCATTGGAA AAGGATTTGA ACCTGGTGTT ACTAACATTT TAAAGACCAC ACAAGGAAGC 1800 

AAAATCTTTC TGAAAGAAGT AAATGATACA CTTCTGGTGA ATGAATTGAA ATCAAAAGAA 1860 

15 TCTGACATCA TGACAACAAA TGGTGTAATT CATGTTGTAG ATAAACTCCT CTAT CCAGCA 1920 

GACACACCTG TTGGAAATGA TCAACTGCTG GAAATACTTA ATAAATTAAT CAAATACATC 1980 

CAAATTAAGT TTGTTCGTGG TAGCACCTTC AAAGAAATCC CCGTGACTGT CTATACAACT 2 040 

AAAATTATAA CCAAAGTTGT GGAACCAAAA ATTAAAGTGA TTGAAGGCAG TCTTCAGCCT 2100 

ATTAT CAAAA CTG AAGGAC C CACACTAACA AAAGT CAAAA TTGAAGGTGA ACCTGAATTC 2160 

20 AGACTGATTA AAGAAGGTGA AACAATAACT GAAGTGATCC ATGGAGAGCC AATTATTAAA 2220 

AAATACACCA AAATCATTGA TGGAGTGCCT GTGGAAATAA CTGAAAAAGA GACACGAGAA 22 80 

GAACGAATCA TTACAGGTCC TGAAATAAAA TACACTAGGA TTTCTACTGG AGGTGGAGAA 2340 

ACAGAAGAAA CTCTGAAGAA ATTGTTACAA GAAGAGGTCA CCAAGGTCAC CAAATTCATT 2400 

GAAGGTGGTG ATGGTCATTT ATTTGAAGAT GAAGAAATTA AAAGACTGCT TCAGGGAGAC 2460 

25 ACACCCGTGA GGAAGTTGCA AGCCAACAAA AAAGTTCAAG GTTCTAGAAG ACGATTAAGG 252 0 

GAAGGTCGTT CTCAGTGAAA ATCCAAAAAC CAGAAAAAAA TGTTTATACA ACCCTAAGTC 2580 

AATAACCTGA CCTTAGAAAA TTGTGAGAGC CAAGTTGACT TCAGGAACTG AAACAT CAGC 2 640 

ACAAAGAAGC AATCAT CAAA TAATTCTGAA CACAAATTTA ATATTTTTTT TTCTGAATGA 2 700 

GAAACATGAG GGAAATTGTG GAGTTAGCCT CCTGTGGTAA AGGAATTGAA GAAAATATAA 2 760 

30 CACCTTACAC CCTTTTTCAT CTTGACATTA AAAGTT CTGG CTAACTTTGG AATCCATTAG 2820 

AGAAAAATCC TTGTCACCAG ATTCATTACA ATTCAAATCG AAGAGTTGTG AACTGTTATC 2880 

CCATTGAAAA GACCGAGCCT TGTATGTATG TTATGGATAC ATAAAATGCA CGCAAGCCAT 2 940 

TATCTCTCCA TGGGAAGCTA AGTTATAAAA ATAGGTGCTT GGTGTACAAA ACTTTTTATA 3 00 0 

TCAAAAGGCT TTGCACATTT CTATATGAGT GGGTTTACTG GTAAATTATG TTATTTTTTA 3060 

35 CAACTAATTT TGTACTCTCA GAATGTTTGT CATATGCTTC TTGCAATGCA TATTTTTTAA 3120 

TCTCAAACGT TTCAATAAAA CCATTTTTCA GATATAAAGA GAATTACTTC AAATTGAGTA 3180 
ATTCAGAAAA ACTCAAGATT TAAGTTAAAA AGTGGTTTGG ACTTGGGAA 



PCT/US02/12476 



40 



Seq ID NO: 401 Protein sequence 
Protein Accession #: NP_006466.1 



45 



50 



55 



MIPFLPMFSL 
KNWYKKSICG 
KLREEIEGKG 
NGMIIPSMYN 
IEAEDDLSSF 
MKYHILNTLQ 
IDQVLIPDSA 
VQRLLKLILQ 
RNGAIHIFRE 
FKGMTSEEKE 
TLLVNELKSK 
FKEIPVTVYT 
TEVIHGEPII 
QEEVTKVTKF 



11 
I 

LLLLIVNPIN 
QKTTVLYECC 
SFTYFAPSNE 
NLGLFINHYP 
RAAAITSDIIa 
CSESIMGGAV 
KQVIELAGKQ 
NHILKVKVGL 
I I KPAEKSLH 
ILIRDKNALQ 
ESDIMTTNGV 
TKIITKWEP 
KKYTKI IDGV 
IEGGDGHLFE 



21 

1 

ANNHYDKILA 
PGYMRMEGMK 
AWDNLDSDIR 
NGWTVN CAR 
EALGRDGHFT 
FETLEGNTIE 
QTTFTDLVAQ 
NELYNGQILE 
EKLKQDKRFS 
NIILYHLTPG 
IHWDKLLYP 
KIKVIEGSLQ 
PVEITEKETR 
DEEIKRLLQG 



31 
I 

HSRIRGRDQG 
GCPAVLPIDH 
RGLE SNVNVE 
I IHGNQIATN 
LFAPTNEAFE 
IGCDGDSITV 
LGLASALRPD 
TIGGKQIiRVF 
TFLSIjLEAAD 
VF I GKGFEPG 
ADTPVGNDQL 
PIIKTEGPTL 
EERIITGPEI 
DTPVRKLQAN 



41 

! 

PNVCALQQIL 
VYGTLGIVGA 
LLNALHSHMI 
GWHVIDRVL 
KLPRGVLERF 
NGIKMVNKKD 
GEYTLLAPVN 
VYRTAVC I EN 
LKELLTQPGD 
VTNILKTTQG 
LEILNKLIKY 
TKVKIEGEPE 
KYTRISTGGG 
KKVQGSRRRL 



51 

I 

GTKKKYFSTC 
TTTQRYSDAS 
NKRMLTKDLK 
TQIGTSIQDF 
MGDKVASEAL 
IVTNNGVIHL 
NAFSDDTLSM 
SCMEKGSKQG 
WTLFVPTNDA 
SKI FLKEVND 
IQIKFVRGST 
FRLIKEGETI 
ETEETLKKLL 
REGRSQ 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



Seq ID NO: 402 DNA sequence 
60 Nucleic Acid Accession #: NM_002416 
Coding sequence: 40.. 417 



65 
70 
75 
80 
85 



ATCCAATACA 
TTCCTCTTGG 
AAGGGTCGCT 
GACCTTAAAC 
AAGAATGGAG 
AAGTGGGAGA 
AAGAAAGTTC 
ACCACTTCAC 
TTCCAAAGGA 
ATTACT CTGA 
TTGTTAAAGG 
TTAAGGCCAT 
CTCACAACAG 
TATCTGAGGC 
TGAAATTGAG 
CTACAGGCCT 
ACTGGAGATC 
TTCCATCTTG 
AAGTCAGCTC 
CATATGCT CT 
AGTGCTTTCT 
AAATAAACCT 
CACATGGGTG 



11 

I 

GGAGTGACTT 
GCATCATCTT 
GTTCCTGCAT 
AATTTGCCCC 
TTCAAACATG 
AACAGGTCAG 
TGAAAGTT CG 
CAATAAGTAT 
GGATGGCATA 
AATTGTAACT 
CTATGATTGT 
GATTTTAGCA 
CTGCCTGGAA 
ACATGTCAGC 
CTGGACCTCA 
CACACACAAT 
ACCAGTGTGT 
CCCGCTCAGG 
TTCTCCATCC 
GATTTATCTG 
TCTCCCAATT 
TTTTGGACAC 
AACACTCAAT 



21 
I 

GGAACT CCAT 
GCTGGT TCTG 
CAGCACCAAC 
AAGCCCTTCC 
TCTAAACCCA 
CCAAAAGAAA 
AAAATCTCAA 
TCTGTGTTAA 
TAATACAAAG 
AAAGTTAGAA 
CTTTGTTCTT 
ATACCCATGT 
GAGCAGCCCT 
AAGTCCTAAG 
CCAAGCTGCT 
GTGTCTGAGA 
GGCTTT CAGA 
CTGACCACTT 
TACCACAATG 
AGTCAACTCC 
CATCCTCACT 
ACAAATTATC 
GGTTAACTAA 



31 

I 

TCTATCACTA 
ATTGGAGTGC 
CAAGGGACTA 
TGCGAGAAAA 
GATT CAGCAG 
AAGCAAAAGA 
CGTTCTCGTC 
AAATGTTCTA 
GCTTATTAAT 
AGTTGAT TTT 
CTACCACCCA 
CTACACAGAT 
AGGCTTCCAC 
CCTGTTAGCA 
GTGG CCATCA 
GATT CATGCT 
GCCTCCTTTC 
TATTTCTTTT 
CAGTGCCTTT 
TTTCTCATCT 
CAGTCCAGCT 
TTAAAACTCC 
TTCTTGGGTG 



41 

I 

TGAAGAAAAG 
AAGGAACCCC 
TCCACCTACA 
T TGAAAT CAT 
ATGTGAAGGA 
ATGGGAAAAA 
AAAAGAAGAC 
TTTTAATTAT 
TTGACTAGAA 
AAGAATCCAA 
CCAGTTGAAT 
GTTCACCCAA 
GTACTGCAGC 
TGCTGGTGAG 
ACCTCTGTAT 
GATTGTTATT 
TGGCTTTGGA 
TGTTCCCCTT 
CTTCTCTCCA 
TGTCCCCAAC 
TAGTTCAAGT 
TGTTTCACTT 
TTTATCCTAT 



51 
I 

TGGTGTTCTT 
AGTAGTGAGA 
ATCCTTGAAA 
TGCTACACTG 
ACTGATTAAA 
ACATCAAAAA 
TACATAAGAG 
ACCGCTATCA 
AATTTAAAAC 
ACGTTAAGAA 
TTCATCATGC 
CCACATCCCA 
CTCCAGAGAG 
CCAAGCAGTT 
TTGAATCAGC 
GGGTAT CACC 
AGCCATGTGA 
TGCTTCATTC 
GTGCACCTGT 
ACCCCACAGA 
CCTGCCTCTT 
GGTTCAGTAC 
CTCTCCAACC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



334 



WO 02/086443 

AGATTGTCAG CTCCTTGAGG GCAAGAGCCA CAGTATATTT CCCTGTTTCT TCCACAGTGC 1440 

CTAATAATAC TGTGGAACTA GGTTTTAATA ATTTTTTAAT TGATGTTGTT ATGGGCAGGA 1500 

TGGCAACCAG ACCATTGTCT CAGAGCAGGT GCTGGCTCTT TCCTGGCTAC TCCATGTTGG 1560 

CTAGCCTCTG GTAACCTCTT ACTTATTATC TTCAGGACAC TCACTACAGG GACCAGGGAT 162 0 

GATGCAACAT CCTTGTCTTT TTATGACAGG ATGTTTGCTC AGCTTCTCCA ACAATAAGAA 1680 

GCACGTGGTA AAACACTTGC GGATATTCTG GACTGTTTTT AAAAAATATA CAGTTTACCG 1740 

AAAATCATAT AATCTTACAA TGAAAAGGAC TTTATAGATC AGCCAGTGAC CAACCTTTTC 1800 

CCAACCATAC AAAAATTCCT TTTCCCGAAG GAAAAGGGCT TTCTCAATAA GCCTCAGCTT 1860 

TCTAAGATCT AACAAGATAG CCACCGAGAT CCTTATCGAA ACTCATTTTA GGCAAATATG 1920 

AGTTTTATTG TCCGTTTACT TGTTTCAGAG TTTGTATTGT GATTATCAAT TACCACACCA 1980 

TCTCCCATGA AGAAAGGGAA CGGTGAAGTA CTAAGCGCTA GAGGAAGCAG CCAAGT CGGT 2040 

TAGTGGAAGC ATGATTGGTG CCCAGTTAGC CT CTGCAGG A TGTGGAAACC TCCTT CCAGG 2100 

GGAGGTTCAG TGAATTGTGT AGGAGAGGTT GTCTGTGGCC AGAATTTAAA CCTATACTCA 2160 

CTTTCCCAAA TTGAATCACT GCTCACACTG CTGATGATTT AGAGTGCTGT CCGGTGGAGA 222 0 

TCCCACCCGA ACGTCTTATC TAATCATGAA ACTCCCTAGT TCCTTCATGT AACTTCCCTG 22 80 

AAAAATCTAA GTGTTTCATA AATTTGAGAG TCTGTGACCC ACTTACCTTG CATCTCACAG 2340 

GTAGACAGTA TATAACTAAC AACCAAAGAC TACATATTGT CACTGACACA CACGTTATAA 2400 

TCATTTATCA TATATATACA TACATGCATA CACTCTCAAA GCAAATAATT TTTCACTTCA 2460 

AAACAGTATT GACTTGTATA CCTTGTAATT TGAAATATTT TCTTTGTTAA AATAGAATGG 2520 
TATCAATAAA TAGACCATTA ATCAG 

Seq ID NO: 403 Protein sequence 
Protein Accession #: NP_002407 

1 11 21 31 41 51 

1 I I I 1 I 

MKKSGVLFLL GIILLVLIGV QGTPWRKGR CSCISTNQGT IHLQSLKDLK QFAPSPSCEK 60 

IEIIATLKNG VQTCLNPDSA DVKELIKKWE KQVSQKKKQK NGKKHQKKKV LKVRKSQRSR 120 
QKKTT 

Seq ID NO: 404 DNA sequence 

Nucleic Acid Accession #: NM_006670 

Coding sequence: 85.. 1347 

1 11 21 31 41 51 

I 1 I I I I 

CCGGCTCGCG CCCTCCGGGC CCAGCCTCCC GAGCCTTCGG AGCGGGCGCC GTCCCAGCCC 60 

AGCTCCGGGG AAACGCGAGC CGCGATGCCT GGGGGGTGCT CCCGGGGCCC CGCCGCCGGG 120 

GACGGGCGTC TGCGGCTGGC GCGACTAGCG GTGGTACTCC TGGGCTGGGT CTCCTCGTCT 180 

TCTCCCACCT CCTCGGCATC CTCCTTCTCC TCCTCGGCGC CGTTCCTGGC TTCCGCCGTG 24 0 

TCCGCCCAGC CCCCGCTGCC GGACCAGTGC CCCGCGCTGT GCGAGTGCTC CGAGGCAGCG 3 00 

CGCACAGTCA AGTGCGTTAA CCGCAATCTG ACCGAGGTGC CCACGGACCT GCCCGCCTAC 360 

GTGCGCAACC TCTTCCTTAC CGGCAACCAG CTGGCCGTGC TCCCTGCCGG CGCCTTCGCC 42 0 

CGCCGGCCGC CGCTGGCGGA GCTGGCCGCG CTCAACCTCA GCGGCAGCCG CCTGGACGAG 480 

GTGCGCGCGG GCGCCTTCGA GCATCTGCCC AGCCTGCGCC AGCTCGACCT CAGCCACAAC 540 

CCACTGGCCG ACCTCAGTCC CTTCGCTTTC TCGGGCAGCA ATGCCAGCGT CTCGGCCCCC 600 

AGTCCCCTTG TGGAACTGAT CCTGAACCAC ATCGTGCCCC CTGAAGATGA GCGGCAGAAC 660 

CGGAGCTTCG AGGGCATGGT GGTGGCGGCC CTGCTGGCGG GCCGTGCACT GCAGGGGCTC 720 

CGCCGCTTGG AGCTGGCCAG CAACCACTTC CTTTACCTGC CGCGGGATGT GCTGGCCCAA 780 

CTGCCCAGCC TCAGGCACCT GGACTTAAGT AATAATTCGC TGGTGAGCCT GACCTACGTG 840 

TCCTTCCGCA ACCTGACACA TCTAGAAAGC CTCCACCTGG AGGACAATGC CCTCAAGGTC 900 

CTTCACAATG GCACCCTGGC TGAGTTGCAA GGTCTACCCC ACATTAGGGT TTTCCTGGAC 960 

AACAATCCCT GGGTCTGCGA CTGCCACATG GCAGACATGG TGACCTGGCT CAAGGAAACA 1020 

GAGGTAGTGC AGGGCAAAGA CCGGCTCACC TGTGCATATC CGGAAAAAAT GAGGAATCGG 1080 

GTCCTCTTGG AACTCAACAG TGCTGACCTG GACTGTGACC CGATTCTTCC CCCATCCCTG 1140 

CAAACCTCTT ATGTCTTCCT GGGTATTGTT TTAGCCCTGA TAGGCGCTAT TTTCCTCCTG 1200 

GTTTTGTATT TGAACCGCAA GGGGATAAAA AAGTGGATGC ATAACATCAG AGATGCCTGC 12 60 

AGGGATCACA TGGAAGGGTA TCATTACAGA TATGAAATCA ATGCGGACCC CAGATTAACA 1320 

AACCTCAGTT CTAACTCGGA- TGT CTGAGAA ATATTAGAGG ACAGACCAAG GACAACTCTG 1380 

CATGAGATGT AGACTTAAGC TTTATCCCTA CTAGGCTTGC TCCACTTTCA TCCTCCACTA 1440 

TAGATACAAC GGACTTTGAC TAAAAGCAGT GAAGGGGATT TGCTTCCTTG TTATGTAAAG 1500 

TTTCTCGGTG TGTTCTGTTA ATGTAAGACG ATGAACAGTT GTGTATAGTG TTTTACCCTC 1560 

TTCTTTTTCT TGGAACTCCT CAACACGTAT GGAGGGATTT TTCAGGTTTC AGCATGAACA 162 0 

TGGGCTTCTT G CTGTCTGT C TCTCTCTCAG TACAGTTCAA GGTGTAGCAA GTGTACCCAC 1680 

ACAGATAGCA TTCAACAAAA GCTGCCTCAA CTTTTTCGAG AAAAATACTT T ATT CAT AAA 1740 

TAT CAGTTTT ATTCTCATGT ACCTAAGTTG TGGAGAAAAT AATTGCATCC TATAAACTGC 1800 

CTGCAGACGT TAGCAGGCTC TTCAAAATAA CTCCATGGTG CACAGGAGCA CCTGCATCCA 1860 

AGAGCATGCT TACATTTTAC TGTTCTGCAT ATTACAAAAA ATAACTTGCA ACTTCATAAC 192 0 

TTCTTTGACA AAGTAAATTA CTTTTTTGAT TGCAGTTTAT ATGAAAATGT ACTGATTTTT 1980 

TTTTAATAAA CTGCATCGAG ATCCAACCGA CTGAATTGTT AAAAAAAAAA AAAAATAAAG 2 040 
ATTCTTAAAA GAA 

Seq ID NO: 405 Protein sequence 
Protein Accession #: NP_006661 

1 11 21 

I I I 

MPGGCSRGPA AGDGRLRLAR LALVLLGWVS 
QCPALCECSE AARTVKCVNR NLTEVPTDLP 
AALNLSGSRL DEVRAGAFEH LPSLRQLDLS 
NHIVPPEDER QNRSFEGMW AALLAGRALQ 
LSNNSLVSLT YVSFRNDTHL ESLHLEDNAIi 
HMADMVTWLK ETEWQGKDR LTCAYPEKMR 
IVLALIGAIF LLVLYLNRKG I KKWMHN I RD 



31 41 51 

I I I 

SSSPTSSASS FSSSAPFLAS AVSAQPPIiPD 60 

AYVRNLFLTG NQIjAVLPAGA FARRPPIAEL 120 

HNPLADLSPF AFSGSNASVS APSPLVELIL 180 

GLRRLELASN HFLYLPRDVL AQLPSLRHLD 240 

KVLHNGTLAE LQGLPHIRVF LDNNPWVCDC 3 00 

NRVLLELNSA DLDCDPILPP SLQTSYVFLG 3 60 
ACRDHMEGYH YRYEINADPR LTNLSSNSDV 



Seq ID NO: 406 DNA sequence 

Nucleic Acid Accession #: Eos sequence 



335 



WO 02/086443 

Coding sequence : 1 . . 927 



PCT/US02/12476 



1 11 21 31 41 51 

c I I I I I I 

D ATGGCTGGGG GGTGCTCCCG GGGCCCCGCC GCCGGGGACG GGCGTCTGCG GCTGGCGCGA 60 

CTAGCGCTGG TACTCCTGGG CTGGGTCTCC TCGTCTTCTC CCACCTCCTC GGCATCCTCC 120 

TTCTCCTCCT CGGCGCCGTT CCTGGCTTCC GCCGTGT CCG CCCAGCCCCC GCTGCCGGAC 180 

CAGTGCCCCG CGCTGTGCGA GTGCTCCGAG GCAGCGCGCA CAGTCAAGTG CGTTAACCGC 240 

AATCTGACCG AGGTGCCCAC GGACCTGCCC GCCTACGTGC GCAACCTCTT CCTTACCGGC 300 

10 AACCAGCTGG CCAGCAACCA CTTCCTTTAC CTGCCGCGGG ATGTGCTGGC CCAACTGCCC 360 

AGCCTCAGGC ACCTGGACTT AAGTAATAAT TCGCTGGTGA GCCTGACCTA CGTGT CCTTC 420 

CGCAACCTGA CACATCTAGA AAGCCTCCAC CTGGAGGACA ATGCCCTCAA GGTCCTTCAC 480 

AATGGCACCC TGGCTGAGTT GCAAGGT CTA CCCCACATTA GGGTTTTCCT GGACAACAAT 540 

CCCTGGGTCT GCGACTGCCA CATGGCAGAC ATGGTGACCT GGCTCAAGGA AACAGAGGTA 600 

15 GTGCAGGGCA AAGACCGGCT CACCTGTGCA T AT CCGGAAA AAATGAGGAA TCGGGTCCTC 660 

TTGGAACTCA ACAGTGCTGA CCTGGACTGT GACCCGATTC TTCCCCCATC CCTGCAAACC 72 0 

TCTTATGTCT T CCTGGGT AT TGTTTTAGCC CTGATAGGCG CTATTTTCCT CCTGGTTTTG 780 

TATTTGAACC GCAAGGGGAT AAAAAAGTGG ATGCATAACA TCAGAGATGC CTGCAGGGAT 840 

CACATGGAAG GGTATCATTA CAGATATGAA AT C AATGCGG ACCCCAGATT AACAAACCTC 900 

20 AGTTCTAACT CGGATGTCCT CGAGTGA 

Seq ID NO: 407 Protein sequence 
Protein Accession #: Eos sequence 

25 1 11 21 31 41 51 

I I I I 1 I 

MPGGCSRGPA AGDGRLRLAR LALVLLGWVS SSSPTSSASS FSSSAPFLAS AVSAQPPLPD 60 

QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLASNHFLY LPRDVLAQLP 120 

SLRHLDLSNN SLVSLTYVSF RNLTHLESLH LEDNALKVLH NGTLAELQGL PHIRVFLDNN 180 

30 PWVCDCHMAD MVTWLKETEV VQGKDRLTCA YPEKMRNRVL LELNSADLDC DPILPPSLQT 240 

SYVFLGIVLA LIGAIFLLVL YLNRKGIKKW MHNIRDACRD HMEGYHYRYE INADPRLTNL 300 
SSNSDVLE 

Seq ID NO: 408 DNA sequence 
35 Nucleic Acid Accession #: NM_000095.1 
Coding sequence: 26.. 2299 

1 11 21 31 41 51 

An 1 I I 1 I I 

4U CAGCACCCAG CTCCCCGCCA CCGCCATGGT CCCCGACACC GCCTGCGTTC TTCTGCTCAC 60 

CCTGGCTGCC CTCGGCGCGT CCGGACAGGG CCAGAGCCCG TTGGGCTCAG ACCTGGGCCC 120 

GCAGATGCTT CGGGAACTGC AGGAAACCAA CGCGGCGCTG CAGGACGTGC GGGACTGGCT 180 

GCGGCAGCAG GTCAGGGAGA TCACGTTCCT GAAAAACACG GTGATGGAGT GTGACGCGTG 240 

CGGGATGCAG CAGT CAGT AC GCACCGGCCT ACCCAG CGTG CGGCCCCTGC TCCACTGCGC 300 

45 GCCCGGCTTC TGCTTCCCCG GCGTGGCCTG CATCCAGACG GAGAGCGGCG GCCGCTGCGG 3 60 

CCCCTGCCCC GCGGGCTTCA CGGGCAACGG CTCGCACTGC ACCGACGTCA ACGAGTGCAA 420 

CGCCCACCCC TGCTTCCCCC GAGTCCGCTG TATCAACACC AGCCCGGGGT TCCGCTGCGA 48 0 

GGCTTGCCCG CCGGGGTACA GCGGCCCCAC CCACCAGGGC GTGGGGCTGG CTTTCGCCAA 540 

GGCCAACAAG CAGGTTTGCA CGGACATCAA CGAGTGTGAG ACCGGGCAAC ATAACTGCGT 60 0 

50 CCCCAACTCC GTGTG CATCA ACACCCGGGG CTCCTTCCAG TGCGGCCCGT GCCAGCCCGG 660 

CTTCGTGGGC GACCAGGCGT CCGGCTGCCA GCGCGGCGCA CAGCGCTTCT GCCCCGACGG 720 

CTCGCCCAGC GAGTGCCACG AGCATGCAGA CTGCGTCCTA GAGCG CGATG GCTCGCGGTC 780 

GTGCGTGTGT CGCGTTGGCT GGGCCGGCAA CGGGATCCTC TGTGGTCGCG ACACTGACCT 840 

AGACGGCTTC CCGGACGAGA AGCTGCGCTG CCCGGAGCCG CAGTGCCGTA AGGACAACTG 900 

55 CGTGACTGTG CCCAACTCAG GGCAGGAGGA TGTGGACCGC GATGGCATCG GAGACGCCTG 960 

CGATCCGGAT GCCGACGGGG ACGGGGTCCC CAATGAAAAG GACAACTGCC CGCTGGTGCG 102 0 

GAACCCAGAC CAGCGCAACA CGGACGAGGA CAAGTGGGGC GATGCGTGCG ACAACTGCCG 1080 

GTCCCAGAAG AACGACGACC AAAAGGACAC AGACCAGGAC GGCCGGGGCG ATGCGTGCGA 1140 

CGACGACATC GACGGCGACC GGAT CCGCAA CCAGGCCGAC AACTGCCCTA GGGTACCCAA 1200 

60 CTCAGACCAG AAGGACAGTG ATGGCGATGG TATAGGGGAT GCCTGTGACA ACTGTCCCCA 1260 

GAAGAGCAAC CCGGATCAGG CGGATGTGGA CCACGACTTT GTGGGAGATG CTTGTGACAG 1320 

CGATCAAGAC CAGGATGGAG ACGGACATCA GGACTCTCGG GACAACTGTC CCACGGTGCC 1380 

TAACAGTGCC CAGGAGGACT CAGACCACGA TGGCCAGGGT GATGCCTGCG ACGACGACGA 1440 

CGACAATGAC GGAGTCCCTG ACAGTCGGGA CAACTGCCGC CTGGTGCCTA ACCCCGGCCA 1500 

65 GGAGGACGCG GACAGGGACG GCGTGGGCGA CGTGTGCCAG GACGACTTTG ATGCAGACAA 1560 

GGTGGTAGAC AAGATCGACG TGTGTCCGGA GAACGCTGAA GTCACGCTCA CCGACTTCAG 162 0 

GGCCTTCCAG ACAGTCGTGC TGGACCCGGA GGGTGACGCG CAGATTGACC CCAACTGGGT 1680 

GGTGCTCAAC CAGGGAAGGG AGATCGTGCA GACAATGAAC AGCGACCCAG GCCTGGCTGT 1740 

GGGTTACACT GCCTTCAATG GCGTGGACTT CGAGGGCACG TTCCATGTGA ACACGGTCAC 1800 

70 GGATGACGAC TATGCGGGCT TCATCTTTGG CTACCAGGAC AGCTCCAGCT TCTACGTGGT 1860 

CATGTGGAAG CAGATGGAGC AAACGTATTG GCAGGCGAAC CCCTTCCGTG CTGTGGCCGA 192 0 

GCCTGGCATC CAACTCAAGG CTGTGAAGTC TTCCACAGGC CCCGGGGAAC AGCTGCGGAA 1980 

CGCTCTGTGG CATACAGGAG ACACAGAGTC CCAGGTGCGG CTGCTGTGGA AGGACCCGCG 2040 

AAACGTGGGT TGGAAGGACA AGAAGTCCTA TCGTTGGTTC CTGCAGCACC GGCCCCAAGT 2100 

75 GGGCTACATC AGGGTGCGAT TCTATGAGGG CCCTGAGCTG GTGGCCGACA GCAACGTGGT 2160 

CTTGGACACA ACCATGCGGG GTGGCCGCCT GGGGGTCTTC TGCTTCTCCC AGGAGAACAT 2220 

CATCTGGGCC AACCTGCGTT ACCGCTGCAA TGACACCATC CCAGAGGACT ATGAGACCCA 22 8 0 

TCAGCTGCGG CAAGCCTAGG GACCAGGGTG AGGACCCGCC GGATGACAGC CACCCTCACC 2340 

GCGGCTGGAT GGGGGCTCTG CACCCAGCCC AAGGGGTGGC CGTCCTGAGG GGGAAGTGAG 240 0 

80 AAGGGCTCAG AGAGGACAAA ATAAAGTGTG TGTGCAGGG 

Seq ID NO: 409 Protein sequence 
Protein Accession #: NP_0000 8 6.1 

85 1 11 21 31 41 51 

I I i I i I 

MVPDTACVLL LTLAALGASG QGQSPLGSDL GPQMLRELQE TNAALQDVRD WLRQQVREIT 60 



336 



10 



WO 02/086443 

FLKNTVMECD ACGMQQSVRT GLPSVRPLbH CAPGFCFPGV ACIQTESGGR CGPCPAGFTG 12 0 

NGSHCTDVNE CNAHPCFPRV RCINTSPGFR CEACPPGYSG PTHQGVGLAF AKANKQVCTD 180 

INECETGQHN CVPNSVCINT RGSFQCGPCQ PGFVGDQASG CQRGAQRFCP DGSPSECHEH 240 

ADCVLERDGS RSCVCRVGWA GNGILCGRDT DLDGFPDEKIi RCPEPQCRKD NCVTVPNSGQ 300 

EDVDRDGIGD ACDPDADGDG VPNEKDNCPL VRNPDQRNTD EDKWGDACDN CRSQKNDDQK 360 

DTDQDGRGDA CDDDIDGDRI RNQADNCPRV PNSDQKDSDG DGIGDACDNC PQKSNPDQAD 420 

VDHDFVGDAC DSDQDQDGDG HQDSRDNCPT VPNSAQEDSD HDGQGDACDD DDDNDGVPDS 480 

RDNCRLVPNP GQEDADRDGV GDVCQDDFDA DKWDKIDVC PENAEVTLTD FRAFQTWLD 540 

PEGDAQIDPN WWLNQGREI VQTMNSDPGL AVGYTAFNGV DFEGTFHVNT VTDDDYAGF I 600 

FGYQDSSSFY WMWKQMEQT YWQANPFRAV AEPGIQLKAV KSSTGPGEQL RNALWHTGDT 660 

ESQVRLLWKD PRNVGWKDKK SYRWFLQHRP QVGYIRVRFY EGPELVADSN WLDTTMRGG 720 
RLGVFCFSQE NIIWANLRYR CNDT I PEDYE THQLRQA 



PCT/US02/12476 



Seq ID NO: 410 DNA sequence 
15 Nucleic Acid Accession #: NM_001565.1 
Coding sequence: 6 7.. 3 63 



20 



25 



30 



35 



40 



GAGACATTCC 
AGCACCATGA 
ATTCAAGGAG 
CCTGTTAATC 
CGTGTTGAGA 
TCGAAGGCCA 
TAAAACCAGA 
CCTCTCCCAT 
GTTACACTAA 
GGTTAATGTT 
GCTCTACTGA 
ACCTTTCCCA 
TCAGAATCTC 
ACTTCATGGA 
CATACAATTC 
CTTATTTAAT 
TTTCAGTGTA 
TTTTAAAAAT 
TTTTCAAATA 



11 

I 

TCAATTGCTT 
ATCAAACTGC 
TACCTCTCTC 
CAAGGTCTTT 
TCATTGCTAC 
TCAAGAATTT 
GGGGAGCAAA 
CACTTCCCTA 
AAGGTGACCA 
CATCATCCTA 
GGTG CTATGT 
TCTTCCAAGG 
AAATAACTAA 
CTTCCACTGC 
CAAACACATA 
GAAAGACTGT 
CATGGAATAA 
ACAGATAGAT 
AAAATGAGGT 



21 
I 

AGACATATTC 
GATTCTGATT 
TAGAACCGTA 
AGAAAAACTT 
AATGAAAAAG 
ACTGAAAGCA 
ATCGATGCAG 
CATGGAGTAT 
ATGATGGTCA 
AGCTATTCAG 
TCTTAGTGGA 
GTACTAAGGA 
AAGGTATGCA 
CATCCTCCCA 
CAGGAAGGTA 
ACAAAGTATA 
CATGTAATTA 
ATATGCTCTG 
ACTCTCCTGG 



31 

I 

TGAGCCTACA 
TGCTGCCTTA 
CGCTGTACCT 
GAAATTATTC 
AAGGGTGAGA 
GTTAGCAAGG 
TGCTTCCAAG 
ATGT CAAGCC 
CCAAATCAGC 
TAATAACTCT 
TGTTCTGACC 
ATCTTTCTGC 
AT CAAAT CTG 
AGGGGCCCAA 
GAAATATCTG 
AGTCTTAGAT 
AGTACTATGT 
CATGTTACAT 
AAATATTAAG 



41 
I 

GCAGAGGAAC 
TCTTTCTGAC 
GCATCAGCAT 
CTGCAAGCCA 
AGAGATGTCT 
AAATGTCTAA 
GATGGACCAC 
ATAATTGTTC 
TGCTACTACT 
ACCCTGGCAC 
CTGCTTCAAA 
TTTGGGGTTT 
CTTTTTAAAG 
ATTCTTTCAG 
AAAATGTATG 
GTATATATTT 
ATCAATGAGT 
AAGATAAATG 



51 
I 

CTCCAGTCTC 
TCTAAGTGGC 
TAGTAATCAA 
ATTTTGTCCA 
GAATCCAGAA 
AAGATCTCCT 
ACAGAGGCTG 
TTAGTTTGCA 
CCTGTAGGAA 
TATAATGTAA 
TATTTCCCTC 
ATCAGAATTC 
AATGCTCTTT 
TGGCTACCTA 
TGTAAGTATT 
CCTATATTGT 
AACAGGAAAA 
TGCTGAATGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



Seq ID NO: 411 Protein sequence 
Protein Accession #: NP_001556.1 



1 11 21 31 41 51 

A< \ I 1 1 1 1 

4Z> MNQTAILICC LIFLTLSGIQ GVPLSRTVRC TCISISNQPV NPRSLEKLEI IPASQFCPRV 

EIIATMKKKG EKRCLNPESK AIKNLLKAVS KEMSKRSP 

Seq ID NO: 412 DNA sequence 
Nucleic Acid Accession #: XM_057014 
50 Coding sequence: 143.. 874 



55 
60 
65 
70 
75 
80 
85 



1 
I 

GGGAGGGAGA 
CGCGGCGGAG 
CGCTGCCCGG 
CCGCGGCCTC 
CCCCAAGGGG 
AATGTGCTTA 
CATTCCGGGT 
TCTGAGGGAA 
ATTGAATTAT 
AAATAGTGCT 
CTGTCAGCGT 
AGCTATAATT 
CACTTCTTCT 
CTGGGTTGGC 
TTCTCGCATC 
TTTTTTTATT 
CATCTGAATG 
TTTAAATCTA 
TGGTTAGAAT 
GGTCTTTTGT 
TGTACAATTT 
CAACCTTAAA 



11 
I 

GAGGCGCGCG 
CCAGACGCTG 
CAGCCGGGAG 
CTGCTGCTCC 
AAGCAAAAGG 
CAAGGGCCAG 
ACACCTGGGA 
AGCTTTGAGG 
GGCATAGATC 
CTAAGAGTTT 
TGGTATTTCA 
TATTTGGACC 
GTGGAAGGAC 
ACTTGTTCAG 
ATTATTGAAG 
ATGCCTTGGA 
AAAAGCAAAG 
GCATTATTCA 
ACTTTCTTCA 
TTTTTCTCTT 
GTAAATGTTA 
AAAAAAAAAA 



21 

I 

GGTGAAAGGC 
ACCACGTTCC 
CCATGCGACC 
TGCTGCTGCA 
CGCAGCTCCG 
CAGGAGTGCC 
TCCCAGGTCG 
AGTCCTGGAC 
TTGGGAAAAT 
TGTTCAGTGG 
CATTCAATGG 
AAGGAAGCCC 
TTTGTGAAGG 
ATTACCCAAA 
AACTACCAAA 
ATGGTTCACT 
CTAAATATGT 
TTTTGCTTCA 
TAGTCACATT 
AGTATAGCAT 
AGAATTTTTT 
AAAA 



31 
I 

GCATTGATGC 
TCTCCTCGGT 
CCAGGGCCCC 
GCTGCCCGCG 
GCAGAGGGAG 
TGGTCGAGAC 
GGATGGATTC 
ACCCAACTAC 
TGCGGAGTGT 
CTCACTTCGG 
AG CTG AATGT 
TGAAATGAAT 
AATTGGTGCT 
AGGAGATGCT 
ATAAATGCTT 
TAAATGACAT 
TTACAGACCA 
ATCAAAAGTG 
CTCTCAACCT 
TTTTAAAAAA 
TTATATCTGT 



41 
I 

AGCCTGCGGC 
CTCCTCCGCC 
GCCGCCTCCC 
CCGTCGAGCG 
GTGGTGGACC 
GGGAGCCCTG 
AAAGGAGAAA 
AAGCAGTGTT 
ACATTTACAA 
CTAAAATGCA 
TCAGGACCTC 
TCAACAATTA 
GGATTAGTGG 
TCTACTGGAT 
TAATTTTCAT 
TTTAAATAAG 
AAGTGTGATT 
GTTTCAATAT 
ATAATTTGGA 
ATATAAAAGC 
TAAATAAAAA 



51 
1 

GGCCTCGGAG 
TCCAGCTCCG 
CGCAGCGGCT 
CCTCTGAGAT 
TGTATAATGG 
GGGCCAATGG 
AGGGGGAATG 
CATGGAGTTC 
AGATGCGTTC 
GAAATGCATG 
TTCCCATTGA 
ATATTCATCG 
ATGTTGCTAT 
GGAATTCAGT 
TTGCTACCTC 
TTTATGTATA 
TCACACTGTT 
TTTTTTTAGT 
ATATTGTTGT 
TACCAATCTT 
TTATTTCCAA 



Seq ID NO: 413 Protein sequence 
Protein Accession #: XP_057014 

1 11 



51 



21 31 41 

1 I 1 1 1 i 

MRPQGPAASP QRLRGLLLLL LLQLPAPSSA SEIPKGKQKA QLRQREWDL YNGMCLQGPA 
GVPGRDGSPG ANGIPGTPGI PGRDGFKGEK GECLRESFEE SWTPNYKQCS WSSLNYGIDL 
GKIAECTFTK MRSNSALRVL FSGSLRLKCR NACCQRWYFT FNGAECSGPL PIEAIIYLDQ 
GSPEMNSTIN IHRTSSVEGL CEGIGAGLVD VAIWVGTCSD YPKGDASTGW NSVSRIIIEE 
LPK 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



60 
120 
180 
240 



337 




PCT/US02/12476 



5 



i 



li 



21 



31 



41 



51 



CTCGTGCCGA ATTCGGCACG AGACCGCGTG TTCGCGCCTG GTAGAGATTT CTCGAAGACA 60 

CCAGTGGGCC CGTGTGGAAC CAAACCTGCG CGCGTGGCCG GGCCGTGGGA CAACGAGGCC 120 

GCGGAGACGA AGGCGCAATG GCGAGGAAGT TATCTGTAAT CTTGATCCTG ACCTTTGCCC 180 

10 TCTCTGTCAC AAATCCCCTT CATGAACTAA AAGCAGCTGC TTTCCCCCAG ACCACTGAGA 240 

AAATTAGTCC GAATTGGGAA TCTGGCATTA ATGTTGACTT GGCAATTTCC ACACGGCAAT 300 

AT CAT CT ACA ACAGCTTTTC TACCGCTATG GAGAAAATAA TTCTTTGTCA GTTGAAGGGT 360 

TCAGAAAATT ACTTCAAAAT ATAGGCATAG ATAAGATTAA AAGAATCCAT ATACACCATG 42 0 

ACCACGACCA TCACTCAGAC CACGAGCATC ACTCAGACCA TGAGCGTCAC TCAGACCATG 480 

15 AGCATCACTC AGACCACGAG CATCACTCTG ACCATGATCA TCACTCCCAC CATAAT CATG 540 

CTGCTTCTGG TAAAAATAAG CGAAAAGCTC TTTGCCCAGA CCATGACTCA GATAGTTCAG 600 

GTAAAGATCC TAGAAACAGC CAGGGGAAAG GAGCTCACCG ACCAGAACAT GCCAGTGGTA 660 

GAAGGAATGT CAAGGACAGT GTTAGTGCTA GTGAAGTGAC CTCAACTGTG TACAACACTG 720 

TCTCTGAAGG AACTCACTTT CTAGAGACAA TAGAGACTCC AAGACCTGGA AAACTCTTCC 780 

20 CCAAAGATGT AAGCAGCTCC ACTCCACCCA GTGTCACATC AAAGAGCCGG GTGAGCCGGC 840 

TGGCTGGTAG GAAAACAAAT GAATCTGTGA GTGAGCCCCG AAAAGGCTTT ATGTATTCCA 900 

GAAACACAAA TGAAAATCCT CAGGAGTGTT TCAATGCATC AAAGCTACTG ACATCT CATG 960 

GCATGGGCAT CCAGGTTCCG CTGAATGCAA CAGAGTTCAA CTATCTCTGT CCAGCCATCA 1020 

TCAACCAAAT TGATGCTAGA TCTTGTCTGA TTCATACAAG TGAAAAGAAG GCTGAAATCC 1080 

25 CTCCAAAGAC CTATTCATTA CAAATAGCCT GGGTTGGTGG TTTTATAGCC ATTTCCATCA 1140 

TCAGTTTCCT GTCTCTGCTG GGGGTTATCT TAGTGCCTCT CATGAATCGG GTGTTTTTCA 1200 

AATTTCTCCT GAGTTTCCTT GTGGCACTGG CCGTTGGGAC TTTGAGTGGT GATGCTTTTT 1260 

TACACCTTCT- TCCACATTCT CATGCAAGTC ACCACCATAG TCATAGCCAT GAAGAACCAG 1320 

CAATGGAAAT GAAAAGAGGA CCACTTTTCA GTCATCTGTC TTCTCAAAAC ATAGAAGAAA 1380 

30 GTGCCTATTT TGATTCCACG TGGAAGGGTC TAACAGCT CT AGGAGGCCTG TATTTCATGT 1440 

TTCTTGTTGA ACATGTCCTC ACATTGATCA AACAATTTAA AGATAAGAAG AAAAAGAATC 1500 

AGAAGAAACC TGAAAATGAT GATGATGTGG AGATTAAGAA GCAGTTGTCC AAGTATGAAT 1560 

CTCAACTTTC AACAAATGAG GAGAAAGTAG ATACAGATGA TCGAACTGAA GGCTATTTAC 1620 

GAGCAGACTC ACAAGAGCCC TCCCACTTTG ATTCTCAGCA GCCTGCAGTC TTGGAAGAAG 1680 

35 AAGAGGTCAT GATAGCTCAT GCTCATCCAC AGGAAGTCTA CAATGAATAT GTACCCAGAG 1740 

GGTGCAAGAA T AAATG CC AT TCACATTTCC ACGATACACT CGGCCAGTCA GACGATCTCA 1800 

TTCACCACCA TCATGACTAC CAT CAT AT T C T CC AT CAT CA CCACCACCAA AACCACCATC 1860 

CT C ACAGTCA CAGCCAGCGC TACTCTCGGG AGGAGCTGAA AGATGCCGGC GTCGCCACTT 1920 

TGGCCTGGAT GGTGATAATG GGTGATGGCC TGCACAATTT CAGCGATGGC CTAGCAATTG 1980 

40 GTGCTGCTTT TACTGAAGGC TTATCAAGTG GTTTAAGTAC TTCTGTTGCT GTGTTCTGTC 2040 

ATGAGTTGCC TCATGAATTA GGTGACTTTG CTGTTCTACT AAAGGCTGGC ATGACCGTTA 2100 

AGCAGGCTGT CCTTTATAAT GCATTGTCAG CCATGCTGGC GTATCTTGGA ATGGCAACAG 2160 

GAATTTT CAT TGGT CATTAT GCTGAAAATG TTTCTATGTG GATATTTGCA CTTACTGCTG 222 0 

GCTTATTCAT GTATGTTGCT CTGGTTGATA TGGTACCTGA AATGCTGCAC AATGATGCTA 2280 

45 GTGACCATGG ATGT AG CCG C TGGGGGTATT TCTTTTTACA GAATGCTGGG ATGCTTTTGG 2340 

GTTTTGGAAT TATGTTACTT ATTTCCATAT TTGAACATAA AATCGTGTTT CGTATAAATT 2400 

TCTAGTTAAG GTTTAAATGC TAGAGTAGCT TAAAAAGTTG TCATAGTTTC AGTAGGTCAT 2460 

AGGGAGATGA GTTTGTATGC TGTACTATGC AGCGTTTAAA GTTAGTGGGT TTTGTGATTT 2 520 

TTGTATTGAA TATTGCTGTC TGTTACAAAG TCAGTTAAAG GTACGTTTTA ATATTTAAGT 2580 

50 TATTCTATCT TGGAGATAAA ATCTGTATGT GCAATTCACC GGTATTACCA GTTTATTATG 2640 

TAAACAAGAG ATTTGGCATG ACATGTTCTG TATGTTTCAG GGAAAAATGT CTTTAATGCT 270 0 

TTTTCAAGAA CTAACACAGT TATTCCTATA CTGGATTTTA GGTCTCTGAA GAACTGCTGG 2760 

TGTTTAGGAA TAAGAATGTG CATGAAGCCT AAAATACCAA GAAAGCTTAT ACTGAATTTA 2820 

AGCAAAGAAA TAAAGGAGAA AAGAGAAGAA TCTGAGAATT GGGGAGGCAT AGATTCTTAT 2 880 

55 AAAAATCACA AAATTTGTTG TAAATTAGAG GGGAGAAATT TAGAATTAAG TATAAAAAGG 2 940 

CAGAATTAGT ATAGAGTACA TTCATTAAAC ATTTTTGTCA GGATTATTTC CCGTAAAAAC 3000 

GTAGTGAGCA CTCTCATATA CTAATTAGTG TACATTTAAC TTTGTATAAT ACAGAAAT CT 3 060 

AAATATATTT AATGAATTCA AGCAATATAC ACTTGACCAA GAAATTGGAA TTTCAAAATG 3120 

TTCGTGCGGG TTATATACCA GATGAGTACA GTGAGTAGTT TATGTATCAC CAGACT GGGT 3180 

60 TATTGCCAAG TT AT AT AT C A CCAAAAGCTG TATGACTGGA TGTTCTGGTT ACCTGGTTTA 3240 

CAAAATTATC AGAGTAGTAA AACTTTGATA TATATGAGGA TATTAAAACT ACACTAAGTA 33 00 

TCATTTGATT CGATTCAGAA AGTACTTTGA TATCT CTCAG TGCTTCAGTG CTATCATTGT 3360 

GAG CAATTGT CTTTATATAC GGTACTGTAG CCATACTAGG CCTGTCTGTG GCATTCTCTA 3420 
GATGTTTCTT TTTTACACAA TAAATTCCTT AT AT C AGCTT G 
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MARKLSVIXiI LTPALSVTNP LHELKAAAFP QTTEKISPNW ESGINVDLAI STRQYHLQQL 60 

FYRYGENNSL SVEGFRKLLQ NIGIDKIKRI HIHHDHDHHS DHEHHSDHER H SDHEHH SDH 12 0 

EHHSDHDHHS HHNHAASGKN KRKALCPDHD SDSSGKDPRN SQGKGAHRPE HASGRRNVKD 180 

SVSASEVTST VYNTVSEGTH FLETIETPRP GKLFPKDVSS STPPSVTSKS RVSRLAGRKT 240 

75 NESVSEPRKG FMYSRNTNEN PQECFNASKL LTSHGMGIQV PLNATEFNYL CPAIINQIDA 300 

RSCLIHTSEK KAEIPPKTYS LQIAWVGGFI AISIISFLSL LGVILVPLMN RVFFKFLLSF 3 60 

LVALAVGTLS GDAFLHLLPH SHASHHHSHS HEEPAMEMKR GPLFSHLSSQ NIEESAYFDS 420 

TWKGLTALGG LYFMFLVEHV LTLIKQFKDK KKKNQKKPEN DDDVEIKKQL SKYESQLSTN 480 

EEKVDTDDRT EGYLRADSQE PSHFDSQQPA VLEEEEVMIA HAHPQEVYNE YVPRGCKNKC 540 

80 HSHFHDTLGQ SDDL I HHHHD YHHILHHHHH QNHHPHSHSQ RYSREELKDA GVATLAWMVI 600 

MGDGLHNFSD GLAIGAAFTE GLSSGLSTSV AVFCHELPHE LGD FAVLLKA GMTVKQAVLY 660 

NALSAMLAYL GMATGIFIGH YAENVSMWIF ALTAGLFMYV ALVDMVPEML HNDASDHGCS 720 
RWGYFFLQNA GMLLGFGIML LISIFEHKIV FRINF 
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ATGCCCAAGC GCGCGCACTG GGGGGCCCTC TCCGTGGTGC TGATCCTGCT TTGGGGCCAT 60 

CCGCGAGTGG CGCTGGCCTG CCCGCATCCT TGTGCCTGCT ACGTCCCCAG CGAGGTCCAC 12 0 

TGCACGTTCC GATCCCTGGC TTCCGTGCCC GCTGGCATTG CTAGACACGT GGAAAGAATC 180 

AATTTGGGGT TTAATAGCAT ACAGGCCCTG TCAGAAACCT CATTTGCAGG ACTGACCAAG 240 

TTGGAGCTAC TTATGATTCA CGGCAATGAG ATCCCAAGCA TCCCCGATGG AGCTTTAAGA 300 

GACCTCAGCT CTCTTCAGGT TTTCAAGTTC AGCTACAACA AGCTGAGAGT GATCACAGGA 3 60 

CAGACCCTCC AGGGTCTCTC TAACTTAATG AGGCTGCACA TTGACCACAA CAAGAT CG AG 420 

TTTATCCACC CTCAAGCTTT CAACGGCTTA ACGTCTCTGA GGCTACTCCA TTTGGAAGGA 480 

AATCTCCTCC ACCAGCTGCA CCCCAGCACC TTCTCCACGT TCACATTTTT GGATTATTTC 540 

AGACT CT CCA CCATAAGGCA CCTCTACTTA GCAGAGAACA TGGTTAGAAC TCTTCCTGCC 600 

AGCATGCTTC GGAACATGCC GCTTCTGGAG AATCTTTACT TGCAGGGAAA TCCGTGGACC 660 

TGCGATTGTG AGATGAGATG GTTTTTGGAA TGGGATGCAA AATCCAGAGG AATTCTGAAG 72 0 

TGTAAAAAGG ACAAAGCTTA TGAAGGCGGT CAGTTGTGTG CAATGTGCTT CAGTCCAAAG 780 

AAGTTGTACA AACATGAGAT ACACAAGCTG AAGGACATGA CTTGTCTGAA GCCTTCAATA 840 

GAGTCCCCTC TGAGACAGAA CAGGAGCAGG AGTATTGAGG AGGAGCAAGA ACAGGAAGAG 90 0 

GATGGTGGCA GCCAGCT CAT CCTGGAGAAA TTCCAACTGC CCCAGTGGAG CATCTCTTTG 960 

AATATGACCG ACGAGCACGG GAACATGGTG AACTTGGTCT GTGACATCAA GAAACCAATG 102 0 

GATGTGTACA AGATTCACTT GAACCAAACG GATCCTCCAG ATATTGACAT AAATGCAACA 10 80 

GTTGCCTTGG ACTTTGAGTG T CCAATGACC CGAGAAAACT ATGAAAAGCT ATGGAAATTG 1140 

ATAGCATACT ACAGTGAAGT TCCCGTGAAG CTACACAGAG AGCTCATGCT CAGCAAAGAC 12 0 0 

CCCAGAGTCA GCTACCAGTA CAGGCAGGAT GCTGATGAGG AAGCTCTTTA C T AC ACAGGT 12 60 

GTGAGAGCCC AGATTCTTGC AGAACCAGAA TGGGTCATGC AGCCATCCAT AGATATCCAG 132 0 

CTGAACCGAC GTCAGAGTAC GGCCAAGAAG GTGCTACTTT CCTACTACAC CCAGTATTCT 13 80 

CAAACAATAT CCACCAAAGA TACAAGGCAG GCTCGGGGCA GAAGCTGGGT AATGAT TGAG 1440 

CCTAGTGGAG CTGTGCAAAG AGATCAGACT GTCCTGGAAG GGGGTCCATG CCAGTTGAGC 150 0 

TGCAACGTGA AAGCTTCTGA GAGTCCATCT ATCTTCTGGG TGCTTCCAGA TGGCTCCATC 15 60 

CTGAAAGCGC CCATGGATGA CCCAGACAGC AAGTTCTCCA TTCTCAGCAG TGGCTGGCTG 162 0 

AGGATCAAGT CCATGGAGCC ATCTGACTCA GGCTTGTACC AGTGCATTGC TCAAGTGAGG 1680 

GATGAAATGG ACCGCATGGT ATATAGGGTA CTTGTGCAGT CTCCCTCCAC TCAGCCAGCC 1740 

GAGAAAGACA CAGTGACAAT TGGCAAGAAC CCAGGGGAGT CGGTGACATT GCCTTGCAAT 18 0 0 

GCTTTAGCAA TACCCGAAGC CCACCTTAGC TGGATTCTTC CAAACAGAAG GATAATTAAT 1860 

GATTTGGCTA ACACATCACA TGTATACATG TTGCCAAATG GAACTCTTTC CATCCCAAAG 192 0 

GTCCAAGTCA GTGATAGTGG TTACTACAGA TGTGTGGCTG TCAACCAGCA AGGGGCAGAC 198 0 

CATTTTACGG TGGGAAT C AC AGTGACCAAG AAAGGGTCTG GCTTGCCATC CAAAAGAGGC 2040 

AGACGCCCAG GTGCAAAGGC TCTTTCCAGA GTCAGAGAAG ACATCGTGGA GGATGAAGGG 2100 

GGCTCGGGCA TGGGAGATGA AGAGAACACT TCAAGGAGAC TTCTGCATCC AAAGGACCAA 2160 

GAGGTGTTCC TCAAAACAAA GGATGATGCC ATCAATGGAG ACAAGAAAGC CAAGAAAGGG 2220 

AGAAGAAAGC TGAAACTCTG GAAGCATTCG GAAAAAGAAC CAGAGACCAA TGTTGCAGAA 22 80 

GGT CGCAGAG TGTTTGAATC TAGACGAAGG ATAAACATGG CAAACAAACA GATTAATCCG 2340 

GAGCGCTGGG CTGATATTTT AGCCAAAGTC CGTGGGAAAA ATCTCCCTAA GGGCACAGAA 2400 

GTACCCCCGT TGATTAAAAC CACAAGTCCT CCATCCTTGA GCCTAGAAGT CACACCACCT 24 60 

TTTCCTGCTG TTTCTCCCCC CTCAGCATCT CCTGTGCAGA CAGTAACCAG TGCTGAAGAA 2520 

TCCTCAGCAG ATGTACCTCT ACTTGGTGAA GAAGAGCACG TTTTGGGTAC CATTTCCTCA 25 8 0 

GCCAGCATGG GGCTAGAACA CAACCACAAT GGAGTTATTC TTGTTGAACC TGAAGTAACA 2640 

AGCACACCTC TGGAGGAAGT TGTTGATGAC CTTTCTGAGA AGACTGAGGA GATAACTTCC 2 700 

ACTGAAGGAG ACCTGAAGGG GACAGCAGCC CCTACACTTA TATCTGAGCC TTATGAACCA 2 760 

TCTCCTACTC TGCACACATT AGACACAGTC TATGAAAAGC CCACCCATGA AGAGACGGCA 2 820 

ACAGAGGGTT GGTCTGCAGC AGATGTTGGA TCGTCACCAG AGCCCACATC CAGTGAGTAT 2 880 

GAGCCTCCAT TGGATGCTGT CTCCTTGGCT GAGTCTGAGC CCATGCAATA CTTTGACCCA 2 940 

GATTTGGAGA CTAAGTCACA ACCAGATGAG GATAAGATGA AAGAAGACAC CTTTGCACAC 3 000 

CTTACTCCAA CCCCCACCAT CTGGGTTAAT GACTCCAGTA CATCACAGTT ATTTGAGGAT 3 060 

T CT ACT AT AG GGGAACCAGG TGTCCCAGGC CAATCACATC TACAAGGACT GACAGACAAC 312 0 

ATCCACCTTG TGAAAAGTAG TCTAAGCACT CAAGACACCT TACTGATTAA AAAGGGTATG 3180 

AAAGAGATGT CTCAGACACT ACAGGGAGGA AATATGCTAG AGGGAGACCC CACACACTCC 3 240 

AGAAGTTCTG AGAGTGAGGG CCAAGAGAGC AAAT CCATCA CTTTGCCTGA CTCCACACTG 33 00 

GGTATAATGA GCAGTATGTC TCCAGTTAAG AAGCCTGCGG AAACCACAGT TGGTACCCTC 33 60 

CTAGACAAAG ACACCACAAC AGTAACAACA ACACCAAGGC AAAAAGTTGC TCCGTCATCC 3 420 

ACCATGAGCA CTCACCCTTC TCGAAGGAGA CCCAACGGGA GAAGGAGATT ACGCCCCAAC 3 480 

AAATTCCGCC ACCGGCACAA GCAAACCCCA CCCACAACTT TTGCCCCATC AGAGACTTTT 3540 

TCTACTCAAC CAACTCAAGC ACCTGACATT AAGATTTCAA GTCAAGTGGA GAGTTCTCTG 3 600 

GTT CCTACAG CTTGGGTGGA TAACACAGTT AATACCCCCA AACAGTTGGA AATGGAGAAG 3 660 

AATGCAGAAC CCACATCCAA GGGAACACCA CGGAGAAAAC ACGGGAAGAG GCCAAACAAA 3 720 

CATCGATATA CCCCTTCTAC AGTGAGCTCA AGAGCGTCCG GATCCAAGCC CAGCCCTTCT 3 780 

CCAGAAAATA AACATAGAAA CATTGTTACT CCCAGTTCAG AAACTATACT TTTGCCTAGA 3 840 

ACTGTTTCTC TGAAAACTGA GGGCCCTTAT GATTCCTTAG ATTACATGAC AACCACCAGA 3 900 

AAAATATATT CATCTTACCC TAAAGT CCAA GAGACACTTC CAGTCACATA TAAACCCACA 3 960 

TCAGATGGAA AAGAAATTAA GGATGATGTT GCCACAAATG TTGACAAACA TAAAAGTGAC 4020 

ATTTTAGTCA CTGGTGAATC AATTACTAAT GCCATACCAA CTTCTCGCTC CTTGGTCTCC 4080 

ACTATGGGAG AATTTAAGGA AGAATCCTCT CCTGTAGGCT TTCCAGGAAC TCCAACCTGG 4140 

AATCCCTCAA GGACGGCCCA GCCTGGGAGG CTACAGACAG ACATACCTGT TACCACTTCT 4200 

GGGGAAAATC TTACAGACCC TCCCCTTCTT AAAGAGCTTG AGGATGTGGA TTTCACTTCC 4260 

GAGTTTTTGT CCTCTTTGAC AGTCTCCACA CCATTTCACC AGGAAGAAGC TGGTTCTTCC 4320 

ACAACTCTCT CAAG CAT AAA AGTGGAGGTG GCTTCAAGTC AGGCAGAAAC CACCACCCTT 4380 

GAT CAAGAT C ATCTTGAAAC CACTGTGGCT ATTCTCCTTT CTGAAACTAG ACCACAGAAT 4440 

CACACCCCTA CTGCTGCCCG GATGAAGGAG CCAGCATCCT CGTCCCCATC CACAATTCTC 4500 

ATGTCTTTGG GACAAACCAC CACCACTAAG CCAGCACTTC CCAGTCCAAG AATATCTCAA 4560 

GCAT CTAGAG ATTCCAAGGA AAATGTTTTC TTGAATTATG TGGGGAATCC AGAAACAGAA 4620 

GCAACCCCAG TCAACAATGA AGGAACACAG CATATGTCAG GGCCAAATGA ATTATCAACA 4680 

CCCTCTTCCG ACCGGGATGC ATTTAACTTG TCTACAAAGC TGGAATTGGA AAAGCAAGTA 4740 

TTTGGTAGTA GGAGTCTACC ACGTGGCCCA GATAGCCAAC GCCAGGATGG AAGAGTT CAT 4800 

GCTTCTCATC AACTAACCAG AGTCCCTGCC AAACCCATCC TACCAACAGC AACAGTGAGG 4860 

CTACCTGAAA TGTCCACACA AAGCGCTTCC AGATACTTTG TAACTTCCCA GTCACCTCGT 4 920 

CACTGGACCA ACAAACCGGA AATAACTACA TATCCTTCTG GGGCTTTGCC AGAGAACAAA 4 980 

CAGTTTACAA CTCCAAGATT ATCAAGTACA ACAATTCCTC TCCCATTGCA CATGTCCAAA 5040 
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CCCAGCATTC CTAGTAAGTT TACTGACCGA AGAACTGACC AATTCAATGG TTACTCCAAA 5100 

GTGTTTGGAA ATAACAACAT CCCTGAGGCA AGAAACCCAG TTGGAAAGCC TCCCAGTCCA 5160 

AGAATTCCTC ATTATT CCAA TGGAAG ACT C CCTTTCTTTA CCAACAAGAC TCTTTCTTTT 5220 

CCACAGTTGG GAGTCACCCG GAGACCCCAG ATACCCACTT CTCCTGCCCC AGTAATGAGA 5280 

5 GAGAGAAAAG TTATTCCAGG TTCCTACAAC AGGATACATT CCCATAGCAC CTTCCATCTG 5340 

GACTTTGGCC CTCCGGCACC TCCGTTGTTG CACACTCCGC AGACCACGGG ATCACCCTCA 54 00 

ACTAACTTAC AGAATATCCC TATGGTCTCT TCCACCCAGA GTTCTATCTC CTTTATAACA 5460 

TCTTCTGTCC AGTCCTCAGG AAGCTTCCAC CAGAGCAGCT CAAAGTTCTT TGCAGGAGGA 5520 

CCTCCTGCAT CCAAATTCTG GTCTCTTGGG GAAAAGCCCC AAATCCTCAC CAAGTCCCCA 5580 

10 CAGACTGTGT CCGTCACCGC TGAGACAGAC ACTGTGTTCC CCTGTGAGGC AACAGGAAAA 5640 

CCAAAGCCTT TCGTTACTTG GACAAAGGTT TCCACAGGAG CTCTTATGAC TCCGAATACC 5700 

AGGATACAAC GGTTTGAGGT TCTCAAGAAC GGTACCTTAG TGATACGGAA GGTTCAAGTA 5760 

CAAGATCGAG G CCAGT AT AT GTGCACCGCC AGCAACCTGC ACGGCCTGGA CAGGATGGTG 5820 

GTCTTGCTTT CGGTCACCGT GCAGCAACCT CAAATCCTAG CCTCCCACTA CCAGGACGTC 5880 

15 ACTGTCTACC TGGGAGACAC CATTGCAATG GAGTGTCTGG CCAAAGGGAC CCCAGCCCCC 5940 

CAAATTTCCT GGATCTTCCC TGACAGGAGG GTGTGGCAAA CTGTGTCCCC CGTGGAGAGC 6000 

CGCATCACCC TGCACGAAAA CCGGACCCTT TCCATCAAGG AGGCGTCCTT CTCAGACAGA 6060 

GGCGTCTATA AGTGCGTGGC CAGCAATGCA GCCGGGGCGG ACAGCCTGGC CATCCGCCTG 6120 

CACGTGGCGG CACTGCCCCC CGTTATCCAC CAGGAGAAGC TGGAGAACAT CTCGCTGCCC 6180 

20 CCGGGGCTCA GCATTCACAT TCACTGCACT GCCAAGGCTG CGCCCCTGCC CAGCGTGCGC 6240 

TGGGTGGTCG GGGACGGTAC CCAGATCCGC CCCTCGCAGT TCCTCCACGG GAACTTGTTT 6300 

GTTTTCCCCA ACGGGACGCT CTACATCCGC AACCTCGCGC CCAAGGACAG CGGGCGCTAT 63 60 

GAGTGCGTGG CCGCCAACCT GGTAGGCTCC GCGCGCAGGA CGGTGCAGCT GAACGTGCAG 6420 

CGTGCAGCAG CCAACGCGCG GAT CACGGGC ACCTCCCCGC GGAGGACGGA CGTCAGGTAC 6480 

25 GGAGGAACCC TCAAGCTGGA CTGCAGCGCC TCGGGGGACC CCTGGCCGCG CATCCTCTGG 6540 

AGGCTGCCGT CCAAGAGGAT GATCGACGCG CTCTTCAGTT TTGATAGCAG AAT CAAGGTG 6600 

TTTGCCAATG GGACCCTGGT GGTGAAAT CA GTGACGGACA AAGATGCCGG AGATTACCTG 6660 

TGCGTAGCTC GAAATAAGGT TGGTGATGAC TACGTGGTGC TCAAAGTGGA TGTGGTGATG 672 0 

AAACCGGCCA AGATTGAACA CAAGGAGGAG AACGACCACA AAGT CTTCTA CGGGGGTGAC 6780 

30 CTGAAAGTGG ACTGTGTGGC CACCGGGCTT CCCAATCCCG AGATCTCCTG GAGCCTCCCA 6840 

GACGGGAGTC TGGTGAACTC CTTCATGCAG TCGGATGACA GCGGTGGACG CACCAAGCGC 690 0 

TATGTCGTCT TCAACAATGG GAC ACT CT AC TTTAACGAAG TGGGGATGAG GGAGGAAGGA 6960 

GACTACACCT GCTTTGCTGA AAATCAGGTC GGGAAGGACG AGATGAGAGT CAGAGTCAAG 702 0 

GTGGTGACAG CGCCCGCCAC CATCCGGAAC AAGACTTACT TGGCGGTTCA GGTGCCCTAT 7080 

35 GGAGACGTGG TCACTGTAGC CTGTGAGGCC AAAGGAGAAC CCATGCCCAA GGTGACTTGG 7140 

TTGTCCCCAA CCAACAAGGT GATCCCCACC TCCTCTGAGA AGT AT CAGAT ATACCAAGAT 720 0 

GGCACTCTCC TTATTCAGAA AGCCCAGCGT TCTGACAGCG GCAACTACAC CTGCCTGGTC 72 60 

AGGAACAGCG CGGGAGAGGA TAGGAAGACG GTGTGGATTC ACGT CAACGT CCAGCCACCC 732 0 

AAGATCAACG GTAACCCCAA CCCCATCACC ACCGTGCGGG AGATAGCAGC CGGGGGCAGT 7380 

40 CGGAAACTGA TTGACTGCAA AGCTGAAGGC ATCCCCACCC CGAGGGTGTT ATGGGCTTTT 7440 

CCCGAGGGTG TGGTTCTGCC AGCTCCATAC TATGGAAACC GGAT CACTGT CCATGGCAAC 7500 

GGTTCCCTGG ACATCAGGAG TTTGAGGAAG AGCGACTCCG TCCAGCTGGT ATGCATGGCA 7560 

CGCAACGAGG GAGGGGAGGC GAGGTTGATC GTGCAGCTCA CTGTCCTGGA GCCCATGGAG 762 0 

AAACCCATCT TCCACGACCC GATCAGCGAG AAGATCACGG CCATGGCGGG CCACACCATC 7680 

45 AGCCTCAACT GCTCTGCCGC GGGGACCCCG ACACCCAGCC TGGTGTGGGT CCTTCCCAAT 7740 

GGCACCGATC TGCAGAGTGG ACAGCAGCTG CAGCGCTTCT ACCACAAGGC TGACGGCATG 7800 

CTACACATTA GCGGTCTCTC CTCGGTGGAC GCTGGGGCCT ACCGCTGCGT GGCCCGCAAT 7860 

GCCGCTGGCC ACACGGAGAG GCTGGTCTCC CTGAAGGTGG GACTGAAGCC AGAAGCAAAC 7920 

AAGCAGTATC ATAACCTGGT CAGCATCATC AATGGTGAGA CCCTGAAGCT CCCCTGCACC 7980 

50 CCTCCCGGGG CTGGGCAGGG ACGTTTCTCC TGGACGCTCC CCAATGGCAT GCATCTGGAG 8040 

GGCCCCCAAA CCCTGGGACG CGTTTCTCTT CTGGACAATG GCACCCTCAC GGTTCGTGAG 8100 

GCCT CGGTGT TTGACAGGGG TACCTATGTA TGCAGGATGG AGACGGAGTA CGGCCCTTCG 8160 

GTCACCAGCA TCCCCGTGAT TGTGATCGCC TATCCTCCCC GGATCACCAG CGAGCCCACC 822 0 

CCGGTCATCT ACACCCGGCC CGGGAACACC GTGAAACTGA ACTGCATGGC TATGGGGATT 82 80 

55 CCCAAAGCTG ACATCACGTG GGAGTTACCG GATAAGTCGC ATCTGAAGGC AGGGGTTCAG 8340 

GCTCGTCTGT ATGGAAACAG ATTTCTTCAC CCCCAGGGAT CACTGACCAT CCAGCATGCC 8400 

ACACAGAGAG ATGCCGGCTT CTACAAGTGC ATGGCAAAAA ACATTCTCGG CAGTGACTCC 8460 

AAAACAACTT ACATCCACGT CTTCTGAAAT GTGGATTCCA GAATGATTGC TTAGGAACTG 8520 

ACAACAAAGC GGGGTTTGTA AGGGAAGCCA GGTTGGGGAA TAGGAGCTCT TAAATAATGT 8580 

60 GTCACAGTGC ATGGTGGCCT CTGGTGGGTT TCAAGTTGAG GTTGAT CTTG ATCTACAATT 8640 

GTTGGGAAAA GGAAGCAATG CAGACACGAG AAGGAGGGCT CAGCCTTGCT GAGACACTTT 8700 

CTTTTGTGTT TACATCATGC CAGGGGCTTC ATTCAGGGTG TCTGTGCTCT GACTGCAATT 8760 

TTTCTTCTTT TGCAAATGCC ACT CG ACTG C CTTCATAAGC GTCCATAGGA TATCTGAGGA 882 0 

ACATTCATCA AAAATAAGCC ATAGACATGA ACAACACCTC ACTACCCCAT TGAAGACGCA 8 880 

65 TCACCTAGTT AACCTGCTGC AGTTTTTACA TGATAGACTT TGTT CCAGAT TGACAAGTCA 8940 

TCTTTCAGTT ATTTCCTCTG TCACTTCAAA ACTCCAGCTT GCCCAATAAG GATTTAGAAC 900 0 

CAGAGTGACT GATATATATA TATATATTTT AATT CAGAGT TACATACATA CAGCTACCAT 9060 

TTTATATGAA AAAAGAAAAA CATTTCTTCC TGGAACTCAC TTTTTATATA ATGTTTTATA 912 0 

TATATATTTT TTCCTTTCAA ATCAGACGAT GAGACTAGAA GGAGAAATAC TTTCTGTCTT 9180 

70 ATTAAAATTA ATAAATTATT GGTCTTTACA AGACTTGGAT ACATTACAGC AGACATGGAA 924 0 

ATATAATTTT AAAAAATTTC TCTCCAACCT CCTTCAAATT CAGTCACCAC ■ TGTT AT ATT A 930 0 

CCTTCTCCAG GAACCCTCCA GTGGGGAAGG CTGCGATATT AGATTTCCTT GTATGCAAAG 93 60 

TTTTTGTTGA AAGCTGTGCT CAGAGGAGGT GAGAGGAGAG GAAGGAGAAA ACTGCATCAT 9420 

AACTTTACAG AATTGAATCT AGAGTCTTCC CCGAAAAGCC CAGAAACTTC TCTGCAGTAT 9480 

75 CTGGCTTGTC CATCTGGTCT AAGGTGGCTG CTTCTTCCCC AGCCATGAGT CAGTTTGTGC 9540 

CCATGAATAA TACACGACCT GTTATTTCCA TGACTGCTTT ACTGTATTTT TAAGGTCAAT 9600 
ATACTGTACA TTTGATAATA AAATAATATT CTCCCAAAAA AAAAA 



PCT/US02/12476 



80 



Seq ID NO: 417 Protein sequence 
Protein Accession #: NP 056234.1 



1 11 21 31 41 51 

I ! I I I I 

MPKRAHWGAL SWLILLWGH PRVALACPHP CACYVPSEVH CTFRSLASVP AGIARHVERI 60 

85 NLGFNSIQAL SETSFAGLTK LELLMIHGNE IPSIPDGALR DLSSLQVFKP SYNKLRVITG 12 0 

QTLQGLSNLM RLHIDHNKIE FIHPQAFNGL TSLRLLHLEG NLLHQLHPST FSTFTFLDYF 180 

RLSTIRHLYL AENMVRTLPA SMLRNMPLLE NLYLQGNPWT CD CEMRWFLE WDAKSRGILK 240 
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CKKDKAYEGG QLCAMCFSPK KLYKHEIHKL KDMTCLKPSI ESPLRQNRSR SIEEEQEQEE 300 

DGGSQLILEK FQLPQWSISL NMTDEHGNMV NLVCD I KKPM DVYKIHLNQT DPPDIDINAT 3 60 

VALDFECPMT RENYEKLWKI, IAYYSEVPVK LHRELMLSKD PRVSYQYRQD ADEEALYYTG 420 

VRAQILAEPE WVMQPSIDIQ LNRRQSTAKK VLLSYYTQYS QTISTKDTRQ ARGRSWVMIE 480 

PSGAVQRDQT VLEGGPCQLS CNVKASESPS IFWVLPDGSI LKAPMDDPDS KFSILiSSGWL 540 

RIKSMEPSDS GLYQCIAQVR DEMDRMVYRV LVQSPSTQPA EKDTVTIGKN PGESVTLPCN 600 

ALAIPEAHIiS WILPNRRIIN DLANTSHVYM LPNGTLSIPK VQVSDSGYYR CVAVNQQGAD 660 

HFTVGITVTK KGSGLPSKRG RRPGAKALSR VREDIVEDEG GSGMGDEENT SRRLLHPKDQ 72 0 

EVFLKTKDDA INGDKKAKKG RRKLKLWKHS EKEPETNVAE GRRVFESRRR INMANKQINP 780 

ERWADILAKV RGKNLPKGTE VPPLIKTTSP PSLSLEVTPP FPAVSPPSAS PVQTVTSAEE 840 

SSADVPLLGE EEHVLGTISS ASMGLEHNHN GVILVEPEVT STPLEEWDD LSEKTEEITS 90 0 

TEGDL KGTAA PTLISEPYEP SPTLHTLDTV YEKPTHEETA TEGWSAADVG SSPEPTSSEY 960 

EPPLDAVSLA ESEPMQYFDP DLETKSQPDE DKMKEDTFAH LTPTPTIWVN DSSTSQLFED 1020 

STIGEPGVPG QSHLQGLTDN IHLVKSSLST QDTLLIKKGM KEMSQTLQGG NMLEGDPTHS 1080 

RSSESEGQES KSITLPDSTL GIMSSMSPVK KPAETTVGTL LDKDTTTVTT TPRQKVAPSS 1140 

TMSTHPSRRR PNGRRRLRPN KFRHRHKQTP PTTFAPSETF STQPTQAPDI KISS^QVESSL 1200 

VPTAWVDNTV NTPKQLEMEK NAEPTSKGTP RRKHGKRPNK HRYTPSTVSS RASGSKPSPS 1260 

PENKHRNIVT PSSETILLPR TVSLKTEGPY DSLDYMTTTR KIYSSYPKVQ ETLPVTYKPT 1320 

SDGKE IKDDV ATNVDKHKSD ILVTGESITN AIPTSRSLVS TMGEFKEESS PVGFPGTPTW 1380 

NPSRTAQPGR LQTDIPVTTS GENLTDPPLL KELEDVDFTS EFLSSLTVST PFHQEEAGSS 144 0 

TTLSSIKVEV ASSQAETTTL DQDHLETTVA ILLSETRPQN HTPTAARMKE PASSSPSTIL 1500 

MSLGQTTTTK PALPSPRISQ AS RD SKENVF LNYVGNPETE ATPVNNEGTQ HMSGPNELST 1560 

PSSDRDAFNL STKLELEKQV FGSRSLPRGP DSQRQDGRVH ASHQLTRVPA KP I LPTATVR 1620 

LPEMSTQSAS RYFVTSQSPR HWTNKPEITT YPSGALPENK QFTTPRLSST TIPLPLHMSK 1680 

PSIPSKFTDR RTDQFNGYSK VFGNNN I PEA RNPVGKPPSP RIPHYSNGRL PFFTNKTLSF 1740 

PQLGVTRRPQ IPTSPAPVMR ERKVIPGSYN RIHSHSTFHD DFGPPAPPLL HTPQTTGSPS 1800 

TNLQNIPMVS STQSSISFIT SSVQSSGSFH QSSSKFFAGG PPASKFWSLG EKPQILTKSP 1860 

QTVSVTAETD TVFPCEATGK PKPFVTWTKV STGALMTPNT RIQRFEVLKN GTLVIRKVQV 1920 

QDRGQYMCTA SNLHGLDRMV VLLSVTVQQP QILASHYQDV TVYLGDTIAM ECLAKGTPAP 1980 

QISWIFPDRR VWQTVSPVES R I TLHENRTL SIKEASFSDR GVYKCVASNA AGADSLAIRL 2 040 

HVAALPPVIH QEKLENISLiP PGLSIHIHCT AKAAPLPSVR WVLGDGTQIR PSQFLHGNLF 2100 

VFPNGTLYIR NLAPKDSGRY ECVAANLVGS ARRTVQLNVQ RAAANARITG TSPRRTDVRY 2160 

GGTLKLDCSA SGDPWPRILW RLPSKRMIDA LFSFDSRIKV FANGTLWKS VTDKDAGDYL 222 0 

CVARNKVGDD YWLKVDWM KPAKIEHKEE NDHKVFYGGD LKVDCVATGL PNPEISWSLP 22 80 

DGSLVNSFMQ SDDSGGRTKR YWFNNGTLY FNEVGMREEG DYTCFAENQV GKDEMRVRVK 2340 

WTAPATIRN KTYLAVQVPY GDWTVACEA KGEPMPKVTW LSPTNKVIPT SSEKYQIYQD 2400 

GTLLIQKAQR SDSGNYTCLV RNSAGEDRKT VWIHVNVQPP KINGNPNPIT TVREIAAGGS 2460 

RKLIDCKAEG IPTPRVLWAF PEGWLPAPY YGNRITVHGN GSLDIRSLRK SDSVQLVCMA 252 0 

RNEGGEARL I VQLTVLEPME KPIFHDPISE K I TAMAGHT I SLNCSAAGTP TPSLVWVLPN 2580 

GTDLQSGQQL QRFYHKADGM LHISGLSSVD AGAYRCVARN AAGHTERLVS LKVGLKPEAN 2640 

KQYHNLVS I I NGETLKL PCT PPGAGQGRFS WTLPNGMHLE GPQTLGRVSL LDNGTLTVRE 2700 

ASVFDRGTYV CRMETEYGPS VTSIPVIVIA YPPRITSEPT PVIYTRPGNT VKLNCMAMGI 2760 

PKADITWELP DKSHLKAGVQ ARLYGNRFLH PQGSLTIQHA TQRDAGFYKC MAKNILGSDS 2 820 
KTTYIHVF 

Seq ID NO: 418 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..50 01 

1 11 21 31 41 51 

I I I 1 t I 

ATGCCAGGCA CAAAACTAAC CCGAACAGGC GCCCCAGCAG ACTACAGAGT GATATTGAAG 60 

ACCT CTCAAG AGGACGAATT GGATGTACCT GACGACATCA GCGTCCGGGT TATGTCATCT 12 0 

CAGTCTGTGC TTGTGTCCTG GGTGGATCCT GTTCTGGAAA AACAGAAGAA AGTTGTTGCA 180 

TCAAGACAGT ACACCGTGCG CTATCGAGAG AAGGGGGAAT TGGCCAGGTG GGATTATAAG 240 

CAGATCGCTA ACAGGCGTGT GCTGATTGAG AACCTGATTC CAGACACTGT GTATGAATTT 3 00 

GCAGTCCGTA TTTCACAGGG TGAAAGAGAT GGCAAATGGA GTACGTCAGT CTTCCAAAGA 360 

ACACCAGAAT CTGCCCCTAC CACAGCTCCT GAAAACTTGA ACGTCTGGCC AGT CAATGGC 420 

AAACCTACAG TTGTCGCTGC ATCTTGGGAT GCGCTACCAG AGACTGAGGG GAAAGTGAAA 480 

GTCTGTCTGC TGGACACAGG ACTGTTTTCA GTTTCCTCCT TCCAACCATC TGCCAAATCA 540 

TT TCAGAAT A CATTCTTTCA TACGCCCCGG CTCTCAAACC ATTTGGAGCA AAGTCCCTCA 600 

CCTATCCTGG AGACACTACT TCTGCCCTGG TGGATGGTCT GCAGCCTGGG GAACGCT AT C 660 

TTTTCAAAAT CCGGGCCACA AACAGGAGAG GCCTGGGACC TCACTCCAAA GCCTTCATTG 720 

TCGCTATGCC AACAAGAATG CAGCTGTACC CAGAAGGATT TCAGTTGTCT AGCTTACCTG 780 

ATCGATATCC AAACCAAACA AGTTAATAAA GATCCACAAC TGGAAGGGAG TGTTTTTGGA 840 

CCATGTTTTC TTTTCTACTT CCTCACATTT ATGCTGGATA TTGGCGGCTT TTCCTTCATT 900 

ATGTGCTATG AAGACCCANN TGTTTCTTCT TTGACAGGCA ATTCTTTAAA ATCTGTTGCA 960 

GCCAGTAAGG CGGATGTTCA GCAGAACACG GAGGACAATG GGAAACCCGA AAAACCTGAG 102 0 

CCTTCCTCAC CTTCTCCCAG AGCTCCAGCT TCCTCCCAAC ACCCCTCTGT GCCTGCTTCT 10 80 

CCCCAAGGGA G AAATGCCAA GGACCTTCTT CTTGACTTGA AGAACAAAAT ATTGGCTAAT 1140 

GGTGGGGCGC CCCGAAAACC CCAGCTTCGC GCCAAGAAGG CAGAGGAGCT GGATCTTCAG 1200 

TCGACAGAAA TCACTGGGGA GGAGGAGCTG GGTTCCCGGG AGGACTCGCC CATGTCACCC 1260 

TCAGACACCC AAGACCAGAA ACGGACCCTG AGGCCGCCAA GTAGACACGG CCACTCGGTG 132 0 

GTTGCTCCCG GCAGGACTGC AGTGAGGGCC CGGATGCCAG CGCTGCCCCG AAGGGAAGGC 1380 

GTAGATAAGC CTGGCTTTTC CCTGGCCACG CAGCCCCGCC CAGGGGCGCC CCCCTCGGCT 1440 

TCGGCCTCTC CTGCCCACCA CGCGTCCACC CAGGGCACCT CTCATCGTCC TTCCCTGCCT 1500 

GCCAGCTTGA ATGACAACGA CTTGGTGGAC TCAGACGAAG ATGAGCGCGC TGTGGGCTCC 1560 

CTCCACCCCA AGGGCGCCTT CGCCCAGCCC CGGCCAGCCC TGTCCCCCAG CCGCCAGTCC 1620 

CCGTCCAGCG TTCTCCGCGA CAGAAGCTCT GTGCACCCCG GCGCAAAGCC AGCCTCGCCG 1680 

GCGCGGAGGA CCCCCCATTC AGGGGCCGCA GAGGAAGATT CCAGTGCCTC AGCCCCACCC 1740 

TCAAGACTTT CTCCACCCCA TGGGGG AT CA TCTCGGCTGC TGCCCACCCA GCCACACCTG 1800 

AGCTCTCCAC TTTCCAAGGG CGGGAAGGAT GGTGAGGACG CCCCAGCCAC CAACTCCAAT 1860 

GCGCCATCAC GGT CCACCAT GTCCTCCTCC GTCTCTTCTC ATCTCTCGTC CAGGACGCAG 1920 

GTCTCTGAGG GAGCGGAGGC TTCTGATGGT GAAAGCCACG GTGACGG CGA TAGGGAAGAC 1980 

GGCGGAAGGC AGGCGGAGGC CACGGCCCAG ACGCTGCGGG CCCGGCCTGC CTCTGGACAC 2040 

TTCCATTTGC TCAGACACAA ACCCTTTGCT GCCAACGGGA GGTCTCCAAG CAGGTTCAGC 2100 

ATTGGGCGGG GACCTCGGCT GCAGCCCTCC AGCTCCCCAC AGTCGACTGT GCCCTCCCGA 2160 
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GCCCACCCCA GGGTTCCCTC T C ACT CTGAT TCCCACCCTA AGCTTAGCTC AGGTATCCAT 222 0 

GGAGACGAGG AGGATGAGAA GCCGCTTCCT GCCACCGTTG TCAATGACCA CGTGCCTTCC 2280 

TCCTCCAGGC AGCCCATCTC CCGGGGCTGG GAGGACTTAA GGAGAAGCCC GCAGAGAGGG 2340 

GCCAGCCTGC ATCGGAAGGA ACCCATCCCA GAGAACCCCA AATCCACAGG GGCAGATACA 2400 

CATCCTCAGG GCAAGTACTC CTCCCTGGCC TCCAAGGCTC AGGATGTTCA ACAGAGCACA 2460 

GACGCGGACA CGGAGGGTCA TTCTCCCAAA GCACAGCCAG GGTCCACAGA CCGCCACGCG 252 0 

TCCCCTGCTC GTCCTCCCGC AGCACGGTCA CAGCAGCATC CCAGTGTTCC CAGAAGGATG 2580 

ACACCCGGCC GGGCCCCAGA ACAGCAGCCC CCTCCTCCCG TCGCCACGTC CCAGCACCAC 2640 

CCGGGACCCC AGAGCAGAGA CGCGGGTCGG TCACCTTCCC AGCCCAGGCT CTCACTGACC 2700 

CAGGCCGGGC GGCCCCGCCC CACGTCGCAG GGCCGCTCCC ACTCCTCCTC GGACCCTTAC 2760 

ACGGCGAGCT CCAGAGGGAT GCTCCCCACG GCCCTCCAGA ACCAGGACGA GGATGCCCAG 2820 

GGCAGCTACG ACGACGACAG CACAGAAGTC GAGGCCCAGG ATGTGCGGGC CCCCGCGCAC 2880 

GCCGCGCGCG CCAAGGAGGC AGCTGCGTCC CTTCCCAAGC ACCAGCAGGT GGAGTCTCCC 2940 

ACAGGCGCAG GGGCAGGTGG CGACCACAGG TCCCAGCGCG GACATGCGGC CTCCCCCGCC 3000 

AGGCCCAGCC GACCCGGCGG CCCCCAGTCC CGCGCCCGGG TCCCCAGCAG GGCAGCGCCG 3060 

GGGAAGTCGG AGCCTCCTTC CAAGCGGCCC CTGTCCTCCA AGTCCCAGCA GTCGGTCTCA 3120 

GCCGAGGACG AGGAGGAGGA GGACGCGGGG TTTTTTAAAG GCGGGAAAGA AGACCTTCTG 3180 

TCTTCCTCTG TGCCAAAGTG GCCCTCTTCC TCCACTCCCA GGGGCGG CAA AGACGCCGAT 3240 

GGGAGCCTCG CCAAGGAAGA GAGGGAGCCT GCCATCGCGC TTGCCCCTCG CGGAGGGAGC 3300 

CTGGCTCCTG TGAAGCGACC TCTCCCCCCA CCTCCAGGCA GCTCCCCCAG GGCCTCCCAC 3360 

GTCCCTTCCC GACCGCCGCC TCGCAGCGCT GCCACCGTGA GCCCCGTCGC GGGCACCCAC 3420 

CCCTGGCCGC GGTACACCAC GCGCGCCCCV CCTGGCCACT TCTCCACCAC CCCGATGCTG 3480 

TCCTTGCGCC AGAGGATGAT GCATGCCAGA TTCCGTAACC CTCTCTCCCG ACAGCCTGCG 3540 

AGACCCTCTT ACAGACAAGG TTATAATGGC AGACCAAATG TAGAAGGGAA AGTCCTTCCT 3600 

GGTAGTAATG GAAAACCGAA TGGACAGAGA ATTATCAATG GCCCTCAAGG AACAAAGTGG 3 660 

GTTGTGGACC TTGATCGTGG GTTAGTATTG AATGCAGAAG GAAGGTACCT CCAAGATTCA 372 0 

CATGGAAATC CTCTTCGGAT TAAACTAGGA GGAGATGGTC GAACCATTGT AGATCTGGAA 3780 

GGGACCCCCG TGGTGAGTCC TGACGGCCTC CCACTCTTTG GGCAGGGGCG ACATGGCACA 3840 

CCTCTGGCCA ATGCCCAAGA TAAGCCAATT TTGAGTCTTG GAGGAAAGCC GCTGGTGGGC 3 900 

TTGGAGGTCA TCAAAAAAAC CACCCATCCC CCTACCACTA CCATGCAGCC CACCACTACT 3960 

ACGACGCCCC TGCCTACCAC TACAACCCCG AGGCCCACCA CTGCCACCAC CATGCAGCCC 402 0 

ACCACTACTA CGACGCCCCT GCCTACCACT ACACCGAGGC CCACCACTGC CACCACCCGC 4080 

CGCACGACCA CCAGGCGTCC AACAACCACA GTCCGAACCA CTACGCGGAC AACCACCACC 4140 

ACCACCCCCA AACCCACCAC TCCCATCCCC ACCTGTCCCC CTGGGACCTT GGAACGGCAC 4200 

GACGATGATG GCAACCTGAT AATGAGCTCC AATGGGATCC CAGAGTGCTA CGCTGAAGAA 42 60 

GATGAGTTCT CAGGCTTGGA GACTGACACT GCAGTACCTA CGGAAGAGGC CTACGTTATA 432 0 

TATGATGAAG ATTATGAATT TGAGACGTCA AGGCCACCAA CCACCACTGA GCCTTCGACC 4380 

ACTGCTACCA CACCGAGGGT GATCCCAGAG GAAGGCGCCA TCAGTTCCTT TCCTGAAGAA 4440 

GAATTTGATC TGGCTGGAAG GAAACGATTT GTTGCTCCTT ACGTGACGTA CCTAAATAAA 4500 

GACCCATCAG CCCCGTGCTC TCTGACTGAT GCACTGGATC ACTTCCAAGT GGACAGCCTG 4560 

GATGAAATCA TCCCCAATGA CCTGAAGAAG AGTGATCTGC CTCCCCAGCA TGCTCCCCGC 462 0 

AACATCACCG TGGTGGCCGT GGAAGGTTGC CACTCATTTG TCATTGTGGA TTGGGACAAA 4680 

GCCACCCCAG GAGATTTGGT CACAGGT TAT TTGGTTTACA GTGCATCCTA TGAAGATTTC 4740 

ATCAGGAACA AGTTTTCCAC TCAAGCTTCA TCAGTAACTC ACTTGCCCAT TGAGAACCTA 4800 

AAGCCCAACA CGAGGTATTA TTTTAAAGTG CAAGCACAAA ATCCTCATGG CTACGGACCT 4860 

ATCAGCCCTT CGGTCTCATT TGTCACCGAA TCAGATAATC CTCTGCTTGT TGTGAGGCCC 4920 

CCAGGCGGTG AGCTATCTGG ATCCCATTCG CTTTCAAACA TGATCCCAGC TACACGGACT 4 980 

GCCATGGACG GCAATATGTG AAGCGCACGT GGTATCGAAA GTTCGTGGGA GTTGTTCTTT 5040 

GTAATTCACT GAGGTATAAA ATCTACCTCA GTGACAACCT GAAAGATACA TTCTACAGCA 510 0 

TTGGAGACAG CTGGGGAAGA GGTGAAGACC ATTGCCAATT TGTGGATTCA CACCTTGATG 5160 

GAAGAACAGG GCCTCAGTCC TATGTAGAAG CCCTCCCTAC TATTCAAGGC TACTATCGCC 5220 

AGT AT CGTC A GGAGCCTGTC AGGTTTGGGA ACATCGGCTT CGGAACCCCC TACTACTATG 5280 

TGGGCTGGTA CGAGTGTGGG GTCTCCATCC CTGGAAAGTG GTAATCACAG GACCGTCATG 5340 

CTGCAAGCTT GCCCTGCCCA GCCCCACCAA CTAAGTCGCA CTAGGGGCTG TGAGCAAAGA 5400 

CAGCCAGCAT GCTCAGCCCC GCTGCCCTAG GTGCCAGGAA GGTCACAGAT GGACACTGGC 5460 

CATTCTGGTC ATCTCAGTCT GGAACTCAGT CCCACTTCTT GGCCTGGACA ATGAACAGGA 5520 

TTCAGTTTTG CTGTTAACTT TGCTTCTCTA CTTTTTTTTG TTTGTTTGTA ATAGCACATC 5580 

CCAGAGACAT CAGAAACCAG CAACTGATTC AGTGTGATTT CCCAGACTTT TTAGGCATGA 5640 

AATTCGGACA CTTCAGTATT TCCAGGAATA GCATATGCAC GCTGTTCTTG CTTCATGGAA 57 0 0 

* TGCTACATGC TTTCTGTTTT TCTCATTTTG GATTTCTCCA AAACTAACTG AATTTAAGCT 5760 

TCAGGTCCCT TTGTATG CAG TAGAAAGGAA TTATTAAAAA CACCACCAAA GAAAATAAAT 5820 

ATATCCTACT TGAAATTTAC TCTATGGACT TACCCACTGG TAGAATAAAT GTATCAAATC 5880 

TTATTTGTAA ATTCTCAATT TTGATATATA TATGTATATA TGCATATACA TATCCACACT 5 940 

TGTCTGCAAG AATATTGATT AAAATTG CT A AATTTGTACT TGTTCACCAA AAAAAAAAAA 6000 
AAAAAAA 

Seq ID NO: 419 Protein sequence 
Protein Accession #: Eos sequence 

1 11 21 31 41 51 

I i I I I I 

MPGTKLTRTG APADYRVILK TSQEDELDVP DDISVRVMSS QSVLVSWVDP VLEKQKKWA 60 

SRQYTVRYRE KGELARWDYK QIANRRVLIE NLIPDTVYEF AVRISQGERD GKWSTSVFQR 12 0 

TPESAPTTAP ENLNVWPVNG KPTWAASWD ALPETEGKVK VCLLDTGIjFS VSSFQPSAKS 180 

FQNTFFHTPR LSNHLEQSPS PILETIjLLPW WMVCSLGNAI FSKSGPQTGE AWDLTPKPSL 240 

SLCQQECSCT QKDFSCLAYL IDIQTKQVNK DPQLEGSVFG PCFLFYFLTF MLDIGGFSFI 300 

MCYEDPVSSh TGNSLKSVAA SKADVQQNTE DNGKPEKPEP SSPSPRAPAS SQHPSVPASP 3 60 

QGRNAKDLLL DLKNKILANG GAPRKPQLRA KKAEELDLQS TEITGEEELG SREDSPMSPS 420 

DTQDQKRTLR PPSRHGHSW APGRTAVRAR MPALPRREGV DKPGFSLATQ PRPGAPPSAS 480 

ASPAHHASTQ GTSHRPSLPA SLNDNDLVDS DEDERAVGSL HPKGAFAQPR PALSPSRQSP 540 

SSVLRDRSSV HPGAKPASPA RRTPHSGAAE EDSSASAPPS RLSPPHGGSS RLLPTQPHLS 600 

SPLSKGGKDG EDAPATNSNA PSRSTMSSSV SSHLSSRTQV SEGAEASDGE SHGDGDREDG 660 

GRQAEATAQT LRARPASGHF HLLRHKPFAA NGRSPSRFSI GRGPRLQPSS SPQSTVPSRA 720 

HPRVPSHSDS HPKLSSGIHG DEEDEKPLPA TWNDHVPSS SRQPISRGWE DLRRSPQRGA 780 

SLHRKEPIPE NPKSTGADTH PQGKYSSLAS KAQDVQQSTD ADTEGHSPKA QPGSTDRHAS 840 

PARPPAARSQ QHPSVPRRMT PGRAPEQQPP PPVATSQHHP GPQSRDAGRS PSQPRLSLTQ 900 

AGRPRPTSQG RSHSSSDPYT ASSRGMLPTA LQNQDEDAQG SYDDD STEVE AQDVRAPAHA 960 
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ARAKEAAASL PKHQQVBSPT GAGAGGDHRS QRGHAASPAR PSRPGGPQSR ARVPSRAAPG 

KSEPPSKRPL SSKSQQSVSA EDEEEEDAGF FKGGKEDLLS SSVPKWPSSS TPRGGKDADG 

SLAKEEREPA IALAPRGGSL APVKRPLPPP PGSSPRASHV PSRPPPRSAA TVSPVAGTHP 

WPRYTTRAPP GHFSTTPMLS LRQRMMHARF RNPLSRQPAR PSYRQGYNGR PNVEGKVLPG 

SNGKPNGQRI INGPQGTKWV VDLDRGLVLN AEGRYLQDSH GNPIjRIKLGG DGRTIVDLEG 

TPWSPDGLP LFGQGRHGTP LANAQDKPIL. SLGGKPLVGL EVIKKTTHPP TTTMQPTTTT 

TPLPTTTTPR PTTATTMQPT TTTTPLPTTT PRPTTATTRR TTTRRPTTTV RTTTRTTTTT 

TPKPTTPIPT CPPGTLBRHD DDGNLIMSSN GIPECYAEED EFSGLETDTA VPTEEAYVIY 

DEDYEFETSR PPTTTEPSTT ATTPRVI PEE GAISSFPEEE FDLAGRKRFV APYVTYLNKD 

PSAPCSLTDA LDHFQVDSLD EIIPNDLKKS DLPPQHAPRN I TWAVEGCH SFVIVDWDKA 

TPGDLVTGYL VYSASYEDFI RNKFSTQASS VTHLPIENLK PNTRYYFKVQ AQNPHGYGPI 

SPSVSFVTES DNPLLWRPP GGEIiSGSHSL SNMIPATRTA MDGNM 

Seq ID NO: 420 DNA sequence 

Nucleic Acid Accession #: NM_022743 

Coding sequence: 128.. 1237 

1 11 21 31 41 51 

I I I I i I 

GTGGATTTTA GAGATACCTC CCCTCCTTCT GCTCAGCTGC CTTGCAGTAA TTAAACTCTT 
TCTCTGCTGC AACACCCCTA CTGTTCTCCG TGTATTGGCT TTTCTGGGCA GCAGGAAGGA 
AAAGCTGATG CGATGCTCTC AGTGCCGCGT CGCCAAATAC TGTAGTGCTA AGTGT CAGAA 
AAAAGCTTGG CCAGACCACA AGCGGGAATG CAAATGCCTT AAAAGCTGCA AACCCAGATA 
TCCTCCAGAC TCCGTTCGAC TTCTTGGCAG AGTTGTCTTC AAACTTATGG ATGGAGCACC 
TTCAGAATCA GAGAAGCTTT ACTCATTTTA TGATCTGGAG TCAAATATTA ACAAACTGAC 
TGAAGATAAG AAAGAGGGCC TCAGGCAACT CGTAATGACA TTTCAACATT TCATGAGAGA 
AGAAATACAG GATGCCTCTC AGCTGCCACC TGCCTTTGAC CTTTTTGAAG CCTTTGCAAA 
AGTGATCTGC AACTCTTTCA CCATCTGTAA TGCGGAGATG CAGGAAGTTG GTGTTGGCCT 
AT AT CCCAGT ATCTCTTTGC TCAATCACAG CTGTGACCCC AACTGTTCGA TTGTGTTCAA 
TGGGCCCCAC CTCTTACTGC GAGCAGTCCG AGACATCGAG GTGGGAGAGG AGCTCACCAT 
CTGCTACCTG GATATG CTG A TGACCAGTGA GGAGCGCCGG AAGCAGCTGA GGGACCAGTA 
CTGCTTTGAA TGTGACTGTT TCCGTTGCCA AACCCAGGAC AAGGATGCTG ATATGCTAAC 
TGGTGATGAG CAAGTATGGA AGGAAGTTCA AGAAT CCCTG AAAAAAATTG AAGAACTGAA 
GGCACACTGG AAGTGGGAGC AGGTTCTGGC CATGTGCCAG GCGATCATAA GCAGCAATTC 
TGAACGGCTT CCCGATATCA ACATCTACCA GCTGAAGGTG CTCGACTGCG CCATGGATGC 
CTGCATCAAC CTCGGCCTGT TGGAGGAAGC CTTGTTCTAT GGTACTCGGA CCATGGAGCC 
ATACAGGATT TTTTTCCCAG GAAGCCATCC CGTCAGAGGG GT T CAAGTGA TGAAAGTTGG 
CAAACTGCAG CTACATCAAG GCATGTTTCC CCAAGCAATG AAGAAT CTGA GACTGGCTTT 
TGATATTATG AGAGTGACAC ATGGCAGAGA ACACAGCCTG ATTGAAGATT TGATTCTACT 
TTTAGAAGAA TGCGACGCCA ACATCAGAGC ATCCTAAGGG AACGCAGTCA GAGGGAAATA 
CGGCGTGTGT CTTTGTTGAA TGCCTTATTG AGGTCACACA CTCTATGCTT TGTTAGCTGT 
GTGAACCTCT CTTATTGGAA ATTCTGTTCC GTGTTTGTGT AGGTAAATAA AGGCAGACAT 
GGTTTGCAAA CCACAAGAAT CATTAGTTGT AGAGAAGCAC GATTATAATA AATTCAAAAC 
ATTTGGTTGA GGATGCCAAA AAAAAAAAAA AAAAAAA 

Seq ID NO: 421 Protein sequence 
Protein Accession #: NP_073580 

1 11 21 31 41 51 

I I I I I I 

MRCSQCRVAK YCSAKCQKKA WPDHKRECKC LKSCKPRYPP DSVRLLGRW FKLMDGAPSE 

SEKLYSFYDL ESNINKLTED KKEGLRQLVM TFQHFMREEI QDASQLPPAF DLFEAFAKVI 

CNSFTI CNAE MQEVGVGLYP SISLLNHSCD PNCSIVFNGP HLLLRAVRDI EVGEELTICY 

LDMLMTSEER RKQLRDQYCF ECDCFRCQTQ DKDADMLTGD EQVWKEVQES LKKIEELKAH 

WKWEQVLAMC QAIISSNSER LPDINIYQLK VLDCAMDACI NLGLLEEALF YGTRTMEPYR 

IFFPGSHPVR GVQVMKVGKL QLHQGMFPQA MKNLRLAFDI MRVTHGREHS LIEDLIIiLLE 
ECDANIRAS 

Seq ID NO: 422 DNA sequence 

Nucleic Acid Accession #t NM_0 03 014.2 

Coding sequence: 238.. 648 

1 li 21 31 41 51 

I I I 1 i I 

GGCGGGTTCG CGCCCCGAAG GCTGAGAGCT GGCGCTGCTC GTGCCCTGTG TGCCAGACGG 
CGGAGCTCCG CGGCCGGACC CCGCGGCCCC GCTTTGCTGC CGACTGGAGT TTGGGGGAAG 
AAACTCTCCT GCGCCCCAGA AGATTTCTTC CTCGGCGAAG GGACAGCGAA AGATGAGGGT 
GGCAGGAAGA GAAGGCGCTT TCTGTCTGCC GGGGTCGCAG CGCGAGAGGG CAGTGCCATG 
TTCCTCTCCA TCCTAGTGGC GCTGTGCCTG TGGCTGCACC TGGCGCTGGG CGTGCGCGGC 
GCGCCCTGCG AGGCGGTGCG CAT CCCT ATG TGCCGGCACA TGCCCTGGAA CATCACGCGG 
ATGCCCAACC ACCTGCACCA CAGCACGCAG GAGAACGCCA TCCTGGCCAT CGAG CAGTAC 
GAGGAGCTGG TGGACGTGAA CTGCAGCGCC GTGCTGCGCT TCTTCTTCTG TGCCATGTAC 
GCGCCCATTT GCACCCTGGA GTTCCTGCAC GACCCTATCA AGCCGTGCAA GTCGGTGTGC 
CAACGCGCGC GCGACGACTG CGAGCCCCTC ATGAAGATGT ACAACCACAG CTGGCCCGAA 
AGCCTGGCCT GCGACGAGCT GCCTGTCTAT GACCGTGGCG TGTGCATTTC GCCTGAAGCC 
ATCGTCACGG ACCTCCCGGA GGATGTTAAG TGGATAGACA TCACACCAGA CATGATGGTA 
CAGGAAAGGC CTCTTGATGT TGACTGTAAA CGCCTAAGCC CCGATCGGTG CAAGTGTAAA 
AAGGTGAAGC CAACTTTGGC AACGTATCTC AGCAAAAACT ACAGCTATGT TATTCATGCC 
AAAATAAAAG CTGTG C AGAG GAGTGGCTGC AATGAGGTCA CAACGGTGGT GGATGTAAAA 
GAGAT CTTCA AGTCCTCATC ACCCATCCCT CGAACT CAAG TCCCGCTCAT TACAAATTCT 
TCTTGCCAGT GTCCACACAT CCTGCCCCAT CAAGATGTTC TCATCATGTG TTACGAGTGG 
CGTTCAAGGA TGATGCTTCT TGAAAAT TGC TTAGTTGAAA AATGGAGAGA TCAGCTTAGT 
AAAAGATCCA TACAGTGGGA AGAGAGGCTG CAGGAACAGC GGAGAACAGT TCAGGACAAG 
AAGAAAACAG CCGGGCGCAC CAGT CGTAGT AATCCCCCCA AACCAAAGGG AAAGCCTCCT 
GCTCCCAAAC CAGCCAGTCC CAAGAAGAAC ATTAAAACTA GGAGTGCCCA GAAGAGAACA 
AACCCGAAAA G AGTGTG AG C TAACTAGTTT CCAAAGCGGA GACTTCCGAC TTCCTTACAG 
GATGAGGCTG GGCATTGCCT GGGACAGCCT ATGTAAGGCC ATGTGCCCCT TGCCCTAACA 
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ACTCACTGCA GTGCTCTTCA TAGACACATC TTGCAGCATT TTTCTTAAGG CTATGCTTCA 1440 

GTTTTTCTTT GTAAGCCATC ACAAGCCATA GTGGTAGGTT TGCCCTTTGG TACAGAAGGT 1500 

GAGTTAAAGC TGGTGGAAAA GGCTTATTGC ATTGCATTCA GAGTAACCTG TGTGCATACT 1560 

CTAGAAGAGT AGGGAAAATA ATGCTTGTTA CAATTCGACC TAATATGTGC ATTGTAAAAT 1620 

AAATGCCATA TTTCAAACAA AACACGTAAT TTTTTTACAG TATGTTTTAT TACCTTTTGA 1680 

TATCTGTTGT TGCAATGTTA GTGATGTTTT AAAATGTGAT GAAAATATAA TGTTTTTAAG 1740 

AAGGAACAGT AGTGGAATGA ATGTTAAAAG ATCTTTATGT GTTTATGGTC TGCAGAAGGA 180 0 

TTTTTGTGAT GAAAGGGGAT TTTTTGAAAA ATTAGAGAAG TAGCATATGG AAAATTATAA 1860 

TGTGTTTTTT TACCAATGAC TTCAGTTTCT GTTTTTAGCT AGAAACTTAA AAACAAAAAT 1920 

AATAATAAAG AAAAATAAAT AAAAAGGAGA GGCAGACAAT GTCTGGATTC CTGTTTTTTG 1980 

GTTACCTGAT TTCCATGATC ATGATGCTTC TTGTCAACAC CCTCTTAAGC AGCACCAGAA 2 040 

ACAGTGAGTT TGTCTGTACC ATTAGGAGTT AGGTACTAAT TAGTTGGCTA ATGCTCAAGT 2100 

ATTTTATACC CACAAGAGAG GTATGTCACT CATCTTACTT CCCAGGACAT CCACCCTGAG 2160 

AATAATTTGA CAAGCTTAAA AATGGCCTTC ATGTGAGTGC CAAATTTTGT TTTTCTTCAT 2220 

TTAAATATTT TCTTTGCCTA AATACATGTG AGAGGAGTTA AATATAAATG TACAGAGAGG 2280 

AAAGTTGAGT TCCACCTCTG AAATGAGAAT TACTTGACAG TTGGGATACT TTAATCAGAA 2 340 

AAAAAGAACT TATTTGCAGC ATTTTATCAA CAAATTTCAT AATTGTGGAC AATTGGAGGC 2400 

ATTTATTTTA AAAAACAATT TTATTGGCCT TTTGCTAACA CAGTAAGCAT GTATTTTATA 2460 

AGGCATTCAA TAAATGCACA ACGCCCAAAG GAAATAAAAT CCTATCTAAT CCTACTCTCC 2520 

ACTACACAGA GGTAAT CACT ATTAGTATTT TGGCATATTA TTCTCCAGGT GTTTGCTTAT 2580 

GCACTTATAA AATGAT TTGA ACAAATAAAA CTAGGAACCT GTATACATGT GTTTCATAAC 2 640 

CTGCCTCCTT TGCTTGGCCC TTTATTGAGA TAAGTTTTCC TGTCAAGAAA GCAGAAACCA 2700 

TCTCATTTCT AACAGCTGTG TTATATTCCA TAGTATGCAT TACTCAACAA ACTGTTGTGC 2760 
TATTGGATAC TTAGGTGGTT TCTTCACTGA CAATACTGAA TAAACATCTC ACCGGAATTC 

Seq ID NO: 423 Protein sequence 
Protein Accession ft: NP_003005.1 
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MFLSILVALC 
YEELVDVNCS 
ESLACDELPV 
KKVKPTLATY 
SSCQCPHILP 
KKKTAGRTSR 



11 
I 

LWLHLALGVR 
AVLRFFFCAM 
YDRGVCISPE 
LSKNYSYVIH 
HQDVLIMCYE 
SNPPKPKGKP 



21 
I 

GAPCEAVRIP 
YAPICTLEFL 
AIVTDLPEDV 
AKIKAVQRSG 
WRSRMMLLEN 
PAPKPASPKK 



31 

I 

MCRHMPWNIT 
HDPIKPCKSV 
KWIDITPDMM 
CNEVTTWDV 
CLVEKWRDQL 
NIKTRSAQKR 



41 
I 

RMPNHLHHST 
CQRARDDCEP 
VQERPLDVDC 
KEIFKSSSPI 
SKRSIQWEER 
TNPKRV 



51 

I 

QENAILAIEQ 
LMKMYNHSWP 
KRLSPDRCKC 
PRTQVPLITN 
LQEQRRTVQD 



60 
120 
180 
240 
300 



Seq ID NO: 424 DNA sequence 
Nucleic Acid Accession BC010423 
40 Coding sequence: 248.. 1780 



45 



50 



55 



60 



65 



70 



75 



80 



85 



CACAGCGTGG 
AGCTACGGCT 
CAAGTGCGAG 
TCTGCAGCCG 
TTCAACCATG 
GCTGCTACTG 
CGTGGTAACT 
CGGCGAGCAA 
ACTAGCGCTA 
GGAGCAGCCG 
GCAGGCGGAT 
GGCGCGGCTG 
ACTAGAAGAG 
CCCCAGCGTG 
CTCCCGCTCT 
GCAGCCACTG 
CATCCTCCAC 
GTGGCACATT 
CTCATACAAC 
CACTTTGGGC 
CAATGAGTTC 
CTCTGGGAAG 
ACTCTTGTTC 
GGCCCAGCAG 
CCGGAGGCTG 
GAGAGCCGAG 
AGAGCCCGAG 
TGAACTGCTG 
CAAACAGGCC 
CAATGGCATC 
CTAGGCCTGG 
ACACCCCCAT 
AACCCTTCTG 
CACTGTGTGT 
TGACTGTCCG 
AAGTGAACTG 
GTTTGGCGTG 
CAGACCCCAG 
CAGACCCAGG 
TCTCCTACCA 
GAGGCTTGAA 
ACATATTTTC 
ACTTTTAATT 
TTTTATTTTT 



11 
I 

GAAGCAGCTC 
GGGTGTGTAG 
AGGCAAGAAC 
GCTCCCAGGG 
CCCCTGTCCC 
CTGGCATCAT 
GTGGTGCTGG 
GTGGGGCAAG 
CTGCACTCCA 
CCGCCCCCAC 
GAGGGCGAGT 
CGGCTCCGAG 
GGCCAGGGCC 
ACCTGGGACA 
GCTGCCGTCA 
ACTTGTGTGG 
GTGTCCTTCC 
GGCAGAGAAG 
TGGACACGGC 
TTTCCCCCAC 
TCCTCAAGGG 
CAGGTGGACC 
TGCCTTCTGG 
ATGACCCAGA 
CATTCCCATC 
GGCCACCCTG 
GGCCGCAGTT 
TCTCCAGGCT 
ATGAACCATT 
TAC AT CAATG 
CTCCTTCTGT 
TTCTTGCGGA 
TTCATCGGGA 
GTGCATGTGT 
TGGAGGGGTG 
TGGTGTATGT 
TGTGT CATGT 
AGCAGTATTA 
TGTGCGGGCA 
CTTCGGAGCC 
CTGTTACAGA 
TGTAAATATA 
TTTTTCTTTT 
ATTTTTTTTT 



21 
I 

TGGGGGAGCT 
AACGGGGCCG 
TCTGCAGCTT 
AGATCTCGGT 
TGGGAGCCGA 
TTACAGGCCG 
GCCAGGACGC 
TGGCATGGGC 
AATACGGGCT 
GCAACCCCCT 
ACGAGTGCCG 
TGCTGGTGCC 
TGACCCTGGC 
CGGAGGTCAA 
CCT CAGAGTT 
TGTCCCATCC 
TTGCTGAGGC 
GAGCTATGCT 
TGGATGGGCC 
TGACCACTGA 
ATTCTCAGGT 
TAGTGTCAGC 
TGGTGGTGGT 
AATATGAGGA 
ACACGGACCC 
ATAGTCTCAA 
ACTCCACGCT 
CTGGGCGGGC 
TTGTTCAGGA 
GGCGGGGACA 
TGACATGGGA 
AGATGCTCCC 
GGGCTCCACC 
GCCTGTGTGA 
ACTGTGT CCG 
GCCACGGGAT 
GGCTGTGTGT 
ATGATGCAGA 
TAGCTGGAGC 
ATGGGGGCAA 
AGCCCTCTGC 
CATGCGCCGG 
TTTTTTCTTG 
AGAGTTTGAG 



31 
I 

CGGAGCTCCC 
GGGCTGGGGC 
CCTGCCTTCT 
GGAACTT CAG 
GATGTGGGGG 
GTGCCCCGCG 
AAAACTGCCC 
TCGGGTGGAC 
TCATGTGAGC 
GGACGGCTCA 
GGTCAGCACC 
TCCCCTGCCC 
AGCCTCCTGC 
AGGCACAACG 
CCACTTGGTG 
TGGCCTGCTC 
CTCTGTGAGG 
CAAGTGCCTG 
TCTGCCCAGT 
GCACAGCGGC 
CACTGTGGAT 
CTCGGTGGTG 
GGTGCTCATG 
GGAGCTGACC 
CAGGAGCCAG 
GGACAACAGT 
GACCACGGTG 
CGAGGAGGAG 
GAATGGGACC 
CCTGGTCTGA 
GATTTTAGCT 
CATCCCACTG 
AATTGAGTCT 
GTGTTGACTG 
TGGTGTGTAT 
TTGAGTGGTT 
GACCTCTGCC 
GGTTGGAGGA 
TGGAATCTGC 
GTGTGAAGCA 
CCTCTGGTGG 
GAGCTTCTTG 
CCCTTTCCAT 
TCCAGCCTGG 



41 

I 

GATCACGGCT 
TGGGTCCCCT 
GGGTCAGTTC 
AAACGCTGGG 
CCTGAGGCCT 
GGTGAGCTGG 
TGCTTCTACC 
GCGGGCGAAG 
CCGGCTTACG 
GTGCTCCTGC 
TTCCCCGCCG 
TCACTGAAT C 
ACAGCTGAGG 
TCCAGCCGTT 
CCTAGCCGCA 
CAGGACCAAA 
GGCCTTGAAG 
AGTGAAGGGC 
GGGGTACGAG 
ATCTACGTCT 
GTTCTTGACC 
GTGGTGGGTG 
TCCCGATACC 
CTGACCAGGG 
CCGGAGGAGA 
AGCTGCTCTG 
AGGGAGATAG 
GAAGATCAGG 
CTACGGGCCA 
CCCAGGCCTG 
CATCTTGGGG 
ACTGCTTGAC 
CTCCCACCAT 
ACTGTGTGTG 
TATGCTGTCA 
GCGTGGGCAA 
TGAAAAAGCA 
GAGAGGTGGA 
CTCCGGTGTG 
GCCAGTCCCT 
CCTCTGGGCC 
CAGGAATACT 
TAGTTGTATT 
ACGATATAGC 



51 

1 

TCTTGGGGGT 
AGTGGAGACC 
CTTATTCAAG 
CAGTCTGCCT 
GGCTGCTGCT 
AGACCTCAGA 
GAGGGGACT C 
GCGCCCAGGA 
AGGGCCGCGT 
GCAACGCAGT 
GCAGCTTCCA 
CTGGTCCAGC 
GCAGCCCAGC 
CCTTCAAGCA 
GCATGAATGG 
GGATCACCCA 
ACCAAAATCT 
AGCCCCCTCC 
TGGATGGGGA 
GCCATGTCAG 
CCCAGGAAGA 
TGATCGCCGC 
ATCGGCGCAA 
AGAACTCCAT 
GTGTAGGGCT 
TGATGAGTGA 
AAACACAGAC 
AT G AAGG CAT 
AGCCCACGGG 
CCTCCCTTCC 
GCCTCCTTAA 
CTTTACCTCC 
GCATGCAGGT 
TGTGGAGGGG 
TAT CAGAGTC 
CACTGTCAGG 
GGTATTTTCT 
GACTGTGGCT 
AGGGAACCTG 
GGGTCAGCCA 
TGCTGCATGT 
GCTCCGAATC 
TTTTATTTAT 
CAGACCCTGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
162 0* 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
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CTGTAAAAAA ACCAAAACCC AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 425 Protein sequence 
Protein Accession #: AAH10423 

1 11 21 31 41 51 

I l l I I l 

MPLSLGAEMW GPEAWLLLLL LLASFTGRCP AGELETSDW TWLGQDAKL PCFYRGDSGE 60 

QVGQVAWARV DAGEGAQELA LLHSKYGLHV SPAYEGRVEQ PPPPRNPLDG SVLLRNAVQA 120 

DEGEYECRVS TFPAGSFQAR LRLRVLVPPL PSLNPGPALE EGQGLTLAAS CTAEGSPAPS 180 

VTWDTEVKGT TSSRSFKHSR SAAVTSEFHL VPSRSMNGQP LTCWSHPGL LQDQRITHIL 240 

HVSFLAEASV RGLEDQNLWH IGREGAMLKC LSEGQPPPSY NWTRLDGPLP SGVRVDGDTL 300 

GFPPLTTEHS GIYVCHVSNE FSSRDSQVTV DVLDPQEDSG KQVDLVSASV VWGVIAALL 3 60 

FCIiLVWWL MSRYHRRKAQ QMTQKYEEEL TLTRENSIRR LHSHHTDPRS QPEESVGLRA 420 

EGHPDSLKDN SSCSVMSEEP EGRSYSTLTT VREIETQTEL LSPGSGRAEE EEDQDEGIKQ 480 
AMNHFVQENG TLRAKPTGNG IYINGRGHLV 

Seq ID NO: 426 DNA sequence 

Nucleic Acid Accession #: NM_003474.2 

Coding sequence: 3 7.. 3 03 6 

1 11 21 31 41 51 

I I I I 1 I 

CACTAACGCT CTTCCTAGTC CCCGGGCCAA CT CGGACAGT TTGCTCATTT ATTGCAACGG 60 

TCAAGGCTGG CTTGTGCCAG AACGGCGCGC GCGCGACGCA CGCACACACA CGGGGGGAAA 12 0 

CTTTTTTAAA AATGAAAGGC TAGAAGAGCT CAGCGGCGGC GCGGGCCGTG CGCGAGGGCT 180 

CCGGAGCTGA CTCGCCGAGG CAGGAAATCC CTCCGGTCGC GACGCCCGGC CCCGCTCGGC 240 

GCCCGCGTGG GATGGTG CAG CGCTCGCCGC CGGGCCCGAG AGCTGCTGCA CTGAAGGCCG 300 

GCGACGATGG CAGCGCGCCC GCTGCCCGTG TCCCCCGCCC GCGCCCTCCT GCTCGCCCTG 360 

GCCGGTGCTC TGCTCGCGCC CTGCGAGGCC CGAGGGGTGA GCTTATGGAA CGAAGGAAGA 420 

GCTGATGAAG TTGTCAGTGC CTCTGTTCGG AGTGGGGACC TCTGGATCCC AGTGAAGAGC 4 80 

TTCGACTCCA AGAATCATCC AGAAGTGCTG AATATTCGAC TACAACGGGA AAGCAAAGAA 540 

CTGATCATAA ATCTGGAAAG AAATGAAGGT CTCATTGCCA GCAGTTTCAC GGAAACCCAC 600 

TATCTGCAAG ACGGTACTGA TGTCTCCCTC GCTCGAAATT ACACGGTAAT TCTGGGTCAC 660 

TGTTACTACC ATGGACATGT ACGGGGATAT TCTGATTCAG CAGTCAGTCT CAGCACGTGT 72 0 

TCTGGTCTCA GGGGACTTAT TGTGTTTGAA AATGAAAGCT ATGTCTTAGA ACCAATGAAA 7 80 

AGTGCAACCA ACAGATACAA ACTCTTCCCA GCGAAGAAGC TGAAAAGCGT CCGGGGATCA 840 

TGTGGATCAC ATCACAACAC ACCAAACCTC GCTGCAAAGA ATGTGTTTCC ACCACCCTCT 900 

CAGACATGGG CAAGAAGGCA TAAAAGAGAG ACCCTCAAGG CAACTAAGTA TGTGGAGCTG 960 

GTGAT CGTGG CAGACAACCG AGAGTTTCAG AGGCAAGGAA AAGAT CTGGA AAAAGTTAAG 1020 

CAGCGATTAA TAGAGATTGC TAATCACGTT GACAAGTTTT ACAGACCACT GAACATTCGG 10 80 

ATCGTGTTGG TAGGCGTGGA AGTGTGGAAT GACATGGACA AATGCTCTGT AAGT CAGG AC 1140 

CCATTCACCA GCCTCCATGA ATTTCTGGAC TGGAGGAAGA TGAAGCTTCT ACCTCGCAAA 12 0 0 

TCCCATGACA ATGCGCAGCT TGTCAGTGGG GTTTATTTCC AAGGGACCAC CATCGGCATG 1260 

GCCCCAATCA TGAGCATGTG CACGGCAGAC CAGTCTGGGG GAATTGTCAT GGACCATTCA 1320 

GACAATCCCC TTGGTGCAGC CGTGACCCTG GCACATGAGC TGGGCCACAA TTTCGGGATG 13 80 

AATCATGACA CACTGGACAG GGGCTGTAGC TGT CAAATGG CGGTTGAGAA AGGAGGCTGC 1440 

AT CATGAACG CTTCCACCGG GTACCCATTT CCCATGGTGT TCAGCAGTTG CAGCAGGAAG 1500 

GACTTGGAGA CCAGCCTGGA GAAAGGAATG GGGGTGTGCC TGTTTAACCT GCCGGAAGTC 1560 

AGGGAGTCTT TCGGGGGCCA GAAGTGTGGG AACAGATTTG TGGAAGAAGG AGAGGAGTGT 162 0 

GACTGTGGGG AGCCAGAGGA ATGTATGAAT CGCTGCTGCA ATGCCACCAC CTGTACCCTG 1680 

AAGCCGGACG CTGTGTGCGC ACATGGGCTG TGCTGTGAAG ACTGCCAGCT GAAGCCTGCA 1740 

GGAACAGCGT GCAGGGACTC CAGCAACTCC TGTGACCTCC CAGAGTTCTG CACAGGGGCC 1800 

AGCCCTCACT GCCCAGCCAA CGTGTACCTG CACGATGGGC ACTCATGTCA GGATGTGGAC 1860 

GGCTACTGCT ACAATGGCAT CTGCCAGACT CACGAGCAGC AGTGTGT CAC ACTCTGGGGA 192 0 

CCAGGTGCTA AACCTGCCCC TGGGATCTGC TTTGAGAGAG TCAATTCTGC AGGTGATCCT 1980 

TATGGCAACT GTGGCAAAGT CTCGAAGAGT TCCTTTGCCA AATGCGAGAT GAGAGATGCT 2 04 0 

AAATGTGGAA AAATCCAGTG TCAAGGAGGT GCCAGCCGGC CAGTCATTGG TACCAATGCC 2100 

GTTTCCATAG AAACAAACAT CCCCCTGCAG CAAGGAGGCC GGATTCTGTG CCGGGGGACC 2160 

CACGTGTACT TGGGCGATGA CATGCCGGAC CCAGGGCTTG TGCTTGCAGG CACAAAGTGT 222 0 

GCAGATGGAA AAATCTGCCT GAATCGTCAA TGTCAAAATA TTAGTGTCTT TGGGGTTCAC 2280 

GAGTGTGCAA TGCAGTGCCA CGGCAGAGGG GTGTGCAACA ACAGGAAGAA CTGCCACTGC 2 340 

GAGGCCCACT GGGCACCTCC CTTCTGTGAC AAGTTTGGCT TTGGAGGAAG CACAGACAGC 240 0 

GGCCCCATCC GGCAAGCAGA TAACCAAGGT TTAACCATAG GAATTCTGGT GACCATCCTG 2460 

TGTCTTCTTG CTGCCGGATT TGTGGTTTAT CT C AAAAGGA AGACCTTGAT ACGACTGCTG 252 0 

TTTACAAATA AGAAGACCAC CATTGAAAAA CTAAGGTGTG TGCGCCCTTC CCGGCCACCC 2580 

CGTGGCTTCC AACCCTGTCA GGCTCACCTC GGCCACCTTG GAAAAGGCCT GATGAGGAAG 2 640 

CCGCCAGATT CCTACCCACC GAAGGACAAT CCCAGGAGAT TGCTGCAGTG TCAGAATGTT 2700 

GACATCAGCA GACCCCTCAA CGGCCTGAAT GTCCCTCAGC CCCAGTCAAC TCAGCGAGTG 2 760 

CTTCCTCCCC TCCACCGGGC CCCACGTGCA CCTAGCGTCC CTGCCAGACC CCTGCCAGCC 2820 

AAGCCTGCAC TTAGGCAGGC CCAGGGGACC TGTAAGCCAA ACCCCCCTCA GAAGCCTCTG 2880 

CCTGCAGATC CTCTGGCCAG AACAACTCGG CTCACTCATG CCTTGGCCAG GACCCCAGGA 2 940 

CAATGGGAGA CTGGGCTCCG CCTGGCACCC CTCAGACCTG CTCCACAATA TCCACACCAA 3000 

GTG CCCAGAT CCACCCACAC CGCCTATATT AAGTGAGAAG CCGACACCTT TTTTCAACAG 3060 

TGAAGACAGA AGTTTGCACT ATCTTTCAGC TCCAGTTGGA GTTTTTTGTA CCAACTTTTA 312 0 

GGATTTTTTT TAATGTT TAA AACATCATTA CTATAAGAAC TTTGAGCTAC TGCCGTCAGT 3180 

GCTGTGCTGT GCTATGGTGC TCTGTCTACT TGCACAGGTA CTTGTAAATT ATTAATTTAT 3240 

GCAGAATGTT GATTACAGTG CAGTGCGCTG TAGTAGGCAT TTTTACCATC ACTGAGTTTT 330 0 

CCATGGCAGG AAGGCTTGTT GTGCTTTTAG TATTTTAGTG AACTTGAAAT ATCCTGCTTG 3360 

ATGGGATTCT GGACAGGATG TGTTTGCTTT CTGAT CAAGG CCTTATTGGA AAGCAGTCCC 3420 

CCAACTACCC CCAGCTGTGC TTATGGTACC AGATGCAGCT CAAGAGATCC CAAGTAGAAT 3480 

CT CAGTTGAT TTTCTGGATT CCCCATCTCA GGCCAGAGCC AAGGGGCTTC AGGTCCAGGC 3540 

TGTGTTTGGC TTTCAGGGAG GCCCTGTGCC CCTTGACAAC TGG CAGG CAG GCTCCCAGGG 3 600 

ACACCTGGGA GAAATCTGGC TTCTGGCCAG GAAGCTTTGG TGAGAACCTG GGTTGCAGAC 3660 

AGGAATCTTA AGGTGTAGCC ACACCAGGAT AGAGACTGGA ACACTAGACA AGCCAGAACT 3720 

TGACCCTGAG CTGACCAGCC GTGAGCATGT TTGGAAGGGG TCTGTAGTGT CACTCAAGGC 3 780 

GGTGCTTGAT AGAAATGCCA AGCACTTCTT TTTCTCGCTG TCCTTTCTAG AGCACTGCCA 3 840 



345 
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CCAGTAGGTT ATTTAGCTTG GGAAAGGTGG TGTTTCTGTA AGAAACCTAC TGCCCAGGCA 3900 

CTGCAAACCG CCACCTCCCT ATACTGCTTG GAGCTGAGCA AATCACCACA AACTGTAATA 3960 

CAATGATCCT GTATTCAGAC AGATGAGGAC TTTCCATGGG ACCACAACTA TTTTCAGATG 402 0 

TGAACCATTA ACCAGATCTA GTCAATCAAG TCTGTTTACT GCAAGGTTCA ACTTATTAAC 4080 

AATTAGGCAG ACTCTTTATG CTTGCAAAAA CTACAACCAA TGGAATGTGA TGTTCATGGG 4140 

TATAGTT CAT GTCTGCTATC ATTATTCGTA GATATTGGAC AAAGAACCTT CTCTATGGGG 4200 

CATCCTCTTT TTCCAACTTG GCTGCAGGAA TCTTTAAAAG ATGCTTTTAA CAGAGTCTGA 4260 

ACCTATTTCT TAAACACTTG CAACCTACCT GTTGAGCATC ACAGAATGTG ATAAGGAAAT 4320 

CAACTTGCTT ATCAACTTCC TAAATATTAT GAGATGTGGC TTGGGCAGCA TCCCCTTGAA 4380 

CTCTTCACTC TTCAAATGCC TGACTAGGGA GCCATGTTTC ACAAGGTCTT TAAAGTGACT 4440 

AATGGCATGA GAAATACAAA AATACT CAGA TAAGGTAAAA TGCCATGATG CCTCTGTCTT 450 0 

CTGGACTGGT TTTCACATTA GAAGACAATT GACAACAGTT ACATAATTCA CTCTGAGTGT 4560 

TTTATGAGAA AGCCTTCTTT TGGGGT CAAC AGTTTTCCTA TGCTTTGAAA CAGAAAAATA 462 0 

TGTACCAAGA ATCTTGGTTT GCCTTCCAGA AAACAAAACT GCATTTCACT TTCCCGGTGT 4680 

TCCCCACTGT ATCTAGGCAA CATAGTATTC ATGACTATGG ATAAACTAAA CACGTGACAC 4740 

AAACACACAC AAAAG GGAAC CCAGCTCTAA TACATTCCAA CTCGTATAGC ATGCATCTGT 4800 

TTATTCTATA GTTATTAAGT TCTTTAAAAT GTAAAGCCAT GCTGGAAAAT AATACTGCTG 48 60 

AGATACATAC AGAATTACTG TAACTGATTA CACTTGGTAA TTGTACTAAA GCCAAACATA 4 92 0 

TATATACTAT TAAAAAGGTT TACAGAATTT TATGGTGCAT TACGTGGGCA TTGTCTTTTT 4 980 

AGATGCCCAA ATCCTTAGAT CTGGCATGTT AGCCCTTCCT CCAATTATAA GAGGATATGA 5040 
ACCAAAAAAA AAAAAAAAAA AA 



Seq ID NO i 427 Protein sequence 
Protein Accession #: NP_003465 

1 11 21 31 41 51 

I I I J I 1 

MAARPLPVSP ARALLLALAG ALLAPCEARG VSLWNEGRAD EWSASVRSG DLWIPVKSFD 60 

SKNHPEVLNI RLQRESKELI INLERNEGLI ASSFTETHYL QDGTDVSLAR NYTVILGHCY 120 

YHGHVRGYSD SAVSLSTCSG LRGLIVFENE SYVLEPMKSA TNRYKLFPAK KLKSVRGSCG 180 

SHHNTPNLAA KNVFPPPSQT WARRHKRETL KATKYVELVI VADNREFQRQ GKDLEKVKQR 240 

LIEIANHVDK FYRPLNIRIV LVGVEVWNDM DKCSVSQDPF TSLHEFLDWR KMKLLPRKSH 3 00 

DNAQLVSGVY FQGTTIGMAP IMSMCTADQS GG I VMDHSDN PLGAAVTLAH ELGHNFGMNH 3 60 

DTLDRGCSCQ MAVEKGGCIM NASTGYPFPM VFSSCSRKDL ETSLEKGMGV CLFNLPEVRE 42 0 

SFGGQKCGNR FVEEGEECDC GEPEECMNRC CNATTCTLKP DAVCAHGLCC EDCQLKPAGT 480 

ACRDSSNSCD LPEFCTGASP HCPANVYLHD GHSCQDVDGY CYNGICQTHE QQCVTLWGPG 540 

AKPAPGICFE RVNSAGDPYG NCGKVSKSSF AKCEMRDAKC GKIQCQGGAS RPVIGTNAVS 600 

IETNIPLQQG GRILCRGTHV YLGDDMPDPG LVLAGTKCAD GKICLNRQCQ NISVFGVHEC 660 

AMQCHGRGVC NNRKNCHCEA HWAPPFCDKF GFGGSTDSGP IRQADNQGLT IGILVTILCL 720 

liAAGFWYLK RKTLIRDLFT NKKTT I EKLR CVRPSRPPRG FQPCQAHLGH LGKGLMRKPP 780 

DSYPPKDNPR RLLQCQNVDI SRPLNGLNVP QPQSTQRVLP PLHRAPRAPS VPARPLPAKP 840 

ALRQAQGTCK PNPPQKPLPA DPLARTTRLT HALARTPGQW ETGLRLAPLR PAPQYPHQVP 900 
RSTHTAYIK 



Seq ID NO: 42 8 DNA sequence 

Nucleic Acid Accession #: NM_003714 

Coding sequence: 135.. 1043 

1 11 21 31 41 51 

I 1 I I I 1 

GAGGAGGAGG GAAAAGGCGA G C AAAAAGG A AGAGTGGGAG GAGGAGGGGA AGCGGCGAAG 60 

GAGGAAGAGG AGGAGGAGGA AGAGGGGAGC ACAAAGGATC CAGGTCTCCC GACGGGAGGT 120 

TAATACCAAG AACCATGTGT GCCGAGCGGC TGGGCCAGTT CATGACCCTG GCTTTGGTGT 180 

TGGCCACCTT TGACCCGGCG CGGGGGACCG ACGCCACCAA CCCACCCGAG GGTCCCCAAG 240 

ACAGGAGCTC CCAGCAGAAA GGCCGCCTGT CCCTGCAGAA TACAGCGGAG ATCCAGCACT 300 

GTTTGGTCAA CGCTGGCGAT GTGGGGTGTG GCGTGTTTGA ATGTTTCGAG AACAACTCTT 360 

GTGAGATTCG GGGCTTACAT GGGATTTGCA TGACTTTTCT GCACAACGCT GGAAAATTTG 420 

ATGCCCAGGG CAAGTCATTC ATCAAAGACG CCTTGAAATG TAAGGCCCAC GCTCTGCGGC 480 

ACAGGTTCGG CTGCATAAGC CGGAAGTGCC CGGCCATCAG GGAAATGGTG TCCCAGTTGC 540 

AGCGGGAATG CTACCTCAAG CACGACCTGT GCGCGGCTGC CCAGGAGAAC ACCCGGGTGA 600 

TAGTGGAGAT GATCCATTTC AAGGACTTGC TGCTGCACGA ACCCTACGTG GACCTCGTGA 660 

ACTTGCTGCT GACCTGTGGG GAGGAGGTGA AGGAGGCCAT CACCCACAGC GTGCAGGTTC 720 

AGTGTGAGCA GAACTGGGGA AGCCTGTGCT CCATCTTGAG CTTCTGCACC TCGGCCATCC 780 

AGAAGCCTCC CACGGCGCCC CCCGAGCGCC AGCCCCAGGT GGACAGAACC AAGCTCTCCA 840 

GGGCCCACCA CGGGGAAGCA GGACATCACC TCCCAGAGCC CAGCAGTAGG GAGACTGGCC 900 

GAGGTGCCAA GGGTGAGCGA GGTAGCAAGA GCCACCCAAA CGCCCATGCC CGAGGCAGAG 960 

TCGGGGGCCT TGGGGCTCAG GGACCTTCCG GAAGCAGCGA GTGGGAAGAC GAACAGTCTG 1020 

AGTATTCTGA TATCCGGAGG TGAAATGAAA GGCCTGGCCA CGAAATCTTT CCTCCACGCC 1080 

GTCCATTTTC TTATCTATGG ACATTCCAAA ACATTTACCA TTAGAGAGGG GGGATGTCAC 1140 

ACGCAGGATT CTGTGGGGAC TGTGGACTTC ATCGAGGTGT GTGTTCGCGG AACGGACAGG 1200 

TGAGATGGAG ACCCCTGGGG CCGTGGGGTC TCAGGGGTGC CTGGTGAATT CTGCACTTAC 12 60 

ACGTACTCAA GGGAGCGCGC CCGCGTTATC CTCGTACCTT TGTCTTCTTT CCATCTGTGG 1320 

AGTCAGTGGG TGTCGGCCGC TCTGTTGTGG GGGAGGTGAA CCAGGGAGGG GCAGGGCAAG 1380 

GCAGGGCCCC CAGAGCTGGG CCACACAGTG GGTGCTGGGC CTCGCCCCGA AGCTTCTGGT 1440 

GCAGCAGCCT CTGGTGCTGT CTCCGCGGAA GTCAGGGCGG CTGGATTCCA GGACAGGAGT 1500 

GAATGTAAAA ATAAATATCG CTTAGAATGC AGGAGAAGGG TGGAGAGGAG GCAGGGGCCG 1560 

AGGGGGTGCT TGGTGCCAAA CTGAAATTCA GTTTCTTGTG TGGGGCCTTG CGGTTCAGAG 1620 

CTCTTGGCGA GGGTGGAGGG AGGAGTGTCA TTTCTATGTG TAATTTCTGA GCCATTGTAC 1680 

TGTCTGGGCT GGGGGGGACA CTGTCCAAGG GAGTGGCCCC TATGAGTTTA TATTTTAACC 1740 

ACTGCTTCAA ATCTCGATTT CACTTTTTTT ATTTATCCAG TTATATCTAC ATATCTGTCA 1800 

TCTAAATAAA TGGCTTTCAA ACAAAGCAAC TGGGTCATTA AAACCAGCTC AAAGGGGGTT 1860 

TAAAAAAAAA AAAACCAGCC CATCCTTTGA GGCTGATTTT TCTTTTTTTT AAGTTCTATT 1920 

TTAAAAG CT A TCAAACAGCG ACATAGCCAT ACATCTGACT GCCTGACATG GACTCCTGCC 1980 

CACTTGGGGG AAACCTTATA CCCAGAGGAA AATACACACC TGGGGAGTAC ATTTGACAAA 2040 

TTTCCCTTAG GATTTCGTTA TCTCACCTTG ACCCTCAGCC AAGATTGGTA AAGCTGCGTC 2100 

CTGGCGATTC CAGGAGACCC AGCTGGAAAC CTGGCTTCTC CATGTGAGGG GATGGGAAAG 2160 

GAAAGAAGAG AATGAAGACT ACTTAGTAAT TCCCATCAGG AAATGCTGAC CTTTTACATA 2220 
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AAATCAAGGA GACTGCTGAA AATCTCTAAG GGACAGGATT TTCCAGATCC TAATTGGAAA 
TTTAGCAATA AGGAGAGGAG TCCAAGGGGA CAAATAAAGG CAGAGAGAGA GAGAGAGAGA 
GGGAGAGGAA GAAAAGAGAG AGAGAAAAGA GCCTCGTGCC 



2280 
2340 



PCT/US02/12476 



Seq ID NO: 42 9 Protein sequence 
Protein Accession #: NP 003705 



10 



15 



MCAERLGQFM 
GDVGCGVFEC 
ISRKCPAIRE 
CGEEVKEAIT 
EAGHHLPEPS 
RR 



11 

1 

TLALVLATFD 
FENNSCEIRG 
MVSQLQRECY 
HSVQVQCEQN 
SRETGRGAKG 



21 
I 

PARGTDATNP 
LHGICMTFLH 
LKHDLCAAAQ 
WGSLCSILSF 
ERGSKSHPNA 



31 

i 

PEGPQDRSSQ 
NAGKFDAQGK 
ENTRVIVEMI 
CTSAIQKPPT 
HARGRVGGLG 



41 
I 

QKGRLSLQNT 
SFIKDALKCK 
HFKDLLLHEP 
APPERQPQVD 
AQGPSGSSEW 



51 

! 

AEIQHCLVNA 
AHALRHR FGC 
YVDLVNLLLT 
RTKL SRAHHG 
EDEQSEYSDI 



60 
120 
180 
240 
300 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Seq ID NO: 430 DNA sequence 

Nucleic Acid Accession #: NM_005940 

Coding sequence: 23.. 1489 



AAGCCCAGCA 
CGCCCTCCTG 
TCTGCCGCCG 
AGCCCTGCCC 
CAGCCTCAGG 
CCGACAGAAG 
GATCCTTCGG 
CCTAAAGGTA 
TGACAT CATG 
TGGGGGCATC 
CGACTATGAT 
AGCCCATGAA 
GTCCGCCTTC 
TCAACACCTA 
CCAGGCTGGG 
CTGTGAGGCC 
GGGCTTTGTG 
TCGCCACTGG 
CATTTGGTTC 
CCCCGCACCC 
GGGTCCCGAG 
CAGCACCCGG 
CTCTGAGATC 
CCTCTACTGG 
GGGTCCTGAC 
ATGCCCTCAG 
ATCTTTGTGG 
GGTGGGGTAC 
AGCGACTGTC 
GGGACCCGCT 
GTAGCACCAT 
TCCTTCCAGG 
TGAGCAACTG 
ATCTGTCTGC 
GTTCACAGTC 
CAACATACCT 
ATCCTCCAAA 
TTTTTAAACT 



11 

I 

GCCCCGGGGC 
CCCCCGATGC 
GACGTCCACC 
AGTAGCCCGG 
CCTCCCCGCT 
AGGTTCGTGC 
TTCCCATGGC 
TGGAGCGATG 
ATCGACTTCG 
CTGGCCCATG 
GAGACCTGGA 
TTTGGCCACG 
TACACCTTTC 
TATGGCCAGC 
ATAGACACCA 
TCCTTTGACG 
TGGCGCCTCC 
CAGGGACTGC 
TTCCAAGGTG 
CTCACCGAGC 
AAGAACAAGA 
CGTGTAGACA 
GACGCTGCCT 
AAGTTTGACC 
TTCTTTGGCT 
GGGTGCTGAC 
CTGTGGGGAC 
AACCACCATG 
TCAGACTGGG 
ATGCAGGTCC 
GGCAGGACTG 
GGCTGGCACT 
GGCTGTAGGG 
CTTCTGGCTG 
AAATGGGGAG 
CAATCCTGTC 
GCCATTGTAA 
GAGGATTGTG 



21 
I 

GGATGGCTCC 
TGCTGCTGCT 
ACCTCCATGC 
CACCTGCCCC 
GTGGCGTGCC 
TTTCTGGCGG 
AGTTGGTGCA 
TGACGCCACT 
CCAGGTACTG 
CCTTCTTCCC 
CTATCGGGGA 
TGCTGGGGCT 
GCTACCCACT 
CCTGGCCCAC 
ATGAGATTGC 
CGGTCTCCAC 
GTGGGGGCCA 
CCAGCCCTGT 
CTCAGTACTG 
TGGGCCTGGT 
TCTACTTCTT 
GTCCCGTGCC 
TCCAGGATGC 
CTGTGAAGGT 
GTGCCGAGCC 
CCCTGCCAGG 
CAGGCATGGG 
ACAACTGCCG 
CAGGGAGGCT 
TGGCAAACCT 
GGGGAACTGG 
GAAG CAAGGG 
CAGGGCCACT 
ACAAT CCTGG 
GGGTATTCTT 
CCAGGCCGGA 
ATGTGTGTAC 
ATTAAACACA 



31 
1 

GGCCGCCTGG 
GCTCCAGCCG 
CGAGAGGAGG 
TGCCACGCAG 
CGACCCATCT 
GCGCTGGGAG 
GGAGCAGGTG 
CACCTTTACT 
GCATGGGGAC 
CAAGACTCAC 
TGACCAGGGC 
GCAGCACACA 
GAGTCTCAGC 
TGTCACCTCC 
ACCGCTGGAG 
CATCCGAGGC 
GCTGCAGCCC 
GGACGCTGCC 
GGTGTACGAC 
GAGGTTCCCG 
CCGAGGCAGG 
CCGCAGGGCC 
TGATGGCTAT 
GAAGGCTCTG 
TGCCAACACT 
CCACGAATAT 
ACTGAGCCCA 
GGAGGGCCAC 
TTGGCATGAC 
GGCTGCCCTG 
AGTGTCCTTG 
TGCTGGGGCC 
TCCTGAGGTC 
AAATCTGTTC 
CATGCAGGAG 
TCCTCCTGAA 
AGTGTGTATA 
GTTGTTTTCT 



41 

CTCCGCAGCG 
CCGCCGCTGC 
GGGCCACAGC 
GAAGCCCCCC 
GATGGGCTGA 
AAGACGGACC 
CGGCAGACGA 
GAGGTG CACG 
GACCTGCCGT 
CGAGAAGGGG 
ACAGACCTGC 
ACAGCAG CCA 
CCAGATGACT 
AGGACCCCAG 
CCAGACGCCC 
GAGCTCTTTT 
GGCTACCCAG 
TTCGAGGATG 
GGTGAAAAGC 
GTCCATGCTG 
GACTACTGGC 
ACTGACTGGA 
GCCTACTTCC 
GAAGGCTTCC 
TTCCTCTGAC 
CAGGCTAGAG 
TGTCTCCTGC 
GCAGGT CGTG 
TTAAGAGGAA 
TCTCATCCCT 
CTGTATCCCT 
CCATGGCCTT 
AGGTCTTGGT 
TCCAGAATCC 
ACCCCAGGCC 
GCCCTTTTCG 
AACCTTCTTC 



51 
I 

CGGCCGCGCG 
TGGCCCGGGC 
CCTGGCATGC 
GGCCTGCCAG 
GTGCCCGCAA 
TCACCTACAG 
TGGCAGAGGC 
AGGGCCGTGC 
TTGATGGGCC 
ATGTCCACTT 
TGCAGGTGGC 
AGGCCCTGAT 
GCAGGGGCGT 
CCCTGGGCCC 
CGCCAGATGC 
TCTTCAAAGC 
CATTGGCCTC 
CCCAGGGCCA 
CAGTCCTGGG 
CCTTGGTCTG 
GTTTCCACCC 
GAGGGGTGCC 
TGCGCGGCCG 
CCCGTCTCGT 
CATGGCTTGG 
ACCCATGGCC 
AGGGGGATGG 
GTCACCTGCC 
GGGCAGTCTT 
GTCCCTCAGG 
GTTGTGAGGT 
CAGCCCTGGC 
AGGTGCCTGC 
AGGCCAAAAA 
CTGGAGGCTG 
CAGCACTGCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 



Seq ID NO: 431 Protein sequence 
Protein Accession #: NP_005931 



65 



70 



75 



MAPAAWLRSA 
PAPATQEAPR 
LVQEQVRQTM 
FFPKTHREGD 
YPLSLSPDDC 
VSTIRGELFF 
QYWVYDGEKP 
PVPRRATDWR 
AEPANTFL 



11 

I 

AARALLPPML 
PASSLRPPRC 
AEALKVWSDV 
VHFDYDETWT 
RGVQHLYGQP 
FKAGFVWRLR 
VLGPAPLTEL 
GVPSEIDAAF 



21 
1 

LLLLQPPPLL 
GVPDPSDGLS 
TPLTFTEVHE 
IGDDQGTDLL 
WPTVTSRTPA 
GGQLQPGYPA 
GLVRFPVHAA 
QDADGYAYFL 



31 
I 

ARAL P PDVHH 
ARNRQKRFVL 
GRADIMIDFA 
QVAAHEFGHV 
LGPQAGIDTN 
LASRHWQGIiP 
I>VWGPEKNKI 
RGRLYWKFDP 



41 

I 

LHAERRGPQP 
SGGRWEKTDL 
RYWHGDDLPF 
LGLQHTTAAK 
EIAPLEPDAP 
SPVDAAFEDA 
YFFRGRDYWR 
VKVKALEGFP 



51 

i 

WHAALPSSPA 
TYRILRFPWQ 
DGPGGILAHA 
ALMSAFYTFR 
PD ACE AS FDA 
QGHIWFFQGA 
FHPSTRRVDS 
RLVGPDFFGC 



60 
120 
180 
240 
300 
360 
420 
480 



80 



85 



Seq ID NO: 432 DNA sequence 

Nucleic Acid Accession #: NM_024022 

Coding sequence: 202.. 1563 



ACCGGGCACC 
GGAAAGGGCT 
CCATCTACAT 
AGAGGTCCTG 
TCATTCCGAT 



11 
I 

GGACGGCTCG 
GTGTTTATGG 
TTTTGGGACT 
AAATAGT CAC 
CGCTTTTTGG 



21 
I 

GGTACTTTCG 
GAAGCCAGTA 
CGGGAATTAT 
CATGGGGGAA 
CCTTGATGAT 



31 
I 

TTCTTAATTA 
ACACTGTGGC 
GAGGTAGAGG 
AATGATCCGC 
TTGAAAATAA 



41 
i 

GGTCATGCCC 
CTACTATCTC 
TGGAGGCGGA 
CTGCTGTTGA 
GTCCTGTTGC 



51 

I 

GTGTGAGCCA 
TTCCGTGGTG 
GCCGGATGTC 
AGCCCCCTTC 
ACCAGATGCA 



60 
120 
180 
240 
300 
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GATGCTGTTG CTGCACAGAT CCTGTCACTG CTGCCATTGA AGTTTTTTCC AATCATCGTC 3 60 

ATTGGGATCA TTGCATTGAT ATTAGCACTG GCCATTGGTC TGGGCATCCA CTTCGACTGC 42 0 

TCAGGGAAGT ACAGATGTCG CTCATCCTTT AAGTGTATCG AGCTGATAGC TCGATGTGAC 4 80 

GGAGTCTCGG ATTGCAAAGA CGGGGAGGAC GAGTACCGCT GTGT CCGGGT GGGTGGTCAG 540 

AATGCCGTGC TCCAGGTGTT CACAGCTGCT TCGTGGAAGA CCATGTGCTC CGATGACTGG 60 0 

AAGGGTCACT ACGCAAATGT TGCCTGTGCC CAACTGGGTT TCCCAAGCTA TGTGAGTTCA 660 

GATAACCTCA GAGTGAGCTC GCTGGAGGGG CAGTTCCGGG AGGAGTTTGT GTCCATCGAT 720 

CACCTCTTGC CAGATGACAA GGTGACTGCA TTACACCACT CAGTATATGT GAGGGAGGGA 780 

TGTGCCTCTG GCCACGTGGT ( TACCTTGCAG TGCACAGCCT GTGGTCATAG AAGGGGCTAC 840 

AGCTCACGCA TCGTGGGTGg' AAACATGTCC TTGCTCTCGC AGTGGCCCTG GCAGGCCAGC 900 

CTTCAGTTCC AGGGCTACCA CCTGTGCGGG GGCTCTGTCA TCACGCCCCT GTGGATCATC 960 

ACTGCTGCAC ACTGTGTTTA TGACTTGTAC CTCCCCAAGT CATGGACCAT CCAGGTGGGT 102 0 

CTAGTTTCCC TGTTGGACAA TCCAGCCCCA TCCCACTTGG TGGAGAAGAT TGTCTACCAC 10 80 

AG CAAGTACA AGCCAAAGAG GCTGGGCAAT GACATCGCCC TTATGAAGCT GGCCGGGCCA 1140 

CTCACGTTCA ATGAAATGAT CCAGCCTGTG TGCCTGCCCA ACTCTGAAGA GAACTTCCCC 12 00 

GATGGAAAAG TGTGCTGGAC GTCAGGATGG GGGG CCACAG AGGATGGAGG TGACGCCTCC 1260 

CCTGTCCTGA ACCACGCGGC CGTCCCTTTG ATTTCCAACA AGATCTGCAA CCACAGGGAC 1320 

GTGTACGGTG GCATCATCTC CCCCTCCATG CTCTGCGCGG GCTACCTGAC GGGTGGCGTG 13 80 

GACAGCTGCC AGGGGGACAG CGGGGGGCCC CTGGTGTGTC AAGAGAGGAG GCTGTGGAAG 1440 

TTAGTGGGAG CGACCAGCTT TGGCATCGGC TGCG CAGAGG TGAACAAGCC TGGGGTGTAC 1500 

ACCCGTGTCA CCTCCTTCCT GGACTGGATC CACGAGCAGA TGGAGAGAGA CCTAAAAACC 1560 

TGAAGAGGAA GGGGACAAGT AGCCACCTGA GTTCCTGAGG TGATGAAGAC AGCCCGATCC 162 0 

TCCCCTGGAC TCCCGTGTAG GAACCTGCAC ACGAGCAGAC ACCCTTGGAG CTCTGAGTTC 1680 

CGGCACCAGT AGCAGGCCCG AAAGAGGCAC CCTTCCATCT GATTCCAGCA CAACCTTCAA 1740 

GCTGCTTTTT GTTTTTTGTT TTTTTGAGGT GGAGTCTCGG TCTGTTGCCC AGGCTGGAGT 1800 

GCAGTGGCGA AATCCCTGCT CACTGCAGCC TCCGCTTCCC TGGTTCAAGC GATTCTCTTG 1860 

CCTCAGCTTC CCCAGTAGCT GGGACCACAG GTGCCCGCCA CCACACCCAA CTAATTTTTG 192 0 

TATTTTTAGT AGAGACAGGG TTTCACCATG TTGGCCAGGC TGCTCTCAAA CCCCTGACCT 1980 

CAAATGATGT GCCTGCTTCA GCCTCCCACA GTGCTGGGAT TACAGGCATG GGCCACCACG 2 040 

CCTAGCCTCA CGCTCCTTTC TGATCTTCAC TAAGAACAAA AGAAGCAGCA ACTTGCAAGG 2100 

GCGGCCTTTC CCACTGGTCC ATCTGGTTTT CTCTCCAGGG GTCTTGCAAA ATTCCTGACG 2160 

AGATAAGCAG TTATGTGACC TCACGTGCAA AGCCACCAAC AGCCACTCAG AAAAGACGCA 2220 

CCAGCCCAGA AGTGCAGAAC TGCAGTCACT GCACGTTTTC ATCTCTAGGG ACCAGAACCA 2280 

AACCCACCCT TTCTACTTCC AAGACTTATT TTCACATGTG GGGAGGTTAA TCTAGGAATG 2340 

ACTCGTTTAA GGCCTATTTT CATGATTTCT TTGTAGCATT TGGTGCTTGA CGTATTATTG 2400 

TCCTTTGATT C CAAAT AAT A TGTTTCCTTC CCTCAAAAAA AAAAAAAAAA AAAAAAAAAA 2460 
AAAAA 

Seq ID NO: 433 Protein sequence 
Protein Accession #: NP_076927 

1 11 21 31 41 51 

1 1 i 1 t I 

MGENDPPAVE APFSFRSLFG LDDLKISPVA PD AD AVAAQ I LSLLPLKFFP IXVIGIIALI 60 

LALAIGLGIH FDCSGKYRCR SSFKCIELIA RCDGVSDCKD GEDEYRCVRV GGONAVLQVF 120 

TAASWKTMCS DDWKGHYANV ACAQLGFPSY VSSDNLRVSS LEGQFREEFV SIDHLLPDDK 180 

VTALHHSVYV REGCASGHW TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QASLQFQGYH 2 40 

LCGGSVITPL WIITAAHCVY DLYLPKSWTI QVGLVSDLDN PAPSHLVEKI VYHSKYKPKR 300 

LGND I ALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT SGWGATEDGG DAS P VLNHAA 360 

VPLISNKICN HRDVYGGIIS PSMLCAGYLT GGVDSCQGDS GGPLVCQERR LWKLVGATSF 420 
GIGCAEVNKP GVYTRVTSFL DWIHEQMERD LKT 

Seq ID NO: 434 DNA sequence 

Nucleic Acid Accession #: NM_000493.2 

Coding sequence: 97.. 213 9 

1 11 21 31 41 51 

I I 1 1 I I 

CACCTTCTGC ACTGCTCATC TGGGCAGAGG AAG CTTCAGA AAGCTGCCAA GGCACCATCT 60 

CCAGGAACTC CCAGCACGCA GAATCCATCT GAGAATATGC TGCCACAAAT ACCCTTTTTG 120 

CTGCTAGTAT CCTTGAACTT GGTTCATGGA GTGTTTTACG CTGAACGATA CCAAATGCCC 180 

ACAGGCATAA AAGGCCCACT ACCCAACACC AAGACACAGT TCTTCATTCC CTACACCATA 240 

AAGAGTAAAG GTATAGCAGT AAGAGGAGAG CAAGGT ACT C CTGGTCCACC AGGCCCTGCT 300 

GGACCTCGAG GGCACCCAGG TCCTTCTGGA CCACCAGGAA AACCAGGCTA CGGAAGT CCT 3 60 

GGACTCCAAG GAGAGCCAGG GTTGCCAGGA CCACCGGGAC CATCAGCTGT AGGGAAACCA 420 

GGTGTGCCAG GACTCCCAGG AAAACCAGGA GAGAGAGGAC CATATGGACC AAAAGGAGAT 480 

GTTGGACCAG CTGGCCTACC AGGACCCCGG GGCCCACCAG GACCACCTGG AATCCCTGGA 540 

CCGGCTGGAA TTTCTGTGCC AGGAAAACCT GGACAACAGG GACCCACAGG AGCCCCAGGA 60 0 

CCCAGGGGCT TTCCTGGAGA AAAGGGTGCA CCAGGAGTCC CTGGTATGAA TGGACAGAAA 660 

GGGGAAATGG GATATGGTGC TCCTGGTCGT CCAGGTGAGA GGGGTCTTCC AGGCCCTCAG 720 

GGTCCCACAG GACCATCTGG CCCTCCTGGA GTGGG AAAAA GAGGTGAAAA TGGGGTTCCA 780 

GGACAGCCAG GCATCAAAGG TGATAGAGGT TTTCCGGGAG AAATGGGACC AATTGGCCCA 840 

CCAGGTCCCC AAGGCCCTCC TGGGGAACGA GGGCCAGAAG GCATTGGAAA GCCAGGAGCT 900 

GCTGGAGCCC CAGGCCAGCC AGGGATTCCA GGAACAAAAG GTCTCCCTGG GGCTCCAGGA 960 

ATAGCTGGGC CCCCAGGGCC' TCCTGGCTTT GGGAAACCAG GCTTGCCAGG CCTGAAGGGA 1020 

GAAAGAGGAC CTGCTGGCCT TCCTGGGGGT CCAGGTGCCA AAGGGGAACA AGGGCCAGCA 1080 

GGTCTTCCTG GGAAGCCAGG TCTGACTGGA CCCCCTGGGA ATATGGGACC CCAAGGACCA 1140 

AAAGGCATCC CGGGTAGCCA TGGTCTCCCA GGCCCTAAAG GTGAGACAGG GCCAGCTGGG 12 00 

CCTGCAGGAT ACCCTGGGGC TAAGGGTGAA AGGGGTTCCC CTGGGTCAGA TGGAAAACCA 12 60 

GGGTACCCAG GAAAACCAGG TCTCGATGGT CCTAAGGGTA ACCCAGGGTT ACCAGGTCCA 1320 

AAAGGTGATC CTGGAGTTGG AGGACCTCCT GGTCTCCCAG GCCCTGTGGG CCCAGCAGGA 1380 

GCAAAGGGAA TGCCCGGACA CAATGGAGAG GCTGGCCCAA GAGGTGCCCC TGGAATACCA 1440 

GGTACTAGAG GCCCTATTGG GCCACCAGGC ATTCCAGGAT TCCCTGGGTC TAAAGGGGAT 1500 

CCAGGAAGTC CCGGTCCTCC TGGCCCAGCT GGCATAGCAA CTAAGGGCCT CAATGGACCC 1560 

ACCGGGCCAC CAGGGCCTCC AGGTCCAAGA GGCCACTCTG GAGAGCCTGG TCTTCCAGGG 1620 

CCCCCTGGGC CTCCAGGCCC ACCAGGTCAA GCAGTCATGC CTGAGGGTTT TATAAAGGCA 1680 

GGCCAAAGGC CCAGTCTTTC TGGGACCCCT CTTGTTAGTG CCAACCAGGG GGTAACAGGA 1740 
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WO 02/086443 

ATGCCTGTGT CTGCTTTTAC TGTTATTCTC TCCAAAGCTT ACCCAGCAAT AGGAACTCCC 1800 

ATACCATTTG ATAAAATTTT GTATAACAGG CAACAGCATT ATGACCCAAG GACTGGAATC 1860 

TTTACTTGTC AGATACCAGG AATATACTAT TTTTCATACC ACGTGCATGT GAAAGGGACT 192 0 

CATGTTTGGG TAGGCCTGTA TAAGAATGGC ACCCCTGTAA TGTACACCTA TGATGAATAC 1980 

ACCAAAGGCT ACCTGGATCA GGCTTCAGGG AGTGCCATCA TCGATCTCAC AGAAAATGAC 2040 

CAGGTGTGGC TCCAGCTTCC CAATGCCGAG TCAAATGGCC TATACTCCTC TGAGTATGTC 210 0 

CACTCCTCTT TCTCAGGATT CCTAGTGGCT CCAATGTGAG TACACCCCAC AGAGCTAATC 2160 

TAAATCTTGT GCTAGAAAAA GCATTCTCTA ACTCTACCCC ACCCTACAAA ATGCATATGG 2220 

AGGTAGGCTG AAAAGAATGT AATTTTTATT TTCTGAAATA CAGATTTGAG CTAT CAGACC 2280 

AACAAACCTT CCCCCTGAAA AGTGAGCAGC AACGTAAAAA CGTATGTGAA GCCTCTCTTG 2340 

AATTTCTAGT TAGCAATCTT AAGGCTCTTT AAGGTTTTCT CCAATATTAA AAAATAT CAC 2400 

CAAAGAAGTC CTGCTATGTT AAAAACAAAC AACAAAAAAC AAAGCAACAA AAAAAAAAAT 24 60 

TAAAAAAAAA AACAGAAATA GAGCTCTAAG TTATGTGAAA TTTGATTTGA GAAACTCGGC 2520 

ATTTCCTTTT TAAAAAAGCC TGTTTCTAAC TATGAATATG AGAACTTCTA GGAAACATCC 2580 

AGGAGGTATC ATATAACTTT GTAGAACTTA AATACTTGAA TATTCAAATT TAAAAGACAC 2 640 

TGTATCCCCT AAAATATTTC TGATGGTGCA CTACTCTGAG GCCTGTATGG CCCCTTTCAT 2 7 00 

CAATATCTAT TCAAATATAC AGGTGCATAT ATACTTGTTA AAGCTCTTAT ATAAAAAAGC 2 7 60 

CCCAAAATAT TGAAGTTCAT CTGAAATGCA AGGTGCTTTC ATCAATGAAC CTTTTCAAAA 2 82 0 

CTTTTCTATG ATTGCAGAGA AGCTTTTTAT ATACCCAGCA TAACTTGGAA ACAGGTATCT 2 8 80 

GACCTATTCT TATTTAGTTA ACACAAGTGT GATTAATTTG ATTTCTTTAA TTCCTTATTG 2 940 

AATCTTATGT GATATGATTT TCTGGATTTA CAGAACATTA GCACATGTAC CTTGTGCCTC 3 000 

CCATTCAAGT GAAGTTATAA TTTACACTGA GGGTTTCAAA ATTCGACTAG AAGTGGAGAT 3 0 60 

ATATTATTTA TTTATGGACT GTACTGTATT TTTATATTGC TGTTTAAAAC TTTTAAGCTG 3120 

TGCCTCACTT ATTAAAGCAC AAAATGTTTT ACCTACTCCT TATTTACGAC ACAATAAAAT 3180 

AACATCAATA GATTTTTAGG CTGAATTAAT TTGAAAGCAG CAATTTGCTG TTCTCAACCA 3240 
TTCTTTCAAG GCTTTTCATT CGACACAATA AAATAACATC AATAG 

Seq ID NO: 435 Protein sequence 
Protein Accession #: NP_000484,2 

1 11 21 31 41 51 

III 

MLPQIPFLL.L, VSLNLVHGVF YAERYQMPTG IKGPLPNTKT QFFIPYTIKS KG I AVRGEQG 60 

TPGPPGPAGP RGHPGPSGPP GKPGYGSPGL QGEPGLPGPP GPSAVGKPGV PGLPGKPGER 120 

GPYGPKGDVG PAGLPGPRGP PGPPGIPGPA GISVPGKPGQ QGPTGAPGPR GFPGEKGAPG 180 

VPGMNGQKGE MGYGAPGRPG ERGLPGPQGP TGPSGPPGVG KRGENGVPGQ PGIKGDRGFP 240 

GEMGPIGPPG PQGPPGERGP EGIGKPGAAG APGQPGIPGT KGLPGAPGIA GPPGPPGFGK 300 

PGLPGLKGER GPAGLPGGPG AKGEQGPAGL PGKPGLTGPP GNMGPQGPKG IPGSHGLPGP 360 

KGETGPAGPA GYPGAKGERG SPGSDGKPGY PGKPGLDGPK GNPGLPGPKG DPGVGGPPGL 420 

PGPVGPAGAK GMPGHNGEAG PRGAPG I PGT RGPIGPPGIP GFPGSKGDPG SPGPPGPAGI 480 

ATKGLNGPTG PPGPPGPRGH SGEPGLPGPP GPPGPPGQAV MPEGFIKAGQ RPSLSGTPLV 540 

SANQGVTGMP VSAFTVILSK AYPAIGTPIP FDKILYNRQQ HYDPRTGIFT CQIPGIYYFS 600 

YHVHVKGTHV WVGLYKNGTP VMYTYDEYTK GYLDQASGSA IIDLTENDQV WLQLPNAESN 660 
GLYSSEYVHS SFSGFLVAPM 

Seq ID NO: 436 DNA sequence 

Nucleic Acid Accession #: XM_062811 

Coding sequence: 1..888 

1 11 21 31 41 51 

] | I I I I 

ATGTGGGGCG CTCGCCGCTC GTCCGTCTCC TCATCCTGGA ACGCCGCTTC GCTCCTGCAG 60 

CTGCTGCTGG CTGCGCTGCT GGCGGCGGGG GCGAGGGCCA GCGGCGAGTA CTGCCACGGC 120 

TGGCTGGACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCGACGGC 180 

GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 240 

GCGCGCCTGG ACCAGGGCGG CTGCGACAAT GACCGCCAGC AGGGCGCTGG CGAGCCTGGC 300 

CGGGCGGACA AAGACGGCCC CGACGGCTCG GCAGTGCCCA TCTACGTGCC GTTCCTCATT 360 

GTTGGCTCCG TGTTTGTCGC CTTTATCATC TTGGGGTCCC TGGTGGCAGC CTGTTGCTGC 42 0 

AGATGTCTCC GGCCTAAGCA GGATCCCCAG CAGAGCCGAG CCCCAGGGGG TAACCGCTTG 480 

ATGGAGACCA TCCCCATGAT CCCCAGTGCC AGCACCTCCC GGGGGTCGTC CTCACGCCAG 540 

TCCAGCACAG CTGCCAGTTC CAGCTCCAGC GCCAACTCAG GGGCCCGGGC GCCCCCAACA 60 0 

AGGTCACAGA CCAACTGTTG CTTGCCGGAA GGGACCATGA ACAACGTGTA TGTCAACATG 660 

CCCACGAATT TCTCTGTGCT GAACTGTCAG CAGGCCACCC AGATTGTGCC ACATCAAGGG 720 

CAGTATCTGC ATCCCCCATA CGTGGGGTAC ACGGTGCAGC ACGACTCTGT GCCCATGACA 780 

GCTGTGCCAC CTTTCATGGA CGGCCTGCAG CCTGGCTACA GGCAGATTCA GTCCCCCTTC 840 
CCTCACACCA ACAGTGAACA GAAGATGTAC CCAGCGGTGA CTGTATAA 

Seq ID NO: 437 Protein sequence 
Protein Accession #: XP_062811 

1 11 21 31 41 51 

| I I I I I 

MWGARRSSVS SSWNAASLLQ LLLAALLAAG ARASGEYCHG WLDAQGVWRI GFQCPERFDG 60 

GDATICCGSC ALRYCCSSAE ARLDQGGCDN DRQQGAGEPG RADKDGPDGS AVPIYVPFLI 120 

VGSVFVAFII LGSLVAACCC RCLRPKQDPQ QSRAPGGNRL METIPMIPSA STSRGSSSRQ 180 

SSTAASSSSS ANSGARAPPT RSQTNCCLPE GTMNNVYVNM PTNFSVLNCQ QATQIVPHQG 240 
QYDHPPYVGY TVQHDSVPMT AVPPFMDGLQ PGYRQIQSPF PHTNSEQKMY PAVTV 

Seq ID NO: 43 8 DNA sequence 

Nucleic Acid Accession ft: NM_004004.1 

Coding sequence: 1..681 

1 11 21 31 41 51 

I I I I I I 

ATGGATTGGG GCACGCTGCA GACGAT CCTG GGGGGT GTG A ACAAACACTC CACCAGCATT 60 

GGAAAGATCT GGCTCACCGT CCTCTTCATT TTTCGCATTA TGATCCTCGT TGTGGCTGCA 120 

AAGGAGGTGT GGGGAGATGA GCAGGCCGAC TTTGTCTGCA ACACCCTGCA GCCAGGCTGC 180 
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GGCCCTGCAG 
CCGGAGACAT 
CAT CGAGG AG 
AAGCAGCATC 
GTACGACGGC 
TGTGGACTGC 
AGTGTCTGGA 
TTGTTCTGGG 



WO 02/086443 

AAGAACGTGT GCTACGATCA CTACTTCCCC ATCTCCCACA TCCGGCTATG 
CTGATCTTCG TGTCCAGCCC AGCGCTCCTA GTGGCCATGC ACGTGGCCTA 
GAGAAGAAGA GGAAGTTCAT CAAGGGGGAG ATAAAGAGTG AATTTAAGGA 
ATCAAAACCC AGAAGGTCCG CATCGAAGGC TCCCTGTGGT GGACCTACAC 
TTCTTCCGGG TCATCTTCGA AGCCGCCTTC ATGTACGTCT TCTATGTCAT 
TTCTCCATGC AGCGGCTGGT GAAGTGCAAC GCCTGGCCTT GTCCCAACAC 
TTTGTGTCCC GGCCCACGGA GAAGACTGTC TTCACAGTGT TCATGATTGC 
ATTTGCATCC TGCTGAATGT CACTGAATTG TGTTATTTGC TAATTAGATA 
AAGTCAAAAA AGCCAGTTTA A 

Seq ID NO: 43 9 Protein sequence 
Protein Accession #: NP_0 03 995.1 

1 11 21 31 41 51 

I I I I I I 

MDWGTLQTIL GGVNKHSTSI GKIWLTVLFI FRIMILWAA KEVWGDEQAD FVCNTLQPGC 
KNVCYDHYFP ISHIRLWALQ LI FVS SPALL VAMHVAYRRH EKKRKFIKGE IKSEFKDIEE 
IKTQKVRIEG SLWWTYTSSI FFRVI FEAAF MYVFYVMYDG FSMQRLVKCN AWPCPNTVDC 
FVSRPTEKTV FTVFMIAVSG ICILLNVTEL CYLLIRYCSG KSKKPV 



PCT/US02/12476 



Seq ID NO: 440 DNA sequence 

Nucleic Acid Accession #: XM_061091.1 

Coding sequence: 1..2481 



ATGCCAAATA 
CACCGAGCGC 
CCCGGGTACC 
CAGATGCATG 
GTTGGGAAGC 
ATTTCAGCTG 
GATGGGTCTA 
GTCTGTGACG 
TCCACTCCTC 
AGAATCAAGA 
CTTCTGCACA 
GTCACTGATG 
GGTGTCACTG 
GCCAGCGAGC 
GGCCTCTTCA 
CCCGAGCTTG 
CAGCCCTGCC 
TGCCCGCTGG 
GTCGACCTCC 
GCCAAAGTCT 
GTGGGTGTGG 
GATGTGCCTG 
ACGGG CAGTG 
CAGGACCGGC 
GCGGGCCCAG 
GCCGTGCGGG 
GATCCTCAGG 
CGGCCAGGGT 
GTAGGGCCCG 
GAGGTGAACC 
GCCTTCGGGC 
CCCTACCTAG 
ATGACCGTCC 
GGGAGAGGCG 
GTCTTGGTCG 
CGGGATTCCC 
ATTGAGTGGC 
ATGAATGAGG 
TGGGAGGGCC 
GGATGGATTC 
ACCCCTCCCA 
GTCTGTGCCC 



11 
I 

CTTCAGGAAC 
TGGTCGCCGC 
CGCCAGTGCC 
GTGAGAAGGT 
CTCCTTTTGA 
CCAGCAAAAT 
ACAGCGT CGG 
GTCTGGACAT 
ATCTGGAATT 
GGATGGTTTT 
GAGGGTTGCC 
GGAAGTCCCA 
TGTTTGCTGT 
CTAGAGGGCA 
GCACCCTCAG 
TCTTCATGGA 
AGAATGGAGG 
CCTTTGGAGG 
TCTTCCTGCT 
TCGTGAAGCG 
CCACATACAG 
ACCTGGTCTG 
CCTTGCGGCA 
CACGTAGAGT 
CGCGTCACGC 
CAGAGCTGGA 
ATCTGTTCAA 
GCCGGACACA 
AGAATTTTGC 
CTGACGTGAC 
TGGACACCAA 
GTGGGGTGGG 
AG AGGGGTG C 
CAGAGGATGC 
TGGGCGTGGG 
TGATCCACGT 
TGTGTGGAGA 
GCAGCTGCGT 
CCCACTGCGA 
TTGAGACGCC 
GCAACTACAG 
CAGGTCCTTA 



21 
I 

AACCAGGATT 
TCTCCTTCCG 
GGCTGCCGAT 
AGATCTCTGG 
GGCAAACGAA 
GATGTGGTGC 
G AAAGGG AG C 
CAGCCCCGAG 
CCCCTTGGAT 
CAAAGGAGGG 
TGGAGGCAGA 
GGGGGATGTG 
GGGGGTCAGG 
GCACGTGCTG 
CAGCTCGGCC 
GCGGTTAATG 
CACATGTGTT 
GGAGGCTAAC 
GGACAGCTCT 
GTTTGTGCGG 
CAGGGAGCTG 
GAGCCT CGAT 
GGCGGCAGAG 
GGTGGTTTTG 
AAGGGCGCGA 
GGAGATCACA 
CCAAATCCCT 
AGCCCTGGAC 
TCAGATGCAG 
ACAGGTCGGC 
ACCCACCCGG 
CTCAGCCGGC 
CCGGCCTGGT 
AGCCGTTCCT 
GCCTGTCCTA 
GGCAGCTTAC 
AGCCAAGCAG 
CCTGCAGAAT 
GAACCGTGAG 
CCTGAGGCAC 
AGAAGGCCTG 
G 



31 

I 

GAAATTTGGC 
GTGAGTCCCA 
GACCGATTCA 
AGCCTTGGTG 
GTCCATGTAA 
TCGGCTGCAG 
TTTGAAAGGT 
AGGGTCAGAG 
TCATTTTCAA 
CGCACGGAGA 
AATGCTTCTG 
GCACTGCCAT 
TTTCCCAGGT 
TTGGCTGAGC 
ATCTGCTCCA 
GGCATCTCTC 
CCAGAAGGAC 
TGTGCCCTGA 
GCGGG CACCA 
GCCGTGCTGA 
CTGGTGGCGG 
GGCATTCCCT 
CGTGGCTTCG 
CTCACTGAGT 
GAGCTGCTCC 
GGCAGCCCAA 
GAGCTGCAGG 
CTCGTCTTCA 
AGCTTTGTGA 
CTGGTGGTGT 
GCTGCGATGC 
ACCGCCCTGC 
GTCCCCAAAG 
GCCCAGAAGC 
AGTGAGGGTC 
GCCGACCTGC 
CCAGTCAACC 
GGGAGCTACC 
TGGAGCTCTT 
ATGGCTCCCG 
GGCACTGAAA 



41 
I 

TTCTCCAAGA 
GCCCCGAGTT 
CGCTCCCGAT 
TTCTTTGCTA 
GCAAAGAAAC 
TGGACATCAT 
CCAAGCACTT 
TGGGAGCATT 
CCCAACAGGA 
CGGAACTTGC 
TGCCCCAGAT 
CCAAGCAGCT 
GGGAGGAGCT 
AGGTGGAGGA 
GCGCCACGCC 
TGATAGGCCC 
TGGACGGCTA 
AGCTGAGCCT 
CTCTGGACGG 
GCGAGGACTC 
TGCCTGTGGG 
TCCGTGGTGG 
GGAGCGCCAC 
CACACTCCGA 
TGCTGGGTGT 
AGCATGTGAT 
GGAAGCTGTG 
TGTTGGACAC 
GAAGCTGTGC 
ATGGCAGCCA 
TGCGGGCCAT 
TGCACATCTA 
CTGTGGTGGT 
TGAGGAACAA 
TGCGGAGGCT 
GGTACCACCA 
TCTGCAAACC 
GCTGCAAGTG 
GCTCTGTATG 
TGCAGGAGGG 
TGGTGCCTAC 



51 
I 

GCCGCCCGGG 
GGCTCTGGCG 
GATTGGAGGT 
TGAATTTTTA 
CAT CGGG AAG 
GTTTCTGTTA 
TGCCATCACA 
CCAGTTCAGT 
AGTGAAGGCA 
TCTGAAATAC 
CCTCATCATC 
GAAGGAAAGG 
GCATGCACTG 
TGCCACCAAC 
AGCTGGGAGC 
CTGTGACTCG 
CCAGTGCCTC 
GGAATGCAGG 
CTTCCTGCGG 
TCGGGCCCGA 
GGAGTACCAG 
CCCCACCCTG 
CAGGACAGGC 
GGATGAGGTT 
AGGCAGTGAG 
GGTCTACTCG 
CAGCCGGCAG 
CTCTGCCTCA 
CCTCCAGTTT 
GGTGCAGACT 
TAG CCAGGCC 
TGACAAAGTG 
GCTCACAGGC 
TGGCATCTCT 
TGCAGGTCCC 
GGACGTGCTC 
CAGCCCGTGC 
TCGGGATGGC 
TGTGAGCCAG 
CAGCAGCCGT 
CTTCTGGAAT 



Seq ID NO: 441 Protein sequence 
Protein Accession ft: XP_061091.1 



MPNTSGTTRI 
QMHGEKVDLW 
DGSNSVGKGS 
RIKRMVFKGG 
GVTVFAVGVR 
PELVFMERLM 
VDLLFLLDSS 
DVPDLVWSLD 
AGPARHARAR 
RPGCRTQALD 
AFGLDTKPTR 
GRGAEDAAVP 



11 

I 

EIWLLQEPPG 
SLGVLCYEFL 
FERSKHFAIT 
RTETELALKY 
FPRWEELHAL 
GISLIGPCDS 
AGTTLDGFLR 
GIPFRGGPTL 
ELLLLGVGSE 
LVFMLDTSAS 
AAMLRAISQA 
AQKLRNNG I S 



21 
I 

HRALVAALLP 
VGKPPFEANE 
VCDGLDISPE 
LLHRGLPGGR 
ASEPRGQHVL 
QPCQNGGTCV 
AKVFVKRFVR 
TGSALRQAAE 
AVRAELEEIT 
VGPENFAQMQ 
PYLGGVGSAG 
VLWGVGPVL 



31 
I 

VSPSPELALA 
VHVSKETIGK 
RVRVGAFQFS 
NASVPQILII 
LAEQVEDATN 
PEGLDGYQCL 
AVLSEDSRAR 
RGFGSATRTG 
GSPKHVMVYS 
SFVRSCALQF 
TALLHIYDKV 
SEGLRRLAGP 



41 

I 

PGYPPVPAAD 
ISAASKMMWC 
STPHLEFPLD 
VTDGKSQGDV 
GLFSTLSSSA 
CPLAFGGEAN 
VGVATYSREL 
QDRPRRVWL 
DPQDLFNQIP 
EVNPDVTQVG 
MTVQRGARPG 
RDSLIHVAAY 



240 
300 
360 
420 
480 
540 
600 
660 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



51 
i 

DRFTLPMIGG 
SAAVDIMFLL 
SFSTQQEVKA 
ALPSKQLKER 
ICSSATPAGS 
CALKLSLECR 
LVAVPVGEYQ 
LTESHSEDEV 
ELQGKLCSRQ 
LWYGSQVQT 
VPKAWVLTG 
ADLRYHQDVL 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



350 



10 
15 

20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

IEWLCGEAKQ PVNLCKPSPC MNEGSCVLQN GSYRCKCRDG WEGPHCENRE WSSCSVCVSQ 
GWILETPLRH MAPVQEGSSR TPPSNYREGL GTEMVPTFWN VCAPGP 

Seq ID NO: 442 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence : 1 . . 2424 



PCT/US02/12476 



ATGCCCCCTT 
TCTCTCCCTC 
AGCAAAATGA 
AGCGTCGGGA 
CTGGACATCA 
CTGGAATTCC 
ATGGTTTTCA 
GGGTTGCCTG 
AAGT CCCAGG 
TTTGCTGTGG 
AGAGGGCAGC 
ACCCTCAGCA 
CCCTGTGAGC 
AGAGGATCGC 
AGAGTGTTCC 
TCGCAGCCCT 
CTCTGCCCGC 
AGGGTCGACC 
CGGGCCAAAG 
CGAGTGGGTG 
CAGGATGTGC 
CTGACGGGCA 
GGCCAGGACC 
GTTGCGGGCC 
GAGGCCGTGC 
TCGGATCCTC 
CAGCGGCCAG 
TCAGTAGGGC 
TTTGAGGTGA 
ACTGCCTTCG 
GCCCCCTACC 
GTGATGACCG 
GGCGGGAGAG 
TCTGTCTTGG 
CCCCGGGATT 
CTCATTGAGT 
TGCATGAATG 
GGCTGGGAGG 
CAGGGATGGA 
CGTACCCCTC 
AATGTCTGTG 



11 
I 

TCCTGTTGCT 
TCCAGGAAGT 
TGTGGTGCTC 
AAGGGAGCTT 
GCCCCGAGAG 
CCTTGGATTC 
AAGGAGGGCG 
GAGGCAGAAA 
GGGATGTGGC 
GGGTCAGGTT 
ACGTGCTGTT 
GCTCGGCCAT 
ACAGGACGCT 
GGCGGACCCT 
TAACCCACCC 
GCCAGAATGG 
TGGCCTTTGG 
TCCTCTTCCT 
TCTTCGTGAA 
TGGCCACATA 
CTGACCTGGT 
GTGCCTTGCG 
GGCCACGTAG 
CAGCGCGTCA 
GGGCAGAGCT 
AGGATCTGTT 
GGTGCCGGAC 
CCGAGAATTT 
ACCCTGACGT 
GGCTGGACAC 
TAGGTGGGGT 
TCCAGAGGGG 
GCGCAGAGGA 
TCGTGGGCGT 
CCCTGATCCA 
GGCTGTGTGG 
AGGGCAGCTG 
GCCCCCACTG 
TTCTTGAGAC 
CCAGCAACTA 
CCCCAGGTCC 



21 
I 

GGAGGCCGTC 
CCATGTAAGC 
GGCTGCAGTG 
TGAAAGGTCC 
GGTCAGAGTG 
ATTTTCAACC 
CACGGAGACG 
TGCTTCTGTG 
ACTGCCATCC 
TCCCAGGTGG 
GGCTGAGCAG 
CTGCTCCAGC 
GGAGATGGTC 
TGCGGTGCTG 
TGCCACCTGC 
AGGCACATGT 
AGGGGAGGCT 
GCTGGACAGC 
GCGGTTTGTG 
CAGCAGGGAG 
CTGGAGCCTC 
GCAGGCGGCA 
AGTGGTGGTT 
CGCAAGGGCG 
GGAGGAGATC 
CAACCAAATC 
ACAAGCCCTG 
TGCTCAGATG 
GACACAGGTC 
CAAACCCACC 
GGGCTCAGCC 
TGCCCGGCCT 
TGCAGCCGTT 
GGGGCCTGTC 
CGTGGCAGCT 
AGAAGCCAAG 
CGTCCTGCAG 
CGAGAACCGT 
GCCCCTGAGG 
CAGAGAAGGC 
TTAG 



31 
i 

TGTGTTTTCC 
AAAGAAACCA 
GACATCATGT 
AAGCACTTTG 
GGAGCATTCC 
CAACAGGAAG 
GAACTTGCTC 
CCCCAGATCC 
AAG CAGCTGA 
GAGGAGCTGC 
GTGGAGGATG 
GCCACGCCAG 
CGGGAGTTCG 
GCTGCACACT 
TACAGGACCA 
GTT CCAGAAG 
AACTGTGCCC 
TCTGCGGGCA 
CGGGCCGTGC 
CTGCTGGTGG 
GATGGCATTC 
GAGCGTGGCT 
TTGCTCACTG 
CGAGAGCTGC 
ACAGGCAGCC 
CCTGAGCTGC 
GACCTCGTCT 
CAGAGCTTTG 
GGCCTGGTGG 
CGGGCTGCGA 
GGCACCGCCC 
GGTGTCCCCA 
CCTGCCCAGA 
CTAAGTGAGG 
TACGCCGACC 
CAGCCAGTCA 
AATGGGAGCT 
GAGTGGAGCT 
CACATGGCTC 
CTGGGCACTG 



41 
I 

TGTTTTCCAG 
TCGGGAAGAT 
TTCTGTTAGA 
CCAT CACAGT 
AGTTCAGTTC 
TGAAGGCAAG 
TGAAATACCT 
TCATCATCGT 
AGGAAAGGGG 
ATGCACTGGC 
CCACCAACGG 
ACTGCAGGGT 
CTGGCAATGC 
GTCCCTTCTA 
CCTGCCCAGG 
GACTGGACGG 
TGAAGCTGAG 
CCACTCTGGA 
TGAGCGAGGA 
CGGTGCCTGT 
CCTTCCGTGG 
TCGGGAGCGC 
AGTCACACTC 
TCCTGCTGGG 
CAAAGCATGT 
AGGGGAAGCT 
TCATGTTGGA 
TGAGAAGCTG 
TGTATGGCAG 
TGCTGCGGGC 
TGCTGCACAT 
AAGCTGTGGT 
AGCTGAGGAA 
GTCTGCGGAG 
TGCGGTACCA 
ACCTCTGCAA 
ACCGCTGCAA 
CTTGCTCTGT 
CCGTGCAGGA 
AAATGGTGCC 



Seq ID NO: 443 Protein sequence 
Protein Accession #: Eos sequence 



MPPFLLLEAV 
SVGKGSFERS 
MVFKGGRTET 
FAVGVRFPRW 
PCEHRTLEMV 
SQPCQNGGTC 
RAKVFVKRFV 
LTGSALRQAA 
EAVRAELEEI 
SVGPENFAQM 
APYLGGVGSA 
SVLWGVGPV 
CMNEGSCVLQ 
RTPPSNYREG 



11 

I 

CVFLFSRVPP 
KHFAITVCDG 
ELALKYLLHR 
EELHALASEP 
REFAGNAPCW 
VPEGLDGYQC 
RAVLSEDSRA 
ERGFGSATRT 
TGSPKHVMVY 
QSFVRSCALQ 
GTALLHIYDK 
LSEGLRRLAG 
NGSYRCKCRD 
LGTEMVPTFW 



21 

1 

SLPLQEVHVS 
LDISPERVRV 
GLPGGRNASV 
RGQHVLLAEQ 
RGSRRTLAVL 
LCPLAFGGEA 
RVGVATYSRE 
GQDRPRRVW 
SDPQDLFNQI 
FEVNPDVTQV 
VMTVQRGARP 
PRDSLIHVAA 
GWEGPHCENR 
NVCAPGP 



31 
I 

KETIGKISAA 
GAFQFSSTPH 
PQILIIVTDG 
VEDATNGLFS 
AAHCPFYSWK 
NCALKLSLEC 
LLVAVPVGEY 
LLTESHSEDE 
PELQGKLCSR 
GLWYGSQVQ 
GVPKAWVLT 
YADLRYHQDV 
EWSSCSVCVS 



41 
I 

SKMMWCSAAV 
LEFPLDSFST 
KSQGDVALPS 
TLSSSAICSS 
RVFLTHPATC 
RVDLLFLLDS 
QDVPDLVWSL 
VAGPARHARA 
QRPGCRTQAL 
TAFGLDTKPT 
GGRGAEDAAV 
LIEWLCGEAK 
QGWILETPLR 



Seq ID NO: 444 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 8 9.. 2 356 



GCCCCCTGGC 
GTCGCCGCTC 
TGTTTTCCTG 
AGAAACCATC 
CAT CATGTTT 
GCACTTTGCC 
AGCATTCCAG 
ACAGGAAGTG 
ACTTGCTCTG 
CCAGATCCTC 



11 

I 

CCGAGCCGCG 
TCCTTCCGTT 
TTTTCCAGAG 
GGGAAGATTT 
CTGTTAGATG 
ATCACAGTCT 
TTCAGTTCCA 
AAGGCAAGAA 
AAATACCTTC 
ATCATCGTCA 



21 

I 

CCCGGGTCTG 
ATATCAACAT 
TGCCCCCATC 
CAGCTGCCAG 
GGT CTAACAG 
GTGACGGTCT 
CTCCTCATCT 
TCAAGAGGAT 
TGCACAGAGG 
CTGATGGGAA 



31 
I 

TGAGTAGAGC 
GCCCCCTTTC 
TCTCCCTCTC 
CAAAATGATG 
CGTCGGGAAA 
GGACATCAGC 
GGAATTCCCC 
GGTTTTCAAA 
GTTGCCTGGA 
GTCCCAGGGG 



41 
I 

CGCCCGGGCA 
CTGTTGCTGG 
CAGGAAGTCC 
TGGTGCTCGG 
GGGAGCTTTG 
CCCGAGAGGG 
TTGGATT CAT 
GGAGGGCGCA 
GGCAGAAATG 
GATGTGGCAC 



51 
I 

AGTGCCCCCA 
TTCAGCTGCC 
TGGGTCTAAC 
CTGTGACGGT 
CACTCCTCAT 
AATCAAGAGG 
TCTGCACAGA 
CACTGATGGG 
TGTCACTGTG 
CAGCGAGCCT 
CCTCTTCAGC 
CGAGGCTCAC 
CCCATGCTGG 
CAGCTGGAAG 
CCCCTGTGAC 
CTACCAGTGC 
CCTGGAATGC 
CGGCTTCCTG 
CTCTCGGGCC 
GGGGGAGTAC 
TGGCCCCACC 
CACCAGGACA 
CGAGGATGAG 
TGTAGGCAGT 
GATGGT CT AC 
GTGCAGCCGG 
CACCTCTGCC 
TGCCCTCCAG 
CCAGGTGCAG 
CATTAGCCAG 
CTATGACAAA 
GGTGCTCACA 
CAATGGC AT C 
GCTTGCAGGT 
CCAGGACGTG 
ACCCAGCCCG 
GTGTCGGGAT 
ATGTGTG AG C 
GGGCAGCAGC 
TACCTTCTGG 



51 
I 

DIMFLLDGSN 
QQEVKARIKR 
KQLKERGVTV 
ATPD CRVEAH 
YRTTCPGPCD 
SAGTTLDGFL 
DGIPFRGGPT 
RELLLLGVGS 
DLVFMLDTSA 
RAAMLRAISQ 
PAQKLRNNGI 
QPVNLCKPSP 
HMAPVQEGSS 



51 
I 

CCGAGCGCTG 
AAGCCGTCTG 
ATGTAAGCAA 
CTGCAGTGGA 
AAAGGTCCAA 
TCAGAGTGGG 
TTTCAACCCA 
CGGAGACGGA 
CTTCTGTGCC 
TGCCATCCAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



351 



WO 02/086443 

GCAGCTGAAG GAAAGGGGTG TCACTGTGTT TGCTGTGGGG GTCAGGTTTC CCAGGTGGGA 660 

GGAGCTGCAT GCACTGGCCA GCGAGCCTAG AGGGCAGCAC GTGCTGTTGG CTGAGCAGGT 720 

GGAGGATGCC ACCAACGGCC TCTTCAGCAC CCTCAGCAGC TCGGCCATCT GCTCCAGCGC 780 

CACGCCAGAC TGCAGGGTCG AGGCTCACCC CTGTGAGCAC AGGACGCTGG AGATGGTCCG 840 

5 GGAGTTCGCT GGCAATGCCC CATGCTGGAG AGGATCGCGG CGGACCCTTG CGGTGCTGGC 900 

TGCACACTGT CCCTTCTACA GCTGGAAGAG AGTGTTCCTA ACCCACCCTG CCACCTGCTA 960 

CAGGACCACC TGCCCAGGCC CCTGTGACTC GCAGCCCTGC CAGAATGGAG GCACATGTGT 102 0 

TCCAGAAGGA CTGGACGGCT ACCAGTGCCT CTGCCCGCTG GCCTTTGGAG GGGAGGCTAA 1080 

CTGTGCCCTG AAGCTGAGCC TGGAATGCAG GGTCGACCTC CTCTTCCTGC TGGACAGCTC 1140 

10 TGCGGGCACC ACTCTGGACG GCTTCCTGCG GGCCAAAGTC TTCGTGAAGC GGTTTGTGCG 1200 

GGCCGTGCTG AGCGAGGACT CTCGGGCCCG AGTGGGTGTG GCCACATACA GCAGGGAGCT 1260 

GCTGGTGGCG GTGCCTGTGG GGGAGTACCA GGATGTGCCT GACCTGGTCT GGAGCCTCGA 132 0 

TGGCATTCCC TTCCGTGGTG GCCCCACCCT GACGGGCAGT GCCTTGCGGC AGGCGGCAGA 13 80 

GCGTGGCTTC GGGAG CGCCA CCAGGACAGG CCAGGACCGG CCACGTAGAG TGGTGGTTTT 1440 

15 GCTCACTGAG TCACACTCCG AGGATGAGGT TGCGGGCCCA GCGCGTCACG CAAGGGCGCG 1500 

AGAGCTGCTC CTGCTGGGTG TAGGCAGTGA GGCCGTGCGG GCAGAGCTGG AGGAGATCAC 1560 

AGGCAGCCCA AAGCATGTGA TGGTCTACTC GGATCCTCAG GATCTGTTCA ACCAAATCCC 1620 

TGAGCTGCAG GGGAAGCTGT GCAGCCGGCA GCGGCCAGGG TGCCGGACAC AAGCCCTGGA 1680 

CCTCGTCTTC ATGTTGGACA CCTCTGCCTC AGTAGGGCCC GAGAATTTTG CTCAGATGCA 1740 

20 GAGCTTTGTG AGAAGCTGTG CCCTCCAGTT TGAGGTGAAC CCTGACGTGA CACAGGTCGG 1800 

CCTGGTGGTG TATGGCAGCC AGGTGCAGAC TGCCTTCGGG CTGGACACCA AACCCACCCG 1860 

GGCTGCGATG CTGCGGGCCA TTAGCCAGGC CCCCTACCTA GGTGGGGTGG GCTCAGCCGG 192 0 

CACCGCCCTG CTGCACATCT ATGACAAAGT GATGACCGTC CAGAGGGGTG CCCGGCCTGG 1980 

TGTCCCCAAA GCTGTGGTGG TGCTCACAGG CGGGAGAGGC GCAGAGGATG CAGCCGTTCC 2040 

25 TGCCCAGAAG CTGAGGAACA ATGGCATCTC TGTCTTGGTC GTGGGCGTGG GGCCTGTCCT 2100 

AAGTGAGGGT CTGCGGAGGC TTGCAGGTCC CCGGGATTCC CTGATCCACG TGGCAGCTTA 2160 

CGCCGACCTG CGGTACCACC AGGACGTGCT CATTGAGTGG CTGTGTGGAG AAGCCAAGCA 2220 

GCCAGTCAAC CTCTGCAAAC CCAGCCCGTG CATGAATGAG GGCAGCTGCG TCCTGCAGAA 22 8 0 

TGGGAGCTAC CGCTGCAAGT GTCGGGATGG CTGGGAGGGC CCCCACTGCG AGAACCGATT 2340 

30 CTTGAGACGC CCCTGAGGCA CATGGCTCCC GTGCAGGAGG GCAGCAGCCG TACCCCTCCC 240 0 

AGCAACTACA GAGAAGGCCT GGGCACTGAA ATGGTGCCTA CCTTCTGGAA TGTCTGTGCC 2460 

CCAGGTCCTT AGAATGTCTG CTTCCCGCCG TGGCCAGGAC CACTATTCTC ACTGAGGGAG 252 0 

GAGGATGTCC CAACTGCAGC CATGCTGCTT AGAGACAAGA AAGCAGCTGA TGTCACCCAC 2 580 

AAACGATGTT GTTGAAAAGT TTTGATGTGT AAGTAAATAC CCACTTTCTG TACCTGCTGT 2640 

35 GCCTTGTTGA GGCTATGTCA TCTGCCACCT TTCCCTTGAG GATAAACAAG GGGTCCTGAA 2700 

GACTTAAATT TAGCGGCCTG ACGTTCCTTT GCACACAATC AATGCTCGCC AGAATGTTGT 2760 
TGACACAGTA ATGCCCAGCA GAGGCCTTTA CTAGAGCATC CTTTGGACGG 



PCT/US02/12476 



40 



Seq ID NO: 445 Protein sequence 
Protein Accession #: Eos sequence 



45 



50 



55 



MPPFLLLEAV 
SVGKGSFERS 
MVFKGGRTET 
FAVGVRFPRW 
PCEHRTLEMV 
SQPCQNGGTC 
RAKVFVKRFV 
LTGSALRQAA 
EAVRAELEEI 
SVGPENFAQM 
APYLGGVGSA 
SVLWGVGPV 
CMNEGSCVLQ 



11 
I 

CVFLFSRVPP 
KHFAITVCDG 
ELALKYLLHR 
EELHALASEP 
REFAGNAPCW 
VPEGLDGYQC 
RAVLSEDSRA 
ERGFGSATRT 
TGSPKHVMVY 
QSFVRSCALQ 
GTALLH I YDK 
LSEGLRRLAG 
NGSYRCKCRD 



21 
I 

SLPLQEVHVS 
LDISPERVRV 
GLPGGRNASV 
RGQHVLLAEQ 
RGSRRTLAVL 
LCPLAFGGEA 
RVGVATYSRE 
GQDRPRRVW 
SDPQDLFNQI 
FEVNPDVTQV 
VMTVQRGARP 
PRDSLIHVAA 
GWEGPHCENR 



31 

I 

KETIGKISAA 
GAFQFSSTPH 
PQILIIVTDG 
VEDATNGLFS 
AAHCPFYSWK 
NCALKLSLEC 
LLVAVPVGEY 
LLTESHSEDE 
PELQGKLCSR 
GLWYGSQVQ 
GVPKAVWLT 
YADLRYHQDV 
FLRRP 



41 
I 

SKMMWCSAAV 
LEFPLDSFST 
KSQGDVALPS 
TLSSSAICSS 
RVFLTHPATC 
RVDLLFLLDS 
QDVPDLVWSL 
VAGPARHARA 
QRPGCRTQAL 
TAFGLDTKPT 
GGRGAEDAAV 
LIEWLCGEAK 



51 
I 

DIMFLLDGSN 
QQEVKARIKR 
KQLKERGVTV 
ATPDCRVEAH 
YRTTCPGPCD 
SAGTTLDGFL 
DGIPFRGGPT 
RELLLLGVGS 
DLVFMLDTSA 
RAAMLRAISQ 
P AQKLRNNG I 
QPVNLCKPSP 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



Seq ID NO: 446 DNA sequence 
Nucleic Acid Accession #: NM_031942. 
OO Coding sequence: 145.. 1260 



65 



70 



75 



80 



85 



CCCGAGCCCC 
TGCTCCTCCT 
CCGATCTGGG 
GTAAAGAAGA 
TCCTCTGATG 
TCAGTTCGGG 
GCGATGAAGT 
CAGCCCTCAG 
AATTTTTTGG 
ATGTCTGAAT 
GACTCACAAT 
CCTGAACGGA 
GCTCTACCCA 
ACCGTGGATG 
GTGACCCTTC 
GTCTGCAGCA 
TGCCGTCAGA 
CGAGGCCAGT 
CTGCTGGATC 
CGGCAGCGAG 
TTTGGGAATG 
TATCTGGAAA 
TTTTTTCACT 
AATCAAGTTA 



11 

i 

GCCCCTCCGG 
GCTGTGGGAC 
CACCCGCCAC 
ACTTAAAGAA 
ACAGTTGTGA 
AAGGCTGTAG 
TTCCAGCGCG 
AGAATTCTGT 
AGAAAAGGG C 
TAGAAAGCTT 
CAAGGAGACC 
GAGCTCGTCC 
TGGAGGAGGA 
GCTACATGAA 
CGCATATAAT 
ATTCTCGAGA 
AGACTATTGA 
TCTGTGGCCC 
CGAACTGGCA 
ATGGACGGTG 
TGCATGCCTA 
ATTTGCTGCC 
GAAACCTGAG 
ATCTTAGCAG 



21 

t 

GCCCGGGTCG 
CGCTGACCGC 
CAGCATGGAC 
ATTCAGATAT 
CAGCTTTGCT 
GACCCGCAGC 
GAGTACCAGG 
GACTGATTCC 
TTTAAATATA 
CCCTGGCTCG 
GCGAAGGCGT 
TCTTACCAGG 
GGAGGAAGAG 
TGAAGATGAC 
TCGCCCAGTG 
GAAGATATAT 
TACCAAAACA 
CTGCCTTCGA 
TTGCCCGCCT 
TGCGACTGGG 
CTTGAAAAGC 
TGCCTTCTAC 
TTAAAAATCT 
ACATGTGTTT 



31 
I 

GCGCGCCCAG 
GCGGCTGCTC 
GCTCGCCGCG 
GTGAAGTTGA 
TCTGATAATT 
CAGTGCAGGC 
GGAGCAACCA 
AACTCCGATT 
AAGCAAAACA 
TTCCGTGGAA 
ACATTCCCGG 
TCAAGGTCCC 
GATAAGTACA 
CTGCCCAGAA 
GAAGAAATTA 
AACCGTTCAC 
AACTGCAGAA 
AACCGTTATG 
TGTCGAGGAA 
GTCCTTGTGT 
CTGAAACAGG 
TTCTCAAATC 
TGATGAT CAG 
CTGGAGCATC 



41 

I 

CCTGCCAGCC 
CGCTCTCCCC 
TGCCGCAGAA 
TTTCCATGGA 
TTGCAAACAC 
ACTCTGGACC 
ACAAAAAAGC 
CAGAAGATGA 
AAGCAATGCT 
GACATCCCCT 
GTGTTGCTTC 
GGATCCTCGG 
TGTTGGTGAG 
GCCGTCGCTC 
CAGAGGAGGA 
TGGGCTCTAC 
ACCCAGACTG 
GTGAAGAGGT 
TCTGCAACTG 
ATTTAGCCAA 
AATTTGAAAT 
TTTCTTGTAA 
CCTGTTTCAT 
ACAGAAGGTA 



51 

GCGCTGCTGC 
GCTCCAAGCG 
AGATCTCAGA 
AACCTCGTCA 
GAGGCTGCAG 
TCTCAGGGTG 
AGAGTCCCGC 
AAGTGGAATG 
TGCAAAACTC 
CCCAGGCTCC 
CAGGAGAAAC 
GTCCCTTGAC 
AAAGAGGAAG 
CAGATCATCC 
GTTGGAGAAC 
TTGTCATCAA 
CTGGGGCGTT 
CAGGGATGCT 
CAGTTTCTGC 
ATATCATGGC 
GCAAGCATAA 
AAGTTTCCAA 
AAGAAACTCC 
TATTGCTAGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
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TACACTTTGC CCTCCTGCAG TTTCTTCTCT GCTCCCAACC CCCATCTCAT AGCATCCCCC 1500 

TCTATTTCCA ATGCTCCTCT CCAACCGCTT AGTTTCTGAA TTTCTTTTAA ATTACAGTTT 1560 

TATGAAAGCA TATTTTATTT ACTTGGTGTT GAAATAGCCC TCATAAAACC TAAGCACTTG 1620 

GAAACACAAT AATAGTATTA ACTAACTAGA TCTATTGAAT TTCAGAGAAG AGCCTTCTAA 1680 

CTTGTTTACA CAAAAACGAG TATGATTTAG CACTCATACT AGTTGAAATT TTTAATAGAA 1740 

TCAAGGCACA AAAGTCTTAA AACCATGTGG AAAAATTAGG TAATTATTGC AG AT TGATGT 1800 

CTCTCAATCC CATGTATTGC GCTTATGTTA CAAGTTGTTG TCACAGTTGA GACTTAATTT 18 60 

CTCCTAATTT CTTCTGCCCG AAG GGTAAGT GGTGCGTCCA GCTTACACGA TCATAATTCA 1920 

AAGGTTGGTG GGCAATGTAA TACTTAATTA AAATAATGAT GGAAGAGCTA TCTGGAGATT 1980 

ATGAGTAAGC TGATTTGAAT TTTCAGTATA AAACTTTAGT ATAATTGTAG TTTGCAAAGT 2 040 

TTATTTCAGT TCACATGTAA GGTATTGCAA ATAAATTCTT GGACAATTTT GTATGGAAAC 210 0 

TTGATATTAA AAACTAGTCT GTGGTTCTTT GCAGTTTCTT GTAAATTTAT AAACCAGGCA 2160 

CAAGGTT CAA GTTTAGATTT TAAGCACTTT TATAACAATG ATAAGTGCCT TTTTGGAGAT 222 0 

GTAACTTTTA GCAGTTTGTT AACCTGACAT CTCTGCCAGT CTAGTTTCTG GGCAGGTTTC 2280 

CTGTGTCAGT ATTCCCCCTC CTCTTTGCAT TAATCAAGGT ATTTGGTAGA GGTGGAATCT 2340 

AAGTGTTTGT ATGTCCAATT TACTTGCATA TGTAAACCAT TGCTGTGCCA TTCAATGTTT 2400 

GATGCATAAT TGGACCTTGA ATCGATAAGT GTAAATACAG CTTTTGATCT GTAATGCTTT 2460 
TATACAAAAG TTTATTTTAA TAATAAAATG TTTGTTCTAA AAAAAAAAAA 



Seq ID NO: 447 Protein sequence 
Protein Accession #: NP_114148.1 

1 11 21 31 41 51 

1111)1 

MDARRVPQKD IjRVKKNLKKF RYVKLISMET SSSSDDSCDS FASDNFANTR LQSVREGCRT 60 
RSQCRHSGPL RVAMKFPARS TRGATNKKAE SRQPSENSVT DSNSDSEDES GMNFLEKRAL 12 0 
NIKQNKAMLA KLMSELESFP GSFRGRHPLP GSDSQSRRPR RRTFPGVASR RNPERRARPL 180 
TRSRSRILGS LDALPMEEEE EEDKYMLVRK RKTVDGYMNE DDLPRSRRSR SSVTLPHIIR 240 
PVEEITEEEL ENVCSNSREK IYNRSLGSTC HQCRQKTIDT KTNCRNPDCW GVRGQFCGPC 3 00 
LRNRYGEEVR DALLDPNWHC PPCRGICNCS FCRQRDGRCA TGVLVYLAKY HGFGNVHAYL 360 
KSLKQEFEMQ A 



60 



Seq ID NO: 448 DNA sequence 
Nucleic Acid Accession #: NM_019894 
Coding sequence: 1..1314 

1 11 21 31 41 51 

I 1 1 I I I 

ATGTTACAGG ATCCTGACAG TGATCAACCT CTGAACAGCC TCGATGT CAA ACCCCTGCGC 

AAACCCCGTA TCCCCATGGA GACCTTCAGA AAGGTGGGGA TCCCCATCAT CATAGCACTA 12 0 

CTGAGCCTGG CGAGT AT CAT CATTGTGGTT GTCCTCATCA AGGTGATTCT GGATAAATAC 180 

TACTTCCTCT GCGGGCAGCC TCTCCACTTC AT CCCGAGGA AGCAGCTGTG TGACGGAGAG 240 

CTGGACTGTC CCTTGGGGGA GGACGAGGAG CACTGTGTCA AGAGCTTCCC CGAAGGGCCT 300 

GCAGTGGCAG TCCGCCTCTC CAAGGACCGA TCCACACTGC AGGTGCTGGA CTCGGCCACA 360 

GGGAACTGGT TCTCTGCCTG TTTCGACAAC TTCACAGAAG CTCTCGCTGA GACAGCCTGT 420 

AGGCAGATGG GCTACAGCAG CAAACCCACT TTCAGAGCTG TGGAGATTGG CCCAGACCAG 480 

GATCTGGATG TTGTTGAAAT CACAGAAAAC AGCCAGGAGC TTCGCATGCG GAACTCAAGT 540 

GGGCCCTGTC TCTCAGGCTC CCTGGTCTCC CTGCACTGTC TTGCCTGTGG GAAGAGCCTG 600 

AAGACCCCCC GTGTGGTGGG TGGGGAGGAG GCCTCTGTGG ATTCTTGGCC TTGGCAGGTC 660 

AGCATCCAGT ACGACAAACA GCACGTCTGT GGAGGGAGCA TCCTGGACCC CCACTGGGTC 720 

CTCACGGCAG CCCACTGCTT CAGGAAACAT ACCGATGTGT TCAACTGGAA GGTGCGGGCA 780 

GGCTCAGACA AACTGGGCAG CTTCCCATCC CTGGCTGTGG CCAAGATCAT CAT CAT T GAA 840 

TTCAACCCCA TGTACCCCAA AGACAATGAC ATCGCCCTCA TGAAGCTGCA GTTCCCACTC 900 

ACTTTCTCAG GCACAGT CAG GCCCATCTGT CTGCCCTTCT TTGATGAGGA GCTCACTCCA 960 

GCCACCCCAC TCTGG AT CAT TGGATGGGGC TTTACGAAGC AGAATGGAGG GAAGATGTCT 1020 

GACATACTGC TGCAGGCGTC AGT CCAGGTC ATTGACAGCA CACGGTGCAA TGCAGACGAT 1080 

GCGTACCAGG GGGAAGTCAC CGAGAAGATG ATGTGTGCAG GCATCCCGGA AGGGGGTGTG 1140 

GACACCTGCC AGGGTGACAG TGGTGGGCCC CTGATGTACC AATCTGACCA GTGGCATGTG 1200 

GTGGGCATCG TTAGCTGGGG CTATGGCTGC GGGGGCCCGA GCACCCCAGG AGTATACACC 12 60 
AAGGTCT CAG CCTATCTCAA CTGGATCTAC AATGTCTGGA AGGCTGAGCT GTAA 

Seq ID NO: 449 Protein sequence 
Protein Accession #: NP_063 947.1 

1 11 21 31 41 51 

I I 1 i I I 

MLQDPDSDQP LNSLDVKPLR KPRIPMETFR KVGIPIIIAL LSLASIIIW VLIKVILDKY 60 

YFLCGQPLHF IPRKQLCDGE LDCPLGEDEE HCVKSFPEGP AVAVRLSKDR STLQVLDSAT 120 

GNWFSACFDN FTEALAETAC RQMGYSSKPT FRAVEIGPDQ DLDWEITEN SQELRMRNSS 180 

GPCLSGSLVS LHCLACGKSL KTPRWGGEE ASVDSWPWQV SIQYDKQHVC GGSILDPHWV 240 

LTAAHCFRKH TDVFNWKVRA GSDKLGSFPS LAVAKIIIIE FNPMYPKDND IALMKLQFPL 300 

TFSGTVRPIC LPFFDEELTP ATPLWIIGWG FTKQNGGKMS DILLQASVQV IDSTRCNADD 360 

AYQGEVTEKM MCAGIPEGGV DTCQGDSGGP LMYQSDQWHV VGIVSWGYGC GGPSTPGVYT 420 



Seq ID NO: 450 DNA sequence 

Nucleic Acid Accession #: XM_051860.2 

Coding sequence: 52.. 3042 

1 11 21 31 41 51 

I I I I I I 

GCTCACCCAG GAAAAATATG CAATCGTCCC ATTGATATAC AGGCCACTAC AATGGATGGA 60 
GTTAACCTCA GCACCGAGGT TGTCTACAAA AAAGGCCAGG ATTATAGGTT TGCTTGCTAC 120 
GACCGGGGCA GAGCCTGCCG GAGCTACCGT GTACGGTTCC TCTGTGGGAA GCCTGTGAGG 180 
CCCAAACTCA CAGTCACCAT TGACACCAAT GTGAACAGCA CCATTCTGAA CTTGGAGGAT 240 
AATGTACAGT CATGGAAACC TGGAGATACC CTGGTCATTG CCAGTACTGA TTACTCCATG 300 
TACCAGGCAG AAGAGTTCCA GGTGCTTCCC TGCAGATCCT GCGCCCCCAA CCAGGTCAAA 360 
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GTGGCAGGGA AACCAATGTA CCTGCACATC GGGGAGGAGA TAGACGGCGT GGACATGCGG 420 

GCGGAGGTTG GGCTTCTGAG CCGGAACATC ATAGTGATGG GGGAGATGGA GGACAAATGC 480 

TACCCCTACA GAAACCACAT CTGCAATTTC TTTGACTTCG ATACCTTTGG GGGCCACATC 540 

AAGTTTGCTC TGGGATTTAA GGCAGCACAC TTGGAGGGCA CGGAGCTGAA GCATATGGGA 600 

CAGCAGCTGG TGGGT CAGT A CCCGATTCAC TTCCACCTGG CCGGTGATGT AGACGAAAGG 660 

GGAGGTTATG ACCCACCCAC ATACATCAGG GACCTCTCCA TCCATCATAC ATTCTCTCGC 720 

TGCGTCACAG TCCATGGCTC CAATGGCTTG TTGATCAAGG ACGTTGTGGG CTATAACTCT 780 

TTGGGCCACT GCTTCTTCAC GGAAGATGGG CCGGAGGAAC GCAACACTTT TGACCACTGT 840 

CTTGGCCTCC TTGTCAAGTC TGGAACCCTC CTCCCCTCGG ACCGTGACAG CAAGATGTGC 900 

AAGATGATCA CAGGAGACTC CTACCCAGGG TACATCCCCA AGCCCAGGCA AGACTGCAAT 960 

GCTGTGTCCA CCTTCTGGAT GGCCAATCCC AACAACAACC TCAT CAACTG TGCCGCTGCA 1020 

GGATCTGAGG AAACTGGATT TTGGTTTATT TTTCACCACG TACCAACGGG CCCCTCCGTG 1080 

GGAATGTACT CCCCAGGTTA TTCAGAGCAC ATTCCACTGG GAAAATTCTA TAACAACCGA 1140 

GCACATTCCA ACTACCGGGC TGGCATGATC ATAGACAACG GAGTCAAAAC CACCGAGGCC 1200 

TCTGCCAAGG ACAAGCGGCC GTTCCTCTCA ATCATCTCTG CCAGATACAG CCCTCACCAG 1260 

GACGCCGACC CGCTGAAGCC CCGGGAGCCG GCCATCATCA GACACTTCAT TGCCTACAAG 1320 

AACCAGGACC ACGGGGCCTG GCTGCGCGGC GGGGATGTGT GGCTGGACAG CTGCCGGTTT 13 80 

GCTGACAATG GCATTGGCCT GACCCTGGCC AGTGGTGGAA CCTTCCCGTA TGACGACGGC 1440 

TCCAAGCAAG AGATAAAGAA CAGCTTGTTT GTTGGCGAGA GTGGCAACGT GGGGACGGAA 1500 

ATGATGGACA ATAGGATCTG GGGCCCTGGC GGCTTGGACC ATAGCGGAAG GACCCTCCCT 1560 

ATAGGCCAGA ATTTTCCAAT TAGAGGAATT CAGTTATATG ATGGCCCCAT CAACATCCAA 162 0 

AACTGCACTT TCCGAAAGTT TGTGGCCCTG GAGGGCCGGC ACACCAGCGC CCTGGCCTTC 1680 

CGCCTGAATA ATGCCTGGCA GAGCTGCCCC CATAACAACG TGACCGG CAT TGCCTTTGAG 1740 

GACGTTCCGA TTACTTCCAG AGTGTTCTTC GGAGAGCCTG GGCCCTGGTT CAACCAGCTG 1800 

GACATGGATG GGGATAAGAC ATCTGTGTTC CATGACGTCG ACGGCTCCGT GTCCGAGTAC 1860 

CCTGGCTCCT ACCTCACGAA GAATGACAAC TGGCTGGTCC GGCACCCAGA CTGCAT CAAT 192 0 

GTTCCCGACT GGAGAGGGGC CATTTGCAGT GGGTGCTATG CACAGATGTA CATTCAAGCC 1980 

TACAAGACCA GTAACCTGCG AATGAAGATC ATCAAGAATG ACTTCCCCAG CCACCCTCTT 2040 

TACCTGGAGG GGGCGCTCAC CAGGAGCACC CATTACCAGC AATACCAACC GGTTGTCACC 210 0 

CTGCAGAAGG GCTACACCAT CCACTGGGAC CAGACGGCCC CCGCCGAACT CGCCATCTGG 2160 

CTCATCAACT TCAACAAGGG CGACTGGATC CGAGTGGGGC TCTGCTACCC GCGAGGCACC 2220 

ACATTCTCCA TCCTCTCGGA TGTTCACAAT CGCCTGCTGA AGCAAACGTC CAAGACGGGC 2280 

GTCTTCGTGA GGACCTTGCA GATGGACAAA GTGGAGCAGA GCTACCCTGG CAGGAGCCAC 2340 

TACTACTGGG ACGAGGACTC AGGGCTGTTG TTCCTGAAGC TGAAAGCTCA GAACGAGAGA 2400 

GAGAAGTTTG CTTTCTGCTC CATGAAAGGC TGTGAGAGGA TAAAGATTAA AGCTCTGATT 2460 

CCAAAGAACG CAGGCGT CAG TGACTGCACA GCCACAGCTT ACCCCAAGTT CACCGAGAGG 2520 

GCTGTCGTAG ACGTGCCGAT GCCCAAGAAG CTCTTTGGTT CTCAGCTGAA AACAAAGGAC 2580 

CATTTCTTGG AGGTGAAGAT GGAGAGTTCC AAGCAGCACT TCTTCCACCT CTGGAACGAC 2 640 

TTCGCTTACA TTGAAGTGGA TGGGAAGAAG TACCCCAGTT CGGAGGATGG CATCCAGGTG 270 0 

GTGGTGATTG ACGGGAACCA AGGGCGCGTG GTGAGCCACA CGAGCTT CAG GAACTCCATT 2760 

CTGCAAGGCA TACCATGGCA GCTTTTCAAC TATGTGGCGA CCATCCCTGA CAATTCCATA 2 82 0 

GTGCTTATGG CATCAAAGGG AAGATACGTC TCCAGAGGCC CATGGACCAG AGTGCTGGAA 2880 

AAGCTTGGGG CAGACAGGGG TCTCAAGTTG AAAGAGCAAA TGGCATTCGT TGGCTTCAAA 2 940 

GGCAGCTTCC GGCCCATCTG GGTGACACTG GACACTGAGG AT CACAAAGC CAAAATCTTC 3 000 

CAAGTTGTGC CCATCCCTGT GGTGAAGAAG AAGAAGTTGT GAGGACAGCT GCCGCCCGGT 3060 

GCCACCTCGT GGTAGACTAT GACGGTGACT CTTGGCAGCA GACCAGTGGG GGATGGCTGG 3120 

GTCCCCCAGC CCCTGCCAGC AGCTGCCTGG GAAGGCCGTG TTTCAGCCCT GATGGGCCAA 3180 

GGGAAGGCTA TCAGAGACCC TGGTGCTGCC ACCTGCCCCT ACT CAAGTGT CTACCTGGAG 3240 

CCCCTGGGGC GGTGCTGGCC AATGCTGGAA ACATTCACTT TCCTGCAGCC TCTTGGGTGC 3300 

TTCTCTCCTA TCTGTGCCTC TTCAGTGGGG GTTTGGGGAC CATATCAGGA GACCTGGGTT 33 60 

GTGCTGACAG CAAAGATCCA CTTTGGCAGG AGCCCTGACC CAGCTAGGAG GTAGT CTGGA 3 420 

GGGCTGGTCA TTCACAGATC CCCATGGTCT TCAGCAGACA AGTGAGGGTG GTAAATGTAG 3 480 

GAGAAAGAGC CTTGGCCTTA AGGAAATCTT TACTCCTGTA AGCAAGAGCC AACCTCACAG 3 540 

GATTAGGAGC TGGGGTAGAA CTGGCTATCC TTGGGGAAGA GGCAAGCCCT GCCTCTGGCC 3 600 

GTGTCCACCT TTCAGGAGAC TTTGAGTGGC AGGTTTGGAC TTGGACTAGA TGACTCTCAA 3 660 

AGGCCCTTTT AGTTCTGAGA TTCCAGAAAT CTGCTGCATT TCACATGGTA CCTGGAACCC 3 720 

AACAGTTCAT GGATATCCAC TGATATCCAT GATGCTGGGT GCCCCAGCGC ACACGGGATG 3780 

GAGAGGTGAG AACTAATGCC TAGCTTGAGG GGTCTGCAGT CCAGTAGGGC AGGCAGTCAG 3 840 

GTCCATGTGC ACTGCAATGC CAGGTGGAGA AAT CACAG AG AGGTAAAATG GAGGCCAGTG 3 900 

CCATTTCAGA GGGGAGGCTC AGGAAGGCTT CTTGCTTACA GGAATGAAGG CTGGGGGCAT 3 960 

TTTGCTGGGG GGAGATGAGG CAGCCTCTGG AATGGCTCAG GGATTCAGCC CTCCCTGCCG 4020 

CTGCCTGCTG AAGCTGGTGA CTACGGGGTC GCCCTTTGCT CACGTCTCTC TGGCCCACTC 4 0 80 

ATGATGGAGA AGTGTGGTCA GAGGGGAGCA ATGGGCTTTG CTGCTTATGA GCACAGAGGA 4140 

ATTCAGTCCC CAGGCAGCCC TGCCTCTGAC TCCAAGAGGG TGAAGTCCAC AGAAGTGAGC 4200 

TCCTGCCTTA GGGCCTCATT TGCTCTTCAT CCAGGGAACT GAGCACAGGG GGCCTCCAGG 42 60 

AGACCCTAGA TGTGCTCGTA CTCCCTCGGC CTGGGATTTC AGAGCTGGAA ATATAGAAAA 432 0 

TATCTAGCCC AAAGCCTTCA TTTTAACAGA TGGGGAAAGT GAGCCCCCAA GATGGGAAAG 43 80 

AACCACACAG CTAAGGGAGG GCCTGGGGAG CCCCACCCTA GCCCTTGCTG CCACACCACA 4440 

TTGCCTCAAC AACCGGCCCC AGAGTGCCCA GGCACTCCTG AGGTAGCTTC TGGAAATGGG 4500 

GACAAGTCCC CTCGAAGGAA AGGAAATGAC TAGAGTAGAA TGACAGCTAG CAG AT C T CT T 4560 

CCCTCCTGCT CCCAGCGCAC ACAAACCCGC CCTCCCCTTG GTGTTGGCGG TCCCTGTGGC 462 0 

CTTCACTTTG TTCACTACCT GTCAGCCCAG CCTGGGTGCA CAGTAGCTGC AACTCCCCAT 4680 

TGGTGCTACC TGGCTCTCCT GTCTCTGCAG CTCTACAGGT GAGGCCCAGC AGAGGGAGTA 4740 

GGGCTCGCCA TGTTTCTGGT GAGCCAATTT GGCTGATCTT GGGTGTCTGA ACAGCTATTG 4800 

GGTCCACCCC AGTCCCTTTC AGCTGCTGCT TAATGCCCTG CTCTCTCCCT GGCCCACCTT 48 60 

ATAGAGAGCC CAAAGAGCTC CTGTAAGAGG GAGAACTCTA TCTGTGGTTT ATAATCTTGC 4 920 

ACGAGGCACC AGAGTCTCCC TGGGTCTTGT GATGAACTAC ATTTATCCCC TTTCCTGCCC 4 980 

CAACCACAAA CTCTTTCCTT CAAAGAGGGC CTGCCTGGCT CCCTCCACCC AACTGCACCC 5 040 

ATGAGACTCG GTCCAAGAGT CCATTCCCCA GGTGGGAGCC AACTGTCAGG GAGGTCTTTC 5100 

CCACCAAACA TCTTTCAGCT GCTGGGAGGT GACCATAGGG CTCTGCTTTT AAAGATATGG 5160 

CTGCTTCAAA GGCCAGAGTC ACAGGAAGGA CTTCTTCCAG GGAGATTAGT GGTGATGGAG 522 0 

AGGAGAGTTA AAATGACCTC ATGTCCTTCT TGT CCACGGT TTTGTTGAGT TTTCACTCTT 52 80 

CTAATGCAAG GGTCTCACAC TGTGAACCAC TTAGGATGTG ATCACTTTCA GGTGGCCAGG 5340 

AATGTTGAAT GTCTTTGGCT CAGTTCATTT AAAAAAGATA TCTATTTGAA AGTTCTCAGA 5400 

GTTGTACATA TGTTTCACAG TACAGGATCT GTACATAAAA GTTTCTTTCC TAAACCATTC 5460 

ACCAAGAGCC AATATCTAGG CATTTTCTTG G TAG CAC AAA TTTTCTTATT GCTTAGAAAA 5520 

TTGTCCTCCT TGTTATTTCT GTTTGTAAGA CTTAAGTGAG TTAGGTCTTT AAGGAAAGCA 5580 
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ACGCTCCTCT GAAATGCTTG TCTTTTTTCT GTTGCCGAAA TAGCTGGTCC TTTTTCGGGA 5640 

GTTAGATGTA TAGAGTGTTT GTATGTAAAC ATTTCTTGTA GGCATCACCA TGAACAAAGA 57 00 

TATATTTTCT ATTTATTTAT TATATGTGCA CTT CAAGAAG TCACTGTCAG AGAAATAAAG 5760 

AATTGTCTTA AATGTCAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAA 

Seq ID NO: 451 Protein sequence 
Protein Accession #: XP_051860.2 

1 11 21 31 41 51 

I ] I I I 1 

MDGVNLSTEV VYKKGQDYRF ACYDRGRACR SYRVRFLCGK PVRPKI/IVTI DTNVNSTIIjN 60 

LEDNVQSWKP GDTLVIASTD YSMYQAEEFQ VLPCRSCAPN QVKVAGKPMY LHIGEEIDGV 120 

DMRAEVGLLS RNIIVMGEME DKCYPYRNHI CNFFDFDTFG GH I KFALGFK AAHLEGTELK 180 

HMGQQLVGQY PIHFHL.AGDV DERGGYDPPT YIRDLSIHHT FSRCVTVHGS NGLLIKDWG 240 

YNSLGHCFFT EDGPEERNTF DHCLGLLVKS GTLLPSDRDS KMCKMITGDS YPGYIPKPRQ 300 

DCNAVSTFWM ANPNNNLINC AAAGSEETGF WFIFHHVPTG PSVGMYSPGY SEHIPLGKFY 3 60 

NNRAHSNYRA GMI IDNGVKT TEASAKDKRP FLSIISARYS PHQDADPLKP REPAIIRHFI 420 

AYKNQDHGAW LRGGDVWLDS CRFADNGIGL TLASGGTFPY DDGSKQEIKN SLFVGESGNV 480 

GTEMMDNRIW GPGGL.DHSGR TLPIGQNFPI RGIQJbYDGPI NIQNCTFRKF V ALEGRHT S A 540 

LAFRLNNAWQ SCPHNNVTGI AFEDVPITSR VFFGEPGPWF NQLDMDGDKT SVFHDVDGSV 600 

SEYPGSYLTK NDNWLVRHPD CINVPDWRGA I CSGC YAQMY IQAYKTSNLR MKIIKNDFPS 660 

HPLYLEGALT RSTHYQQYQP WTLQKGYTI HWDQTAPAEL AIWLINFNKG DWIRVGLCYP 720 

RGTTFSILSD VHNRDLKQTS KTGVFVRTLQ MDKVEQSYPG RSHYYWDEDS GLLFLKLKAQ 780 

NEREKFAFCS MKGCERIKIK ALIPKNAGVS DCTATAYPKF TERAWDVPM PKKLFGSQLK 84 0 

TKDHFLEVKM ESSKQHFFHL WNDFAYIEVD GKKYPSSEDG IQVWIDGNQ GRWSHTSFR 900 

NSILQGIPWQ LFNYVAT I PD NSIVLMASKG RYVSRGPWTR VLEKLGADRG LKLKEQMAFV 960 
GFKGSFRPIW VTLDTEDHKA KIFQWPIPV VKKKKL 

Seq ID NO: 452 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 261.. 2861 

1 11 21 31 41 51 

I I 1 I I 1 

GAGCTAGCGC TCAAGCAGAG CCCAGCGCGG TGCTATCGGA CAGAGCCTGG CGAGCGCAAG 60 

CGGCGCGGGG AGCCAGCGGG GCTGAGCGCG GCCAGGGTCT GAACCCAGAT TTCCCAGACT 120 

AGCTACCACT CCGCTTGCCC ACGCCCCGGG AGCTCGCGGC GCCTGGCGGT CAGCGACCAG 180 

ACGTCCGGGG CCGCTGCGCT CCTGGCCCGC GAGGCGTGAC ACTGTCTCGG CTACAGACCC 240 

AGAGGGAGCA CACTGCCAGG ATGGGAGCTG CTGGGAGGCA GGACTTCCTC TTCAAGGCCA 300 

TGCTGACCAT CAGCTGGCTC ACTCTGACCT GCTTCCCTGG GGCCACATCC ACAGTGGCTG 360 

CTGGGTGCCC TGACCAGAGC CCTGAGTTGC AACCCTGGAA CCCTGGCCAT GACCAAGACC 420 

ACCATGTGCA TATCGGCCAG GGCAAGACAC TGCTGCTCAC CTCTTCTGCC ACGGTCTATT 480 

CCATCCACAT CTCAGAGGGA GGCAAGCTGG TCATTAAAGA CCACGACGAG CCGATTGTTT 540 

TGCGAACCCG GCACATCCTG ATTGACAACG GAGGAGAGCT GCATGCTGGG AGTGCCCTCT 600 

GCCCTTTCCA GGGCAATTTC ACCAT CATTT TGTATGGAAG GGCTGATGAA GGTATTCAGC 660 

CGGATCCTTA CTATGGTCTG AAGTACATTG GGGTTGGTAA AGGAGG CGCT CTTGAGTTGC 720 

ATGGACAGAA AAAGCTCTCC TGGACATTTC TGAACAAGAC CCTTCACCCA GGTGGCATGG 780 

CAGAAGGAGG CTATTTTTTT GAAAGGAGCT GGGGCCACCG TGGAGTTATT GTTCATGTCA 84 0 

TCGACCCCAA ATCAGGCACA GTCATCCATT CTGACCGGTT TGACACCTAT AGATCCAAGA 900 

AAGAGAGTGA ACGTCTGGTC CAGTATTTGA ACGCGGTGCC CGATGGCAGG ATCCTTTCTG 960 

TTGCAGTGAA TGATGAAGGT TCTCGAAATC TGGAT GACAT GGCCAGGAAG GCGATGACCA 1020 

AATTGGGAAG CAAACACTTC CTGCACCTTG GATTTAGACA CCCTTGGAGT TTTCTAACTG 1080 

TGAAAGGAAA TCCATCATCT TCAGTGGAAG ACCATATTGA ATATCATGGA CATCGAGGCT 1140 

CTGCTGCTGC CCGGGTATTC AAATTGTTCC AGACAGAGCA TGGCGAATAT TTCAATGTTT 120 0 

CTTTGTCCAG TGAGTGGGTT CAAGACGTGG AGTGGACGGA GTGGTTCGAT CATGATAAAG 1260 

TAT CTCAGAC TAAAGGTGGG GAG AAAAT TT CAGACCTCTG GAAAGCTCAC CCAGGAAAAA 1320 

TATGCAATCG TCCCATTGAT ATACAGGCCA CTACAATGGA TGGAGTTAAC CTCAGCACCG 13 80 

AGGTTGTCTA CAAAAAAGGC CAGGATTATA GGTTTGCTTG CTACGACCGG GGCAGAGCCT 1440 

GCCGGAGCTA CCGTGTACGG TTCCTCTGTG GGAAGCCTGT GAGGCCCAAA CTCACAGTCA 1500 

CCATTGACAC CAATGTGAAC AGCACCATTC TGAACTTGGA GGATAATGTA CAGTCATGGA 1560 

AACCTGGAGA TACCCTGGTC ATTGCCAGTA CTGATTACTC CATGTACCAG GCAGAAGAGT 162 0 

TCCAGGTGCT TCCCTGCAGA TCCTGCGCCC CCAACCAGGT CAAAGTGGCA GGGAAACCAA 1680 

TGTACCTGCA CATCGGGGAG GAGATAGACG GCGTGGACAT GCGGGCGGAG GTTGGGCTTC 174 0 

TGAGCCGGAA CATCATAGTG ATGGGGGAGA TGGAGGACAA ATGCTACCCC T AC AG AAAC C 1800 

ACATCTGCAA TTTCTTTGAC TTCGATACCT TTGGGGGCCA CATCAAGTTT GCTCTGGGAT 1860 

TTAAGGCAGC ACACTTGGAG GGCACGGAGC TGAAGCATAT GGGACAGCAG CTGGTGGGTC 1920 

AGTACCCGAT TCACTTCCAC CTGGCCGGTG ATGTAGACGA AAGGGGAGGT TATGACCCAC 1980 

CCA CAT A CAT CAGGGACCTC TCCATCCATC ATACATTCTC TCGCTGCGTC ACAGTCCATG 2 04 0 

GCTCCAATGG CTTGTTGATC AAGGACGTTG TGGGCTATAA CTCTTTGGGC CACTGCTTCT 2100 

TCACGGAAGA TGGGCCGGAG GAACGCAACA CTTTTGACCA CTGTCTTGGC CTCCTTGTCA 2160 

AGTCTGGAAC CCTCCTCCCC TCGGACCGTG ACAGCAAGAT GTGCAAGATG ATCACAGAGG 2220 

ACTCCTACCC AGGGTACATC CCCAAGCCCA GGCAAGACTG CAATGCTGTG TCCACCTTCT 2280 

GGATGGCCAA TCCCAACAAC AACCTCATCA ACTGTGCCGC TGCAGGATCT GAGGAAACTG 2340 

GATTTTGGTT TATTTTTCAC CACGTACCAA CGGGCCCCTC CGTGGGAATG TACTCCCCAG 2400 

GTTATTCAGA GCACATTCCA C TGGG AAAAT TCTATAACAA CCGAGCACAT TCCAACTACC 2460 

GGGCTGGCAT GATCATAGAC AACGGAGTCA AAACCACCGA GGCCTCTGCC AAGGACAAGC 2520 

GGCCGTTCCT CTCAATCATC TCTGCCAGAT ACAGCCCTCA CCAGGACGCC GACCCGCTGA 2580 

AGCCCCGGGA GCCGGCCATC ATCAGACACT TCATTGCCTA CAAGAACCAG GACCACGGGG 2 640 

CCTGGCTGCG CGGCGGGGAT GTGTGGCTGG ACAGCTGCCA TTTCAGAGGG GAGGCTCAGG 2700 

AAGGCTTCTT GCTTACAGGA ATGAAGGCTG GGGGCATTTT GCTGGGGGGA GATGAGGCAG 2760 

CCTCTGGAAT GGCTCAGGGA TTCAGCCCTC CCTGCCGCTG CCTGCTGAAG CTGGTGACTA 2 820 

CGGGGTCGCC CTTTGCTCAC GTCTCTCTGG CCCACTCATG ATGGAGAAGT GTGGTCAGAG 2880 

GGGAGCAATG GGCTTTGCTG CTTATGAGCA CAGAGGAATT CAGTCCCCAG GCAGCCCTGC 2940 

CTCTGACTCC AAGAGGGTGA AGTCCACAGA AGTGAGCTCC TGCCTTAGGG CCTCATTTGC 3000 

TCTTCATCCA GGGAACTGAG CACAGGGGGC CTCCAGGAGA CCCTAGATGT GCTCGTACTC 30 60 

CCTCGGCCTG GGATTTCAGA GCTGGAAATA TAGAAAATAT CTAGCCCAAA GCCTTCATTT 312 0 
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TAACAGATGG GGAAAGTGAG CCCCCAAGAT GGGAAAGAAC CACACAGCTA AGGGAGGGCC 318 0 

TGGGGAGCCC CACCCTAGCC CTTGCTGCCA CACCACATTG CCTCAACAAC CGGCCCCAGA 3240 

GTGCCCAGGC ACTCCTGAGG TAGCTTCTGG AAATGGGGAC AAGTCCCCTC GAAGGAAAGG 33 00 

AAATGACTAG AGTAGAATGA CAGCTAGCAG ATCTCTTCCC TCCTGCTCCC AGCGCACACA 33 60 

AACCCGCCCT CCCCTTGGTG TTGGCGGTCC CTGTGGCCTT CACTTTGTTC ACTACCTGTC 342 0 

AGCCCAGCCT GGGTGCACAG TAGCTGCAAC TCCCCATTGG TGCTACCTGG CTCTCCTGTC 3 480 

TCTGCAGCTC TACAGGTGAG GCCCAGCAGA GGGAGTAGGG CTCGCCATGT TTCTGGTGAG 3540 

CCAATTTGGC TGATCTTGGG TGTCTGAACA GCTATTGGGT CCACCCCAGT CCCTTTCAGC 3600 

TGCTGCTTAA TGCCCTGCTC TCTCCCTGGC CCACCTTATA GAGAGCCCAA AGAGCTCCTG 3 660 

TAAGAGGGAG AACTCTATCT GTGGTTTATA ATCTTGCACG AGGCACCAGA GTCTCCCTGG 3 72 0 

GTCTTGTGAT GAACTACATT TATCCCCTTT CCTGCCCCAA CCACAAACTC TTTCCTTCAA 3780 

AGAGGGCCTG CCTGGCTCCC TCCACCCAAC TGCACCCATG AGACTCGGTC CAAGAGTCCA 3 840 

TTCCCCAGGT GGGAGCCAAC TGTCAGGGAG GTCTTTCCCA CCAAACATCT TTCAGCTGCT 3 90 0 

GGGAGGTGAC CATAGGGCTC TGCTTTTAAA GATATGGCTG CTTCAAAGGC CAGAGTCACA 3 960 

GGAAGGACTT CTT CCAGGG A GATTAGTGGT GATGGAGAGG AGAGTTAAAA TGACCTCATG 4020 

TCCTTCTTGT CCACGGTTTT GTTGAGTTTT CACTCTTCTA ATGCAAGGGT CTCACACTGT 4080 

GAACCACTTA GGATGTGATC ACTTTCAGGT GGCCAGGAAT GTTGAATGTC TTTGGCTCAG 4140 

TTCATTTAAA AAAGATATCT ATTTGAAAGT TCTCAGAGTT GTACATATGT TT CACAGT AC 42 0 0 

AGGATCTGTA CATAAAAGTT TCTTTCCTAA ACCATTCACC AAGAGCCAAT AT CTAGGCAT 42 60 

TTTCTTGGTA GCACAAATTT TCTTATTGCT TAGAAAATTG TCCTCCTTGT TATTTCTGTT 432 0 

TGTAAGACTT AAGTGAGTTA GGTCTTTAAG GAAAGCAACG CTCCTCTGAA ATGCTTGTCT 43 80 

TTTTTCTGTT GCCGAAATAG CTGGTCCTTT TTCGGGAGTT AGATGTATAG AGTGTTTGTA 4440 

TGTAAACATT TCTTGTAGGC ATCACCATGA ACAAAGATAT ATTTTCTATT TATTTATTAT 450 0 

ATGTGCACTT CAAGAAGTCA CTGTCAGAGA AATAAAGAAT TGTCTTAAAT GTCATGATTG 4560 

GAGATGTCCT TTGCATTGCT TGGAAGGGGT GTACCTAGAG CCAAGGAAAT TGGCTCTGGT 462 0 

TTGGAAAAAT TTTGCTGTTA TTATAGTAAA CATACAAAGG ATGTCAAAAA AAAAAAAAAA 4 680 
AAAAAAAAAA AAAAAAAAAA AA 
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MGAAGRQDFL FKAMLTISWL TLTCFPGATS TVAAGCPDQS PELQPWNPGH DQDHHVHIGQ 60 

GKTLLLTSSA TVYSIHISEG GKLVIKDHDE PIVLRTRHIL IDNGGELHAG SALCPFQGNF 120 

TIILYGRADE GIQPDPYYGL KYIGVGKGGA LELHGQKKLS WTFLNKTLHP GGMAEGGYFF 180 

ERSWGHRGVI VHVIDPKSGT VIHSDRFDTY RSKKESERLV QYLNAVPDGR ILSVAVNDEG 240 

SRNLDDMARK AMTKLGSKHF LHLGFRHPWS FLTVKGNPSS SVEDHIEYHG HRGSAAARVF 3 00 

KLFQTEHGEY FNVSLSSEWV QDVEWTEWFD HDKVSQTKGG EKISDLVJKAH PGKICNRPID 3 60 

IQATTMDGVN LSTEWYKKG QDYRFACYDR GRACRSYRVR FLCGKPVRPK LTVTIDTNVN 42 0 

STILNLEDNV QSWKPGDTLV IASTDYSMYQ AEEFQVLPCR SCAPNQVKVA GKPMYLHIGE 480 

EIDGVDMRAE VGLLSRNIIV MGEMEDKCYP YRNHICNFFD FDTFGGHIKF ALGFKAAHLE 540 

GTELKHMGQQ LVGQYPIHFH LAGDVDERGG YDPPTYIRDL SIHHTFSRCV TVHGSNGLLI 600 

KDWGYNSLG HCFFTEDGPE ERNTFDHCLG LLVKSGTLLP SDRDSKMCKM ITEDSYPGYI 660 

PKPRQDCNAV STFWMANPNN NLINCAAAGS EETGFWFIFH HVPTGPSVGM YSPGYSEHIP 72 0 

LGKFYNNRAH SNYRAGMIID NGVKTTEASA KDKRPFLSII SARYSPHQDA DPLKPREPAI 780 

IRHFIAYKNQ DHGAWLRGGD VWLDSCHFRG EAQEGFLLTG MKAGGIIjLGG DEAASGMAQG 840 
FSPPCRCLLK LVTTGSPFAH VSLAHS 

Seq ID NO: 454 DNA sequence 
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1 11 21 31 41 51 

I 1 I I I I 

CGACTCCTTA GAGCATGGCA TGGCTCAGAG GTGCTGGTAA AACTGATGGG GGTTTTTGCT 60 

GTCCCTCCCC TCAGCGCCGA CACCATGTGG ATCCAGGTTC GGACCATGGA CGGGAGGCAG 120 

ACCCACACGG TGGACTCGCT GTCCAGGCTG ACCAAGGTGG AGGAGCTGAG GCGGAAGATC 180 

CAGGAGCTGT TCCACGTGGA GCCAGGCCTG CAGAGGCTGT TCTACAGGGG CAAACAGATG 240 

GAGGACGGCC ATACCCTCTT CGACTACGAG GTCCGCCTGA ATGACACCAT CCAGCTCCTG 3 00 

GTCCGCCAGA GCCTCGTGCT CCCCCACAGC ACCAAGGAGC GGGACTCCGA GCTCTCCGAC 3 60 

ACCGACTCCG GCTGCTGCCT GGGCCAGAGT GAGTCAGACA AGTCCTCCAC CCACGGCGAG 420 

GCGGCCGCCG AGACTGACAG CAGGCCAGCC GATGAGGACA TGTGGGATGA GACGGAATTG 480 

GGGCTGTACA AGGT CAATG A GTACGTCGAT GCTCGGGACA CGAACATGGG GGCGTGGTTT 54 0 

GAGGCGCAGG TGGTCAGGGT GACGCGGAAG GCCCCCTCCC GGGACGAGCC CTGCAGCTCC 600 

ACGTCCAGGC CGGCGCTGGA GGAGGACGTC ATTTACCACG TGAAATACGA CGACTACCCG 6 60 

GAGAACGGCG TGGTCCAGAT GAACT CCAGG GACGTCCGAG CGCGCGCCCG CACCATCATC 720 

AAGTGGCAGG ACCTGGAGGT GGGCCAGGTG GTCATGCTCA ACTACAACCC CGACAACCCC 780 

AAGGAGCGGG GCTTCTGGTA CGACGCGGAG ATCTCCAGGA AGCGCGAGAC CAGGACGGCG 840 

CGGGAACTCT ACGCCAACGT GGTGCTGGGG GATGATTCTC TGAACGACTG TCGGATCATC 900 

TTCGTGGACG AAGTCTTCAA GATTGAGCGG CCGGGTGAAG GGAGCCCCAT GGTTGACAAC 960 

CCCATGAGAC GGAAGAG CGG GCCGTCCTGC AAGCACTGCA AGGACGACGT GAAC AG ACT C 1020 

TGCCGGGTCT GCGCCTGCCA CCTGTGCGGG GGCCGGCAGG ACCCCGACAA GCAGCTCATG 10 80 

TGCGATGAGT GCGACATGGC CTTCCACATC TACTGCCTGG ACCCGCCCCT CAG CAGTGTT 1140 

CCCAGCGAGG ACGAGTGGTA CTGCCCTGAG TGCCGGAATG ATGCCAGCGA GGTGGTACTG 1200 

GCGGGAGAGC GGCTGAGAGA GAGCAAGAAG AAGGCGAAGA TGGCCTCGGC CACATCGTCC 1260 

TCACAGCGGG ACTGGGGCAA GGGCATGGCC TGTGTGGGCC GCACCAAGGA ATGTACCATC 1320 

GTCCCGTCCA ACCACTACGG ACCCATCCCG GGGATCCCCG TGGGCACCAT GTGGCGGTTC 13 80 

CGAGTCCAGG TCAGCGAGTC GGGTGT CCAT CGGCCCCACG TGGCTGGCAT ACACGGCCGG 1440 

AG CAACGACG GAGCGTACTC CCTAGTCCTG GCGGGGGGCT ATGAGGATGA CGTGGACCAT 150 0 

GGGAATTTTT TCACATACAC GGGTAGTGGT GGTCGAGATC TTTCCGGCAA CAAGAGGACC 1560 

GCGGAACAGT CTTGTGATCA GAAACT CACC AACACCAACA GGGCGCTGGC TCTCAACTGC 1620 

TTTGCTCCCA TCAATGACCA AGAAGGGGCC GAGGCCAAGG ACTGGCGGTC GGGGAAGCCG 1680 

GTCAGGGTGG TGCGCAATGT CAAGGGTGGC AAGAATAGCA AGTACGCCCC CGCTGAGGGC 1740 

AACCGCTACG ATGGCATCTA CAAGGTTGTG AAATACTGGC CCGAGAAGGG GAAGTCCGGG 1800 

TTTCTCGTGT GGCGCTACCT TCTGCGGAGG GACGATGATG AGCCTGGCCC TTGGACGAAG 1860 

GAGGGGAAGG ACCGGATCAA GAAGCTGGGG CTGACCATGC AGTATCCAGA AGGCTACCTG 192 0 
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GAAGCCCTGG CCAACCGAGA GCGAGAGAAG G AG AAC AG CA AGAGGGAGGA GGAGGAGCAG 1980 

CAGGAGGGGG GCTTCGCGTC CCCCAGGACG GGCAAGGGCA AGTGGAAGCG GAAGTCGGCA 2040 

GGAGGTGGCC CGAGCAGGGC CGGGTCCCCG CGCCGGACAT CCAAGAAAAC CAAGGTGGAG 210 0 

CCCTACAGTC TCACGGCCCA GCAGAGCAGC CTCATCAGAG AGGACAAGAG CAACGCCAAG 2160 

CTGTGGAATG AGGTCCTGGC GTCACTCAAG GACCGGCCGG CGAGCGGCAG CCCGTTCCAG 2220 

TTGTTCCTGA GTAAAGTGGA GGAGACGTTC CAGTGTATCT GCTGTCAGGA GCTGGTGTTC 2280 

CGGCCCATCA CGACCGTGTG CCAGCACAAC GTGTG CAAGG ACTGCCTGGA CAGATCCTTT 2 340 

CGGGCACAGG TGTTCAGCTG CCCTGCCTGC CGCTACGACC TGGGCCGCAG CTATGCCATG 2400 

CAGGTGAACC AGCCTCTGCA GACCGTCCTC AACCAGCTCT TCCCCGGCTA CGGCAATGGC 2460 

CGGTGAT CTC CAAGCACTTC TCGACAGGCG TTTTGCTGAA AACGTGTCGG AGGGCTCGTT 252 0 

CATCGGCACT GATTTTGTTC TTAGTGGGCT TAACTTAAAC AGGTAGTGTT TCCTCCGTTC 2580 

CCTAAAAAGG TTTGTCTTCC TTTTTTTTTA TTTTTATTTT TCAAAT CTAT ACATTTTCAG 2 640 

GAATTTATGT ATTCTGGCTA AAAGTTGGAC TTCTCAGTAT TGTGTTTAGT TCTTTGAAAA 2700 

CATAAAAG CC TGCAATTTCT CGACAAAACA ACACAAGATT TTTTAAAGAT GGAATCAGAA 2760 

ACTACGTGGT GTGGAGGCTG TTGATGTTTC TGGTGTCAAG TTCTCAGAAG TTGCTGCCAC 2820 

CAACTCTTTA AGAAGGCGAC AGGATCAGTC CTTCTCTAGG GTTCTGGCCC CCAAGGTCAG 2880 

AGCAAGCATC TTCCTGACAG CATTTTGTCA T CTAAAGT CC AGTGACATGG TTCCCCGTGG 2 940 

TGGCCCGTGG CAGCCCGTGG CATGGCGTGG CTCAGCTGTC TGTTGAAGTT GTTGCAAGGA 3000 

AAAGAGGAAA CATCTCGGGC CTAGTTCAAA CCTTTGCCTC AAAGCCATCC CCCACCAGAC 3060 

TGCTTAGCGT CTGAGATCCG CGTGAAAAGT CCTCTGCCCA CGAGAGCAGG GAGTTGGGGC 312 0 

CACGCAGAAA TGGCCTCAAG GGGACTCTGC TCCACGTGGG GCCAGGCGTG TGACTGACGC 318 0 

TGTCCGACGA AGGCGGCCAC GGACGGACGC CAGCACACGA AGTCACGTGC AAGTGCCTTT 3240 

GATTCGTTCC TTCTTTCTAA AGACGACAGT CTTTGTTGTT AGCACTGAAT TATTGAAAAT 3300 

GTCAACCAGA TTCTAGAAAC TGCGGTCATC CAGTTCTTCC TGACACCGGA TGGGTGCTTG 33 60 

GGAACCGTTT GAGCCTTATA GATCATTTAC ATTCAATTTT TTTAACTCAG CAAGTGAGAA 342 0 

CTTACAAGAG GGTTTTTTTT TAATTTTTTT TTCTCTTAAT GAACACATT T TCTAAATGAA 3480 

TTTTTTTTGT AGTTACTGTA TATGTACCAA GAAAGATATA ACGTTAGGGT TTGGTTGTTT 354 0 

TTGTTTTTGT ATTTTTTTTC TTTTGAAAGG GTTTGTTAAT TTTTCTAATT TTACCAAAGT 3600 

TTGCAGCCTA TACCTCAATA AAACAGGGAT ATTTTAAATC ACATACCTGC AGACAAACTG 3660 

GAGCAATGTT ATTTTTAAAG GGTTTTTTTC ACCTCCTTAT TCTTAGATTA TTAATGTATT 372 0 

AGGGAAGAAT GAGACAATTT TGTGTAGGCT TTTTCTAAAG TCCAGTACTT TGTCCAGATT 3780 
TTAGATTCTC AGAATAAATG TTTTTCACAG ATTGAAAAAA AAAAAAAA 
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MWIQVRTMDG RQTHTVDSLS RLTKVEELRR KIQELPHVEP GLQRLFYRGK QMEDGHTLFD 60 

YEVRLNDTIQ LLVRQSDVLP HSTKERDSEL SDTDSGCCLG QSESDKSSTH GEAAAETDSR 120 

PADEDMWDET ELGLYKVNEY VDARDTNMGA WFEAQWRVT RKAPSRDEPC SSTSRPALEE 180 

DV I YHVKYDD YPENGWQMN SRDVRARART IIKWQDLEVG QWMLNYNPD NPKERGFWYD 240 

AEISRKRETR TARELYANW LGDDSLNDCR IIFVDEVFKI ERPGEGSPMV DNPMRRKSGP 300 

SCKHCKDDVN RLCRVCACHL CGGRQDPDKQ LMCDECDMAF HIYCLDPPLS SVPSEDEWYC 3 60 

PECRNDASEV VLAGERLRES KKKAKMASAT SSSQRDWGKG MACVGRTKEC TIVPSNHYGP 420 

IPGIPVGTMW RFRVQVSESG VHRPHVAGIH GRSNDGAYSL VLAGGYEDDV DHGNFFTYTG 4 BO 

SGGRDLSGNK RTAEQSCDQK LTNTNRALAL NCFAPINDQE GAEAKDWRSG KPVRWRNVK 540 

GGKNSKYAPA EGNRYDGIYK WKYWPEKGK SGFLVWRYLL RRDDDEPGPW TKEGKDRIKK 60 0 

LGLTMQYPEG YLEAbANRER EKENSKREEE EQQEGGFASP RTGKGKWKRK SAGGGPSRAG 660 

SPRRTSKKTK VEPYSLTAQQ SSLIREDKSN AKLWNEVLAS LKDRPASGSP FQLFLSKVEE 720 
TFQCICCQEL VFRPITTVCQ HNVCKDCLDR SFRAQVFSCP ACRYDLGRSY AMQVNQPLQT 
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GGGGACTTCT TGAACTTGCA GGGAGAATAA CTTGCGCACC CCACTTTGCG CCGGTGCCTT 60 

TGCCCCAGCG GAGCCTGCTT CGCCATCTCC GAGCCCCACC GCCCCTCCAC TCCTCGGCCT 12 0 

TGCCCGACAC TGAGACGCTG TTCCCAGCGT GAAAAGAGAG ACTGCGCGGC CGGCACCCGG 180 

GAGAAGGAGG AGGCAAAGAA AAGGAACGGA CATTCGGTCC TTGCGCCAGG TCCTTTGACC 240 

AGAGTTTTTC CATGTGGACG CTCTTTCAAT GGACGTGT CC CCGCGTGCTT CTTAGACGGA 300 

CTGCGGTCTC CTAAAGGTCG ACCATGGTGG CCGGGACCCG CTGTCTTCTA GCGTTGCTGC 3 60 

TTCCCCAGGT CCTCCTGGGC GGCGCGGCTG GCCTCGTTCC GGAGCTGGGC CGCAGGAAGT 42 0 

TCGCGGCGGC GTCGTCGGGC CGCCCCTCAT CCCAGCCCTC TGACGAGGTC CTGAGCGAGT 480 

TCGAGTTGCG GCTGCTCAGC ATGTTCGGCC TGAAACAGAG ACCCACCCCC AGCAGGGACG 540 

CCGTGGTGCC CCCCTACATG CTAGACCTGT ATCGCAGGCA CTCAGGTCAG CCGGGCTCAC 600 

CCGCCCCAGA CCACCGGTTG GAGAGGGCAG CCAGCCGAGC CAACACTGTG CGCAGCTTCC 660 

ACCATGAAGA ATCTTTGGAA GAACTACCAG AAACGAGTGG GAAAACAACC CGGAGATTCT 72 0 

TCTTTAATTT AAGTTCTATC CCCACGGAGG AGTTTAT CAC CTCAGCAGAG CTTCAGGTTT 780 

TCCGAGAACA GATGCAAGAT GCTTTAGGAA ACAATAGCAG TTTCCATCAC CGAATTAATA 840 

TTTATGAAAT CATAAAACCT GCAACAG CCA ACTCGAAATT CCCCGTGACC AGACTTTTGG 900 

ACACCAGGTT GGTGAATCAG AATGCAAGCA GGTGGGAAAG TTTTGATGTC ACCCCCGCTG 960 

TGATGCGGTG GACTGCACAG GGACACGCCA ACCATGGATT CGTGGTGGAA GTGG CCCACT 1020 

TGGAGGAGAA ACAAGGTGTC TCCAAGAGAC ATGTTAGGAT AAGCAGGTCT TTGCACCAAG 10 80 

ATGAACACAG CTGGTCACAG ATAAGGCCAT TGCTAGTAAC TTTTGGCCAT GATGGAAAAG 1140 

GGCATCCTCT CCACAAAAGA GAAAAACGTC AAGCCAAACA CAAACAGCGG AAACGCCTTA 120 0 

AGTCCAGCTG TAAGAGACAC CCTTTGTACG TGGACTTCAG TGACGTGGGG TGGAATGACT 12 60 

GGATTGTGGC TCCCCCGGGG TATCACGCCT TTTACTGCCA CGGAGAATGC CCTTTTCCTC 132 0 

TGG CTGATCA TCTGAACTCC ACTAATCATG CCATTGTTCA GACGTTGGTC AACTCTGTTA 13 80 

ACTCTAAGAT TCCTAAGGCA TGCTGTGTCC CGACAGAACT CAGTGCTATC TCGATGCTGT 1440 

ACCTTGACGA GAATGAAAAG GTTGTATTAA AGAACTATCA GGACATGGTT GTGGAGGGTT 1500 
GTGGGTGTCG CTAGTACAGC AAAATTAAAT ACATAAATAT ATATATA 
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MVAGTRCLLA LLLPQVLLGG AAGLVPELGR RKPAAASSGR PSSQPSDEVL SEFELRLLSM 60 

5 FGLKQRPTPS RDAWPPYML DLYRRHSGQP GSPAPDHRLE RAASRANTVR SFHHEESLEE 120 

LPETSGKTTR RFFFNLSSIP TEEFITSAEL QVFREQMQDA LGNNSSFHHR INIYEIIKPA 180 
TANSKFPVTR LLDT 

Seq ID NO: 458 DNA sequence 
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JO ATGGGGAGAA GACGGAGGCT GTGTCTCCAG CTCTACTTCC TGTGGCTGGG CTGTGTGGTG 60 

CTCTGGGCGC AGGGCACGGC CGGCCAGCCT CAGCCTCCTC CGCCCAAGCC GCCCCGGCCC 120 

CAGCCGCCGC CGCAACAGGT TCGGTCCGCT ACAGCAGGCT CTGAAGGCGG GTTTCTAGCG 180 

CCCGAGTATC GCGAGGAGGG TGCCGCAGTG GCCAGCCGCG TCCGCCGGCG AGGACAGCAG 240 

GACGTGCTCC GAGGGCCCAA CGTGTGCGGC TCCAGATTCC ACTCCTACTG CTGCCCTGGA 3 00 

20 TGGAAGACGC TCCCTGGAGG AAACCAGTGC ATTGTCCCGA TTTGTAGAAA TAGTTGTGGA 3 60 

GATGGATTTT GTTCCCGTCC TAACATGTGT ACTTGTTCCA GTGGGCAAAT ATCATCAACC 420 

TGTGGATCAA AATCAATTCA GCAGTGCAGT GTGAGATGCA TGAATGGTGG GACCTGTGCA 480 

GATGACCACT GCCAGTGCCA GAAAGGATAT ATTGGAACTT ATTGTGGACA ACCTGTCTGT 540 

GAAAATGGAT GTCAGAATGG TGGACGTTGC ATCGCCCAAC CGTGTGCTTG TGTTTATGGG 60 0 

25 TTCACTGGTC CACAGTGTGA AAGAGATTAC AGGACAGGCC CGTGTTTCAC TCAGGTCAAC 660 

AACCAGATGT GCCAAGGGCA GCTGACAGGC ATTGTCTGCA CGAAGACTCT GTGCTGTGCC 72 0 

ACCACTGGAC GGGCGTGGGG CCATCCCTGT GAGATGTGTC CAGCCCAGCC TCAGCCCTGC 780 

CGACGGGGTT TCATCCCCAA CATCCGCACT GGAGCTTGCC AAGATGTTGA TGAATGCCAG 84 0 

GCTATCCCAG GGATATGCCA AGGAGGAAAC TGTATCAATA CAGTGGGCTC TTTTGAATGC 900 

30 AGATGCCCTG CTGGT CACAA ACAGAGTGAA ACTACTCAGA AATGTGAAGA CATTGATGAG 960 

TGCAGCATCA TTCCTGGGAT ATGTGAAACT GGTGAATGTT CCAACACCGT GGGAAGCTAT 1020 

TTTTGTGTTT GTCCACGTGG ATATGTAACC TCAACAGATG GCTCTCGATG CATCG AT CAG 108 0 

AGAACAGGCA TGTGTTTCTC GGGCCTGGTG AATGGCCGCT GTGCACAAGA GCT CCCGGGG 1140 

AGAATGACGA AAATGCAGTG CTGCTGTGAG CCTGGCCGCT GCTGGGGCAT CGGAACCATT 1200 

35 CCTGAAGCCT GTCCTGTCAG AGGTTCTGAG GAATATCGCA GACTTTGCAT GGATGGACTT 1260 

CCAATGGGAG GAAT TCCAGG GAGTGCTGGT TCCAGACCTG GAGGCACTGG GGGAAATGGC 1320 

TTTGCCCCAA GTGG CAATGG CAATGG CT AT GGCCCAGGAG GGACAGGCTT CATCCCCATC 13 80 

CCTGGAGGCA ATGGCTTTTC TCCTGGCGTT GGGGGAGCCG GTGTGGGGGC CGGGGGACAG 1440 

GGACCTATCA TCACTGGACT AACAATTCTG AAC C AG AC AA TAGATATCTG TAAGCAT CAT 1500 

40 GCTAACCTTT GTTTAAATGG ACGCTGTATA CCAACTGTCT CAAGCTACCG ATGTGAATGC 1560 

AACATGGGTT ATAAGCAGGA TGCAAATGGA GATTGTATAG ATGTTGATGA ATGCACATCA 162 0 

AATCCCTGCA CTAATGGAGA TTGTGTTAAC ACACCTGGTT CCTATTATTG TAAATGTCAT 1680 

GCTGGATTCC AGAGGACTCC TACCAAGCAA GCATGCATTG ATATTGATGA GTGCATCCAG 174 0 

AATGGGGTTC TTTGTAAAAA CGGTCGATGC GTGAACTCAG ATGGAAGTTT CCAGTGCATT 1800 

45 TGCAATGCCG GCTTTGAATT AACTACAGAT GGAAAAAACT GTGTTGATCA TGATGAATGT 1860 

ACAACTACCA ACATGTGTTT GAATGGAATG TGCATCAATG AAGATGGCAG CTTCAAGTGC 1920 

ATCTGCAAAC CAGGATTTGT CTTGGCTCCA AATGGGCGTT ACTGTACTGA TGTTGATGAA 1980 

TGCCAGACCC CAGGAATCTG CATGAATGGG CACTGCAT CA ACAGTGAAGG GTCCTTCCGC 2040 

TGTGACTGTC CCCCAGGCCT GGCTGTGGGC ATGGATGGAC GTGTGTGTGT TGATACT CAC 2100 

50 ATGCGCAGTA CCTGCTATGG AGGAATCAAG AAAGGAGTGT GTGTGCGTCC TTTCCCCGGT 2160 

GCAGTGACCA AGTCCGAATG CTGCTGTGCC AATCCAGACT ATGGTTTTGG AGAACCCTGC 2220 

CAGCCATGCC CTGCAAAAAA TTCAGCTGAA TTCCACGGCC TTTGTAGTAG TGGAGTAGGT 2280 

ATCACTGTGG ATGGAAGAGA TAT CAATG AA TGTGCTTTGG ATCCTGATAT ATGTGCCAAT 2340 

GGGATTTGTG AAAACTTACG TGGTAGTTAC CGTTGTAATT GCAACAGTGG CTATGAACCA 2400 

55 GATGCCTCTG GAAGAAACTG TATTGACATT GATGAATGTT TAGTAAACAG ACTGCTTTGT 2460 

GATAACGGAT TGTGCCGAAA CACGCCAGGA AGTTACAGCT GTACGTGCCC ACCAGGGTAT 252 0 

GTGTTCAGGA CTGAGACAGA GACCTGTGAA GATATAAATG AATGTGAAAG CAACCCATGT 2580 

GTCAATGGGG CCTGCAGAAA CAACCTTGGA TCTTTCAATT GTGAATGTTC GCCCGGCAGC 264 0 

AAACTCAGCT CCACAGGATT GATCTGTATT GACAGCCTGA AGGGGACCTG TTGGCTCAAC 2700 

60 ATCCAGGACA GCCGCTGTGA GGTGAATATT AATGGAGCCA CTCTGAAATC TGAATGCTGT 2760 

GCCACCCTCG GAGCCGCCTG GGGGAGCCCC TGTGAGCGGT GTGAACTAGA TACAGCTTGC 2 820 

CCAAGAGGGC TTGCCAGGAT TAAAGGTGTT ACGTGTGAAG ATGTTAATGA GTGTGAGGTG 2 880 

TTCCCTGGCG TTTGTCCAAA TGGACGCTGT GTCAACAGTA AGGGATCTTT TCATTGCGAG 2940 

TGCCCTGAAG GCCTTACGTT GGATGGGACT GGCCGTGTAT GTTTGGATAT TCGCATGGAG 3 000 

65 CAGTGTTACT TGAAGTGGGA TGAAGATGAA TGCATCCACC CCGTTCCTGG AAAGTT CCGC 3060 

ATGGATGCCT GCTGCTGTGC TGTCGGGGCG GCTTGGGGCA CCGAGTGTGA GGAGTGCCCC 3120 

AAACCTGGCA CCAAGGAATA CGAGACACTG TGCCCCCGCG GGGCTGGCTT TGCTAACCGA 3180 

GGGGATGTTC TTACTGGGCG GCCATTTTAC AAAGACATCA ATGAATGCAA AGCATTTCCT 32 40 

GGGATGTGCA CTTATGGGAA GTGCAGAAAT ACAATCGGAA GCTTCAAATG CCGTTGCAAT 3300 

70 AGTGGCTTTG CTCTAGACAT GGAGGAAAGA AACTGCACGG ACATCGACGA GTGCAGGATT 3360 

TCTCCTGACC TCTGTGGCAG TGGAATCTGC GTCAATACAC CGGGCAGCTT TGAGTGCGAG 3420 

TGCTTCGAAG GCTATGAAAG TGGCTTCATG ATGATGAAGA ACTGCATGGA CATTGACGGA 3480 

TGTGAACGTA ACCCTCTCCT TTGTAGGGGT GGCACCTGTG TGAACACTGA GGGCAGCTTT 3540 

CAGTGTGACT GCCCACTGGG ACACGAGCTG TCACCATCCC GTGAGGACTG TGTGGATATT 3600 

75 AATGAATGCT CCCTGAGTGA CAATCTCTGC AGAAATGGAA AATGTGTGAA CATGATTGGA 3660 

ACCTATCAGT GCTCTTGCAA TCCTGGATAT CAGGCTACGC CAGACCGCCA GGGCTGTACA 3720 

GATATTGATG AATGTATGAT AATGAACGGA GGCTGTGACA CCCAGTGCAC AAATTCAGAG 3 780 

GGAAGCTACG AATGCAGCTG CAGTGAGGGT TATGCCCTGA TGCCAGATGG GAGATCGTGT 3 840 

GCAGACATTG ATGAATGTGA AAACAATCCT GATAT CTGTG ATGGCGGCCA GTGTACCAAC 3900 

80 ATTCCTGGAG AGTATCGCTG CCTCTGCTAT GATGGCTTCA TGGCTTCCAT GGACATGAAA 3960 

ACATGCATTG ATGTCAATGA ATGTGACCTA AATTCAAATA TCTGCATGTT TGGGGAATGT 4020 

G AG AA CACAA AGGGATCCTT CATTTGCCAC TGTCAGCTGG GTTACTCAGT GAAGAAGGGG 4080 

ACCACAGGAT GTACAGATGT GGATGAGTGT GAAATTGGTG CTCATAACTG CGACATGCAT 4140 

GCCTCATGTC TGAATATCCC AGGAAGCTTC AAGTGTAGCT GCAGAGAAGG CTGGATTGGA 4200 

85 AACGGCATCA AGTGTATTGA TCTGGACGAA TGTTCTAATG GAACCCACCA GTGTAGCATC 4260 

AATGCTCAGT GTGTAAATAC CCCGGGCTCA TACCGCTGTG CCTGCTCCGA AGGTTTCACT 4320 

GGTGATGGCT TTACCTGCTC AGATGTTGAT GAGTGTGCAG AAAACATAAA CCTCTGTGAG 43 80 
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AACGGACAGT GCCTTAATGT CCCGGGTGCA TATCGCTGCG AGTGTGAGAT GGGCTTCACT 4440 

CCAGCCTCAG ACAGCAGATC CTGCCAAGAT ATTGATGAAT GCTCCTTCCA AAACATTTGT 4500 

GTCTCTGGAA CATGTAATAA CCTGCCTGGA ATGTTTCATT GCATCTGCGA TGATGGTTAT 4560 

GAATTGGACA GAACAGGAGG GAACTGTACA GATATTGATG AGTGTGCAGA TCCTATAAAC 4620 

TGTGTCAATG GCCTATGTGT CAACACGCCT GGTCGCTATG AGTGTAACTG CCCACCCGAT 4680 

TTTCAGTTGA ACCCAACTGG TGTGGGTTGT GTTGACAACC GTGTGGGCAA CTGCTACCTG 4740 

AAGTTTGGAC CTCGAGGAGA TGGGAGTCTG TCTTGCAACA CCGAGATCGG GGTGGGCGTC 480 0 

AGTCGCTCTT CATGCTGCTG CTCTCTGGGA AAGGCCTGGG GAAACCCCTG TGAGACATGC 4860 

CCCCCTGTCA ATAGCACTGA ATATTACACC CTGTGT CCCG GAGGTGAAGG CTTCAGACCT 492 0 

AACCCCATCA CAATCATTTT AGAAGACATT GACGAATGCC AGGAGTTACC AGGTCTCTGC 4980 

CAGGGT GGAA ACTG CAT CAA CACTTTTGGG AGCTTCCAGT GTGAGTGCCC ACAAGGCTAC 5040 

TACCTCAGCG AGGATACCCG CATCTGTGAG GATATTGATG AGTGTTTTGC ACATCCTGGT 5100 

GTGTGTGGGC CTGGGACCTG CTATAACACC CTGGGAAATT ACACCTGCAT TTGCCCACCT 5160 

GAGTACATGC AGGTCAATGG AGGCCACAAC TGCATGGACA TGAGAAAAAG CTTTTGCTAC 5220 

CGAAG C TATA ATGGAACCAC TTGTGAGAAT GAGTTGCCTT TCAATGTGAC AAAAAGGATG 52 80 

TGCTGCTGCA CATATAATGT GGGCAAAGCT GGGAACAAAC CTTGTGAACC ATGCCCAACT 5340 

C C AGGAAC AG CTGACTTTAA AACCATATGT GGAAATATTC CTGGATTCAC CTTTGACATT 5400 

CACACAGGAA AAGCTGTTGA CATTGATGAA TGTAAAGAGA TTCCAGGCAT TTGTGCAAAT 5460 

GGTGTGTGCA TTAACCAGAT TGGCAGTTTC CGCTGTGAAT GCCCTACAGG ATTCAGTTAC 552 0 

AATGACCTGC TGTTGGTTTG TGAAGATATA GATGAGTGCA GCAATGGTGA TAATCTCTGC 5580 

CAGCGGAATG CAGACTGCAT CAATAGTCCT GGTAGTTACC GCTGTGAATG TGCCGCGGGT 5640 

TTCAAACTTT CACCCAATGG GGCCTGTGTA GATCGCAATG AATGTTTAGA AATTCCTAAC 5700 

GTTTGCAGTC ATGGCTTGTG TGTTGATCTG CAAGGAAGTT ACCAGTGCAT CTGCCACAAT 5760 

GGCTTTAAGG CTTCTCAGGA CCAGACCATG TGCATGGATG TTGATGAGTG CGAGCGGCAC 5820 

CCATGTGGAA ATGGAACTTG TAAAAACACC. GTTGGATCCT ATAACTGTCT GTGCTACCCA 5880 

GGGTTTGAAC TCACTCATAA TAATGATTGC CTGGACATAG ATGAGTGCAG TTCCTTTTTT 5940 

GGT CAGGTGT GCAGAAATGG ACGTTGTTTT AATGAAATTG GTTCTTTCAA GTGTCTATGT 6000 

AACGAAGGTT ATGAACTTAC CCCAGATGGC AAAAACTGTA TAGACACTAA TGAGTGTGTC 6060 

GCCCTTCCCG GCTCTTGCTC TCCTGGTACC TGTCAGAATT TGGAGGGATC CTTCAGATGC 6120 

ATCTGTCCCC CAGGGTATGA AGTAAAAAGC GAGAACTGCA TTGATATAAA TGAATGTGAT 6180 

GAAGATCCCA ACATTTGTCT TTTTGGTTCC TGTACTAATA CTCCAGGGGG CTTCCAGTGC 6240 

CTCTGCCCCC CTGGCTTTGT ACTATCTGAT AATGGACGGA GATGCTTTGA TACTCGCCAG 6300 

AGCTTCTGCT TCACAAATTT TGAAAATGGA AAGTGTTCTG TACCCAAAGC TTT CAACACC 6360 

ACAAAAGCAA AATGCTGCTG TAGTAAGATG CCAGGAGAGG GCTGGGGGGA CCCCTGTGAG 642 0 

CTGTGCCCCA AAGACGATGA AGTTGCATTT CAGGATTTGT GTCCATATGG CCATGGAACT 6480 

GTCCCTAGTC TTCATGATAC ACGTGAAGAT GTCAATGAGT GTCTTGAGAG CCCAGGCATT 6540 

TGTTCAAATG GTCAATGTAT CAACACCGAC GGATCTTTTC GCTGTGAATG TCCAATGGGC 660 0 

TACAACCTTG ACTACACTGG AGTACGCTGT GTGGATACTG ATGAGTGTTC AATCGGCAAT 6660 

CCGTGTGGAA ATGGTACATG CACCAATGTT ATTGGGAGTT TTGAATGCAA TTGCAATGAA 672 0 

GGCTTTGAGC CAGGGCCCAT GATGAATTGT GAAGATATCA ACGAATGTGC CCAGAACCCA 67 80 

CTGCTGTGTG CTTTACGCTG CATGAACACT TTTGGGTCCT ATGAATGCAC GTGCCCGATT 6840 

GGCTATGCCC TCAGGGAAGA TCAAAAGATG TGCAAAGAT C TGGATGAATG TGCTGAAGGG 6 900 

TTACACGACT GTGAATCTAG GGGCATGATG TGTAAGAATC TAATCGGCAC CTTCATGTGC 6 960 

ATCTGCCCTC CTGGAATGGC CCGAAGGCCC GATGGAGAAG GCTGTGTAGA TGAAAATGAA 702 0 

TGCAGGACCA AGCCAGGAAT CTGTGAAAAT GGACGTTGTG TTAACATTAT TGGAAGCTAT 7080 

AGATGTGAGT GTAATGAAGG ATTCCAGT C A AGTTCTTCAG GCACTGAATG CCTTGACAAT 7140 

CGACAGGGTC TCTGCTTTGC AGAGGTACTG CAGACAATAT GTCAAATGGC ATCCAGTAGT 72 0 0 

CGCAATCTCG TCACTAAGTC AGAATGCTGC TGTGATGGTG GGCGAGGCTG GGGCCACCAG 7260 

TGCGAGCTTT GCCCACTTCC TGGAACTGCC CAGTACAAAA AGATATGTCC TCATGGCCCA 73 2 0 

GGATATACAA CTGATGGAAG AGATATTGAT GAATGTAAGG TAATGCCAAA CCTCTGCACC 73 80 

AATGGTCAGT GCATCAATAC CATGGGCTCA TTCCGATGCT TCTGCAAGGT TGGCTACACC 7440 

ACAGACATCA GTGGAACCTC TTGTATAGAC, CTTGATGAAT GCTCCCAGTC CCCGAAACCA 7500 

TGCAACTACA TCTGCAAGAA CACTGAGGGG AGTTATCAGT GTTCATGTCC GAGGGGGTAT 7560 

GTCCTGCAAG AGGATGGAAA GACATGCAAA GACCTTGATG AATGTCAAAC AAAGCAGCAT 762 0 

AACTGCCAGT TCCTCTGTGT CAACACCCTG GGGGGGTTTA CCTGTAAATG TCCACCTGGT 7680 

TTCACACAGC ATCACACTGC TTGTATCGAC AACAACGAAT GTGGGTCTCA ACCTTTGCTT 7740 

TGTGGAGGAA AGGGAATCTG TCAAAACACT CCAGGCAGTT TCAGCTGTGA ATGCCAAAGA 7800 

GGGTTCTCTC TTGATGCCAC CGGACTGAAC TGTGAAGATG TTGATGAATG TGATGGGAAC 7860 

CACAGGTGCC AACACGGCTG CCAGAACATC CTGGGTGGCT ACAGATGTGG CTGCCCCCAA 7 920 

GGCTACATCC AG CACTACCA GTGGAATCAG TGTGTCGATG AGAATGAATG CTCCAATCCC 7 980 

AATGCCTGTG GCTCTGCTTC CTGCTACAAC ACCCTGGGGA GTTACAAGTG CGCCTGCCCC 8040 

TCGGGGTTCT CCTTCGACCA GTTCTCCAGT GCCTGCCACG ACGTGAATGA GTGCTCGTCC 8100 

TCCAAGAACC CCTGCAATTA CGGCTGCTCT AACACGGAGG GGGGCTACCT CTGTGGCTGC 8160 

CCCCCTGGGT ATTACAGAGT GGGACAAGGC CACTGTGTCT CAGGAATGGG ATTTAACAAG 8220 

GGGCAGTACC TGT CACTGGA TACAGAGGTC GATGAGGAAA ATGCTCTGTC CCCAGAAGCA 82 80 

TGCTACGAGT GCAAAATCAA CGGCTATCCT AAGAAAGACA GCAGGCAGAA GAGAAGTATT 8340 

CATGAACCTG ATCCCACTGC TGTTGAACAG ATCAGCCTAG AGAGTGTCGA CATGGACAGC 8400 

CCCGTCAACA TGAAGTTCAA CCTCTCCCAC CTCGGCTCTA AGGAGCACAT CCTGGAACTA 8460 

AGGCCCGCCA TCCAGCCCCT CAACAACCAC ATCCGTTATG TCATCTCTCA AGGGAACGAT 8520 

GACAGCGTCT TCCGCATCCA CCAAAGGAAT GGGCTCAGCT ACTTGCACAC GGCCAAGAAG 8580 

AAGCTCATGC CCGGCACATA CACACTGGAA ATCACTAGCA TCCCTCTCTA CAAGAAGAAG 8 640 

GAGCTTAAGA AACTGGAAGA GAGCAATGAG GATGACTACC TCCTAGGGGA GCTTGGGGAG 8700 

GCTCTCAGAA TGAGGCTGCA GATTCAGCTC TATTAACCGT TCACAGACTT GGGCCCAGGC 8760 

TCAAATCCTA GCACAGCCAG TCTGCAGAAG CATTTGAAAA GTCAAGGACT AATTTTAAAG 8 82 0 

AGGAAAAATA ATAATAACTC TTGTTTCTTT CCTCCCTGTC TTAGACTTTG AATGTTGACC 8880 

CTCACAGGGA GGGATAATTT AGACTCTGGT ATGGCCAAAG ATTTGAGCTC AAAGGCAACC 8940 

GTGGTTACTG TATTTTTTAT ATAACTTCAT TTTAAAATAT ATTAAAAGAA ACCTAAATGT 9000 

TCAAGATATC AGCATATGGC ACTAAATGCA CAAAAATAAT GTGAGCTTTT TTTTTTTTTT 90 60 

CCTGTTAGCA GTCTGTAACA CTTTGGGTAT TTTGCTATAG TTGCTAATTA AAAAAATATA 9120 

GATGTTTATT TATTTTTAAT GCAGTAATAT ATGGAGAAAT GAACAAACTA TGTAAACAAA 9180 

AAGGGAAACT CACTTGTTTT TCTTTAGATT TATAAATTTG AGCTATTTTT TTTAGAGGTG 9240 

CTTTTTAAAA ATCCAATAGA TACAAGAGAT GTTTCCTTTG GTTTTCTGCC AGTCATCCAG 9300 

CTGATACACA CCTGAT CGAT TTTAAAGAAA GCCACACAGA GCTGAATCGG GCAGTGCTAA 93 60 

TCAATAATTT AAAAGACATG AATGTCATTA GATCCTTTAT AACGT AG AT C GAAGCCAAAG 9420 

CAGCTCATTT GTGACAACAT TTCATATCAC CAGACACACC AGGCAACAGA AGTTGAAGCA 9480 

CAACCACTGT AG C AAAAT AC CTTGACTGCT TGTGAGACCA TTAGCATTGC AGGCCAAACC 9540 

GTACTGTATT TCCTTCTCAT AACCTCAAGG AACCATATGT GCTACCCACA ACACCTCATT 9600 
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CTTACCCAGG GTGCGCTGCG TCCTCATGGT 
TTGAAAGGGA ACACCTGGCA TTCTGTGGTG 
ATTATGTTCA AGTTATTTCA GGATTGCCAT 
GGAATATATG TTGTTGTTGT TGTTTTAAAC 
TGTAGTTATA CACCATATGC CTCATTTTAT 
ACAATGAATT GATGTTTAGT TTGCTTTAGT 
TATTAAGAGC ACGTAT CCAT TATTCTTCTC 
CCAAACCTCA TATGTGAAAT GGCCAAAGCA 
CTGTGCTGAC CAAAGATTAG TAACCAGTTA 
TTAATAACTA AAAAAAAACT CGTGCC 



ACTGTAGGCA GCTGAAGAAC CGCCGTTCCC 9660 
TTTCGTGCTG TCTTAAATAA TGGTGCATTT 9720 
ATGTGCAAAC AAATCATGCA ATGCAGCCAA 9780 
CCATTTTTTT TTTAGAATTT TCATTAATAC 9840 
CATAGCCTAT TGTGTATGAA AGATGTTTGT 9900 
CATTTAAAAA GATATTGTAC CAGGATGTGC 9960 
AACCCAAGAA CCTGTTTCCT GGACCAGTGA 10020 
CATGCAGGCT CCTGGTTGTT CCTCTCAAAC 10 0 80 
TACCCAGTAT TTTGAGGTTT TATTGTTTTT 10140 



Seq ID NO: 459 Protein sequence 
Protein Accession #: NP_001990.1 



1 11 21 31 41 51 

I I I ! I I 

MGRRRRLCLQ LYFLWLGCW LWAQGTAGQP QPPPPKPPRP QPPPQQVRSA TAGSEGGFLA 60 
PEYREEGAAV ASRVRRRGQQ DVLRGPNVCG SRFHSYCCPG WKTLPGGNQC IVPICRNSCG 120 
DGPCSRPNMC TCSSGQISST CGSKSIQQCS VRCMNGGTCA DDHCQCQKGY IGTYCGQPVC 180 
ENGCQNGGRC IAQPCACVYG FTGPQCERDY RTGPCFTQVN NQMCQGQLTG IVCTKTLCCA 240 
TTGRAWGHPC EMCPAQPQPC RRGFIPNIRT GACQDVDECQ AIPGICQGGN CINTVGSFEC 3 00 
RCPAGHKQSE TTQKCEDIDE CSIIPGICET GECSNTVGSY FCVCPRGYVT STDGSRCIDQ 3 60 
RTGMCFSGLV NGRCAQELPG RMTKMQCCCE PGRCWGIGTI PEACPVRGSE EYRRLCMDGL 42 0 
PMGGIPGSAG SRPGGTGGNG FAPSGNGNGY GPGGTGFIPI PGGNGFSPGV GGAGVGAGGQ 480 
GPIITGLTIL NQTIDICKHH ANLCLNGRCI PTVSSYRCEC NMGYKQDANG DCIDVDECTS 540 
NPCTNGDCVN TPGSYYCKCH AGFQRTPTKQ ACIDIDECIQ NGVLCKNGRC VNSDGSFQCI 600 
CNAGFELTTD GKNCVDHDEC TTTNMCLNGM CINEDGSFKC ICKPGFVLAP NGRYCTDVDE 660 
CQTPGICMNG HCINSEGSFR CDCPPGLAVG MDGRVCVDTH MRSTCYGGIK KGVCVRPFPG 720 
AVTKSECCCA NPDYGFGEPC QPCPAKNSAE FHGLCSSGVG ITVDGRDINE CALDPD I CAN 780 
GICENLRGSY RCNCNSGYEP DASGRNCIDI DECLVNRLLC DNGLCRNTPG SYSCTCPPGY 840 
VFRTETETCE DINECESNPC VNGACRNNLG SFNCECSPGS KIiSSTGLICI D SIiKGT CWLN 900 
IQDSRCEVNI NGATLKSECC ATLGAAWGSP CERCELDTAC PRGLARIKGV TCEDVNECEV 960 

FPGVCPNGRC VNSKGSFHCE CPEGLTLDGT GRVCLDIRME QCYLKWDEDE CIHPVPGKFR 102 0 

MDACCCAVGA AWGTECEECP KPGTKEYETIi CPRGAGFANR GDVLTGRPFY KDINECKAFP 10 80 

GMCTYGKCRN TIGSFKCRCN SGFALDMEER NCTDIDECRI SPDLCGSGIC VNTPGSFECE 1140 

CFEGYESGFM MMKNCMDIDG CERNPLLCRG GTCVNTEGS F QCDCPLGHEL SPSREDCVDI 1200 

NECSLSDNLC RNGKCVNMIG TYQCSCNPGY QATPDRQGGT DIDECMIMNG GCDTQCTNSE 12 60 

GSYECSCSEG YALMPDGRSC ADIDECENNP DICDGGQCTN IPGEYRCLCY DGFMASMDMK 13 20 

TCI DVNECDL NSNICMFGEC ENTKGSFICH CQLGYSVKKG TTGCTDVDEC EIGAHNCDMH 13 80 

ASCLNIPGSF KCSCREGWIG NGIKCIDLDE CSNGTHQCSI NAQCVNTPGS YRCACSEGFT 1440 

GDGFTCSDVD ECAENINLCE NGQCLNVPGA YRCECEMGFT PASDSRSCQD IDECSFQNIC 150 0 

VSGTCNNLPG MFHCICDDGY ELDRTGGNCT DIDECADPIN CVNGLCVNTP GRYECNCPPD 1560 

FQLNPTGVGC VDNRVGNCYL KFGPRGDGSL SCNTEIGVGV SRSSCCCSLG KAWGNPCETC 162 0 

PPVNSTEYYT LCPGGEGFRP NPITIILEDI DECQELPGLC QGGNCINTFG SFQCECPQGY 1680 

YLSEDTRICE DIDECFAHPG VCGPGTCYNT LGNYTCICPP EYMQVNGGHN CMDMRKSFCY 1740 

RSYNGTTCEN ELPFNVTKRM CCCTYNVGKA GNKPCEPCPT PGTADFKT I C GNIPGFTFDI 1800 

HTGKAVD IDE CKEIPGICAN GVCINQIGSF RCECPTGFSY NDLLLVCEDI DECSNGDNLC 1860 

QRNADCINSP GSYRCECAAG FKLSPNGACV DRNECLE I PN VCSHGLCVDL QGSYQCICHN 192 0 

GFKASQDQTM CMDVD ECERH PCGNGTCKNT VGSYNCLCYP GFELTHNNDC LDIDECSSFF 1980 

GQVCRNGRCF NEIGSFKCLC NEGYELTPDG KNCIDTNECV ALPGSCSPGT CQNLEGSFRC 2 040 

ICPPGYEVKS ENCIDINECD EDPNICLFGS CTNTPGGFQC LCPPGFVLSD NGRRCFDTRQ 2100 

S F CFTNFENG KCSVPKAFNT TKAKCCCSKM PGEGWGDPCE LCPKDDEVAF QDLCPYGHGT 2160 

VPSLHDTRED VNECLESPGI CSNGQCINTD GSFRCECPMG YNLDYTGVRC VDTDECSIGN 222 0 

PCGNGTCTNV IGSFECNCNE GFEPGPMMNC EDINECAQNP LLCALRCMNT FGSYECTCPI 22 8 0 

GYALREDQKM CKDLDECAEG LHDCESRGMM CKNLIGTFMC ICPPGMARRP DGEGCVDENE 23 40 

CRTKPGICEN GRCVNIIGSY RCECNEGFQS SSSGTECLDN RQGLCFAEVL QTICQMASSS 2 400 

RNLVTKSECC CDGGRGWGHQ CELCPLPGTA QYKKICPHGP GYTTDGRDID ECKVMPNLCT 2460 

NGQCINTMGS FRCFCKVGYT TDISGTSCID L.DECSQSPKP CNYICKNTEG SYQCSCPRGY 252 0 

VLQEDGKTCK DLDECQTKQH NCQFLCVNTL GGFTCKCPPG FTQHHTACID NNECGSQPLL 2580 

CGGKGICQNT PGSFSCECQR GFSLDATGLN CEDVDECDGN HRCQHGCQNI LGGYRCGCPQ 2 640 

GYIQHYQWNQ CVDENECSNP NACGSASCYN TLGSYKCACP SGFSFDQFSS ACHDVNECSS 2700 

SKNPCNYGCS NTEGGYLCGC PPGYYRVGQG HCV SGMGFNK GQYLSLDTEV DEENALSPEA 2760 

CYECKINGYP KKDSRQKRSI HEPDPTAVEQ ISDESVDMDS PVNMKFNLSH LGSKEHILEL 2 82 0 

RPAI QPLNNH IRYVISQGND DSVFRIHQRN GLSYLHTAKK KLMPGTYTLE ITSIPLYKKK 2 880 
ELKKLEESNE DDYLLGELGE ALRMRLQIQL Y 

Seq ID NO: 460 DNA sequence 

Nucleic Acid Accession ft: NMJD13372.1 

Coding sequence: 63,. 617 

1 11 21 31 41 51 

I I I I I I 

GCGGCCGCAC TCAGCGCCAC GCGTCGAAAG CGCAGGCCCC GAGGACCCGC CGCACTGACA 60 
GTATGAGCCG CACAGCCTAC ACGGTGGGAG CCCTGCTTCT CCTCTTGGGG ACCCTGCTGC 12 0 
CGGCTGCTGA AGGGAAAAAG AAAGGGTCCC AAGGTGCCAT CCCCCCGCCA GACAAGGCCC 180 
AGCACAATGA CTCAGAGCAG ACTCAGTCGC CCCAGCAGCC TGGCTCCAGG AACCGGGGGC 240 
GGGGCCAAGG GCGGGGCACT GCCATGCCCG GGGAGGAGGT GCTGGAGTCC AGCCAAGAGG 3 00 
CCCTGCATGT GACGG AG CGC AAATACCTGA AGCGAGACTG GTGCAAAACC CAGCCGCTTA 360 
AGCAGACCAT CCACGAGGAA GGCTGCAACA GTCGCACCAT CATCAACCGC TTCTGTTACG 420 
GCCAGTGCAA CTCTTTCTAC ATCCCCAGGC ACATCCGGAA GGAGGAAGGT TCCTTTCAGT 480 
CCTGCTCCTT CTGCAAGCCC AAGAAATTCA CTACCATGAT GGTCACACTC AACTGCCCTG 540 
AACTACAGCC ACCTACCAAG AAGAAGAGAG TCACACGTGT GAAGCAGTGT CGTTGCATAT 600 
CCATCGATTT GGATTAAGCC AAATCCAGGT GCACCCAGCA TGTCCTAGGA ATGCAGCCCC 660 
AGGAAGTCCC AGACCTAAAA CAACCAGATT CTTACTTGGC TTAAACCTAG AGGCCAGAAG 72 0 
AACCCCCAGC TGCCTCCTGG CAGGAGCCTG CTTGTGCGTA GTTCGTGTGC ATGAGTGTGG 780 
ATGGGTGCCT GTGGGTGTTT TTAGACACCA GAGAAAACAC AGTCTCTGCT AGAGAGCACT 840 
CCCTATTTTG TAAACATATC TGCTTTAATG GGGATGTACC AGAAACCCAC CTCACCCCGG 900 
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CTCACATCTA AAGGGGCGGG GCCGTGGTCT GGTTCTGACT TTGTGTTTTT GTGCCCTCCT 960 

GGGGACCAGA ATCTCCTTTC GGAATGAATG TTCATGGAAG AGGCTCCTCT GAGGGCAAGA 102 0 

GACCTGTTTT AGTGCTGCAT TCGACATGGA AAAGTCCTTT TAACCTGTGC TTGCATCCTC 1080 

CTTTCCTCCT CCTCCTCACA ATCCATCTCT TCTTAAGTTG ATAGTGACTA TGTCAGTCTA 1140 

ATCTCTTGTT TGCCAAGGTT CCTAAATTAA TTCACTTAAC CATGATGCAA ATGTTTTTCA 12 0 0 

TTTTGTGAAG ACCCTCCAGA CTCTGGGAGA GGCTGGTGTG GGCAAGGACA AG C AGG AT AG 12 60 

TGGAGTGAGA AAGGGAGGGT GGAGGGTGAG GCCAAATCAG GTCCAGCAAA AGTCAGTAGG 132 0 

GACATTGCAG AAGCTTGAAA GGCCAATACC AGAACACAGG CTGATGCTTC TGAGAAAGTC 1380 

TTTTCCTAGT ATTTAACAGA ACCCAAGTGA ACAGAGGAGA AATGAGATTG CCAGAAAGTG 1440 

ATTAACTTTG GCCGTTGCAA TCTGCTCAAA CCTAACACCA AACTGAAAAC ATAAATACTG 1500 

ACCACTCCTA TGTTCGGACC CAAGCAAGTT AGCTAAACCA AACCAACTCC TCTGCTTTGT 1560 

CCCTCAGGTG GAAAAGAGAG GTAGTTTAGA ACTCTCTGCA TAGGGGTGGG AATTAATCAA 1620 

AAACCKCAGA GGCTGAAATT CCTAATACCT TTCCTTTATC GTGGTTATAG TCAGCTCATT 1680 

TCCATTCCAC TATTTCCCAT AATGCTTCTG AGAGCCACTA ACTTGATTGA TAAAGATCCT 1740 

GCCTCTGCTG AGTGTACCTG ACAGTAAGTC TAAAGATGAR AGAGTTTAGG GACTACTCTG 180 0 

TTTTAGCAAG ARATATTKTG GGGGTCTTTT TGTTTTAACT ATTGTCAGGA GATTGGGCTA 1860 

RAGAGAAGAC GACGAGAGTA AGGAAATAAA GGGRATTGCC TCTGGCTAGA GAGTAAGTTA 192 0 

GGTGTTAATA CCTGGTAGAA ATGTAAGGGA TATGACCTCC CTTTCTTTAT GTGCTCACTG 1980 

AGGATCTGAG GGGACCCTGT TAGG AG AG CA TAGCATCATG ATGTATTAGC TGTTCATCTG 2040 

CTACTGGTTG GATGGACATA ACTATTGTAA CTATTCAGTA TTTACTGGTA GGCACTGTCC 210 0 

TCTGATTAAA CTTGGCCTAC TGGCAATGGC TACTTAGGAT TGATCTAAGG GCCAAAGTGC 2160 

AGGGTGGGTG AACTTTATTG TACTTTGGAT TTGGTTAACC TGTTTTCTTC AAGCCTGAGG 2220 

T T T TAT AT AC AAACT CCCTG AATACTCTTT TTGCCTTGTA TCTTCTCAGC CTCCTAGCCA 2280 

AGTCCTATGT AATATGGAAA ACAAACACTG CAGACTTGAG ATTCAGTTGC CGATCAAGGC 23 40 

TCTGGCATTC AGAGAACCCT TGCAACTCGA GAAGCTGTTT TTATTTCGTT TTTGTTTTGA 2400 

TCCAGTGCTC TCCCATCTAA CAACTAAACA GGAGCCATTT CAAGGCGGGA GATATTTTAA 2460 

ACACCCAAAA TGTTGGGTCT GATTTTCAAA CTTTTAAACT CACTACTGAT GATTCT CACG 252 0 

CTAGGCGAAT TTGTCCAAAC ACATAGTGTG TGTGTTTTGT ATACACTGTA TGACCCCACC 25 8 0 

CCAAATCTTT GTATTGT CCA CATTCTCCAA CAATAAAGCA CAGAGTGGAT TTAATTAAGC 264 0 

ACACAAATGC TAAGGCAGAA TTTTGAGGGT GGGAGAGAAG AAAAGGGAAA GAAGCTGAAA 270 0 

ATGTAAAACC ACACCAGGGA GGAAAAATGA CATTCAGAAC CAGCAAACAC TGAATTTCTC 2760 

TTGTTGTTTT AACTCTGCCA CAAGAATGCA ATTTCGTTAA TGGAGATGAC TTAAGTTGGC 2 82 0 

AGCAGTAATC TTCTTTTAGG AGCTTGTACC ACAGTCTTGC ACATAAGTGC AGATTTGGCT 2 880 

CAAGTAAAGA GAATTTCCTC AACACTAACT TCACTGGGAT AATCAGCAGC GTAACTACCC 2 940 

TAAAAGCATA TCACTAGCCA AAGAGGGAAA TATCTGTTCT TCTTACTGTG CCTATATTAA 3000 

GACTAGTACA AATGTGGTGT GTCTTCCAAC TTT CATTGAA AATG CC AT AT CTATACCATA 30 60 

TTTTATTCGA GTCACTGATG ATGTAATGAT ATATTTTTTC ATTATTATAG TAGAATATTT 312 0 

TTATGGCAAG ATATTTGTGG T CTTGAT CAT ACCTATTAAA ATAATGCCAA ACACCAAATA 3180 

TGAATTTTAT GATGTACACT TTGTGCTTGG CATTAAAAGA AAAAAACACA CATCCTGGAA 3240 

GTCTGTAAGT TGTTTTTTGT TACTGTAGGT CTTCAAAGTT AAGAGTGTAA GTGAAAAATC 33 0 0 

TGGAGGAGAG GATAATTTCC ACTGTGTGGA ATGTGAATAG TTAAATGAAA AGTTATGGTT 33 60 

ATTTAATGTA ATTATTACTT CAAATCCTTT GGTCACTGTG ATTTCAAGCA TGTTTTCTTT 3420 

TTCTCCTTTA TATGACTTTC TCTGAGTTGG GCAAAGAAGA AGCTGACACA CCGTATGTTG 3480 

TTAGAGTCTT TTATCTGGTC AGGGGAAACA AAATCTTGAC CCAGCTGAAC ATGTCTTCCT 3540 

GAGT CAGTGC CTGAATCTTT ATTTTTTAAA TTGAATGTTC CTTAAAGGTT AACATTTCTA 3 60 0 

AAGCAAT AT T AAGAAAGACT TTAAATGTTA TTTTGGAAGA CTTACGATGC ATGTATACAA 3 660 

ACGAATAGCA GATAATGATG ACTAGTTCAC ACATAAAGTC CTTTTAAGGA GAAAATCTAA 372 0 

AATGAAAAGT GGATAAACAG AACATTTATA AGTGATCAGT TAATGCCTAA GAGTGAAAGT 3780 

AGTTCTATTG ACATTCCTCA AGATATTTAA TATCAACTGC ATTATGTATT ATGTCTGCTT 3 840 

AAATCATTTA AAAACGGCAA AGAATTATAT AGACTATGAG GTACCTTGCT GTGTAGGAGG 3 900 

ATGAAAGGGG AGTTGATAGT CTCATAAAAC TAATTTGGCT TCAAGTTTCA TGAATCTGTA 3 960 

ACTAGAATTT AATTTTCACC CCAATAATGT TCTATATAGC CTTTGCTAAA GAGCAACTAA 402 0 
TAAATTAAAC CTATTCTTTC AAAAAAAAA 

Seq ID NO: 461 Protein sequence 
Protein Accession #: NP__037504.1 

1 11 21 31 41 51 

I I I I I I 

MSRTAYTVGA IiLLLLGTLDP AAEGKKKGSQ GAI PPPDKAQ HNDSEQTQSP QQPGSRNRGR 6 0 

GQGRGTAMPG EEVLESSQEA LHVTERKYLK RDWCKTQPLK QTIHEEGCNS RTI INRFCYG 12 0 

QCNSFYIPRH IRKEEGSFQS CSFCKPKKFT TMMVTLNCPE LQPPTKKKRV TRVKQCRCIS 180 
IDLD 

Seq ID NO: 462 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..2733 

1 11 21 31 41 51 

1)11)1 

ATGAAAGTTG GAGTGCTGTG GCTCATTTCT TTCTTCACCT TCACTGACGG CCACGGTGGC 60 

TTCCTGGGGA AAAATGATGG CATCAAAACA AAAAAAGAAC TCATTGTGAA TAAGAAAAAA 12 0 

CATCTAGGCC CAGTCGAAGA ATATCAGCTG CTGCTTCAGG TGACCTATAG AGATTCCAAG 180 

GAGAAAAGAG ATTTGAGAAA TTTTCTGAAG CTCTTGAAGC CTCCATTATT ATGGT CAC AT 240 

GGGCTAATTA GAATT AT CAG AGCAAAGGCT ACCACAGACT GCAACAGCCT GAATGGAGTC 300 

CTGCAGTGTA CCTGTGAAGA CAGCTACACC TGGTTTCCTC CCTCATGCCT TGATCCCCAG 3 60 

AACTGCTACC TTCACACGGC TGGAGCACTC CCAAGCTGTG AATGTCATCT CAACAACCTC 420 

AGCCAGAGTG TCAATTTCTG T G AG AG AACA AAGATTTGGG GCACTTTCAA AATTAATGAA 480 

AGGTTTACAA ATGACCTTTT GAATTCATCT TCTGCTATAT ACTCCAAATA TGCAAATGGA 540 

ATTGAAATTC AACTTAAAAA AGCATATGAA AGAATTCAAG GTTTTGAGTC GGTTCAGGTC 600 

ACCCAATTTC GAAATGGAAG CATCGTTGCT GGGTATGAAG TTGTTGGCTC CAGCAGTGCA 660 

TCTGAACTGC TGTCAGCCAT TGAACATGTT GCCGAGAAGG CTAAGACAGC CCTTCACAAG 720 

CTGTTTCCAT TAGAAGACGG CTCTTTCAGA GTGTTCGGAA AAGCCCAGTG TAATGACATT 780 

GTCTTTGGAT TTGGGTCCAA GGATGATGAA TATACCCTGC CCTGCAGCAG TGGCTACAGG 840 

GGAAACATCA CAGCCAAGTG TGAGTCCTCT GGGTGGCAGG TCATCAGGGA GACTTGTGTG 90 0 

CTCTCTCTGC TTGAAGAACT GAACAAGAAT TTCAGTATGA TTGTAGGCAA TGCCACTGAG 960 

GCAGCTGTGT CATCCTTCGT GCAAAATCTT TCTGTCATCA TTCGGCAAAA CCCATCAACC 1020 
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ACAGTGGGGA ATCTGGCTTC GGTGGTGTCG ATTCTGAGCA ATATTTCATC TCTGTCACTG 1080 

GCCAGCCATT TCAGGGTGTC CAATTCAACA ATGGAGGATG TCATCAGTAT AGCTGACAAT 1140 

ATCCTTAATT CAGCCTCAGT AACCAACTGG ACAGTCTTAC TGCGGGAAGA AAAGTATGCC 1200 

AGCT CACGGT TACTAGAGAC ATTAGAAAAC ATCAGCACTC TGGTGCCTCC GACAGCTCTT 1260 

CCTCTGAATT TTTCTCGGAA ATTCATTGAC TGGAAAGGGA TTCCAGTGAA CAAAAGCCAA 1320 

CTCAAAAGGG GTTACAGCTA TCAGATTAAA ATGTGTCCCC AAAATACATC TATTCCCATC 13 80 

AGAGGCCGTG TGTTAATTGG GT C AG ACCAA TTCCAGAGAT CCCTTCCAGA AACTATTATC 1440 

AGCATGGCCT CGTTGACTCT GGGGAACATT CTACCCGTTT CCAAAAATGG AAATGCTCAG 150 0 

GTCAATGGAC CTGTGATATC CACGGTTATT CAAAACTATT CCATAAATGA AGTTTTCCTA 1560 

TTTTTTTCCA AGATAGAGTC AAACCTGAGC CAGCCTCATT GTGTGTTTTG GGATTTCAGT 162 0 

CATTTGCAGT GGAACGATGC AGGCTGCCAC CTAGTGAATG AAACT CAAGA CATCGTGACG 1680 

TGCCAATGTA CTCACTTGAC CTCCTTCTCC ATATTGATGT CACCTTTTGT CCCCTCTACA 1740 

ATCTTCCCCG TTGTAAAATG GATCACCTAT GTGGGACTGG GTATCTCCAT TGGAAGTCTC 1800 

ATTTTATGCC TGATCATCGA GGCTTTGTTT TGGAAGCAGA TTAAAAAAAG CCAAACCTCT 1860 

CACACACGTC GTATTTG CAT GGTGAACATA GCCCTGTCCC TCTTGATTGC TGATGTCTGG 1920 

TTTATTGTTG GTGCCACAGT GGACACCACG GTGAACCCTT CTGGAGTCTG CACAGCTGCT 1980 

GTGTTCTTTA CACACTTCTT CTACCTCTCT TTGTTCTTCT GGATGCTCAT GCTTGGCATC 2040 

CTGCTGGCTT ACCGGATCAT CCTCGTGTTC CATCACATGG CCCAGCATTT GATGATGGCT 2100 

GTTGGATTTT GCCTGGGTTA TGGGTGCCCT CTCATTATAT CTGTCATTAC CATTGCTGTC 2160 

ACGCAACCTA GCAATACCTA CAAAAGGAAA GATGTGTGTT GGCTTAACTG GTCCAATGGA 222 0 

AGCAAACCAC TCCTGGCTTT TGTTGTCCCT GCACTGGCTA TTGTGGCTGT GAACTTCGTT 22 80 

GTGGTGCTGC TAGTTCTCAC AAAGCTCTGG AGGCCGACTG TTGGGGAAAG ACTGAGTCGG 2 340 

GATGACAAGG CCACCATCAT CCGCGTGGGG AAGAGCCTCC TCATTCTGAC CCCTCTGCTA 2400 

GGGCTCACCT GGGGCTTTGG AATAGGAACA ATAGTGGACA GCCAGAATCT GGCTTGGCAT 24 60 

GTTATTTTTG CTTTACTCAA TGCATTCCAG GGATTTTTTA TCTTATGCTT TGGAATACTC 2520 

TTGGACAGTA AGCTGCGACA ACTTCTGTT C AACAAGTTGT CTGCCTTAAG TTCTTGGAAG 2580 

CAAACAGAAA AGCAAAACTC ATCAGATTTA TCTGCCAAAC CCAAATTCTC AAAGCCTTTC 2640 

AACCCACTGC AAAACAAAGG CCATTATGCA TTTTCTCATA CTGGAGATTC CTCCGACAAC 2 700 
AT CATGCTAA CTCAGTTTGT CTCAAATGAA TAA 
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40 



45 



50 



MKVGVLWLIS 
EKRDLRNFLK 
NCYLHTAGAL 
IEIQLKKAYE 
LFPLEDGSFR 
LSLLEELNKN 
ASHFRVSNST 
PLNFSRKFID 
SMASLTLGNI 
HLQWNDAGCH 
ILCLIIEALF 
VFFTHFFYLS 
TQPSNTYKRK 
DDKAT I I RVG 
LDSKLRQLLF 
IMLTQFVSNE 



11 

I 

FFTFTDGHGG 
LLKPPLLWSH 
PSCECHLNNL 
RIQGFESVQV 
VFGKAQCNDI 
FSMIVGNATE 
MEDVISIADN 
WKGIPVNKSQ 
LPVSKNGNAQ 
LVNETQDIVT 
WKQIKKSQTS 
LFFWMLMLGI 
DVCWLNWSNG 
KSLLILTPLL 
NKLSALSSWK 



21 
I 

FIiGKNDG I KT 
GLIRIIRAKA 
SQSVNFCERT 
TQFRNGSIVA 
VFGFGSKDDE 
AAVSSFVQNL 
ILNSASVTNW 
LKRGYSYQIK 
VNGPVISTVI 
CQCTHLTSFS 
HTRRICMVNI 
LLAYRIILVF 
SKPLLAFWP 
GLTWGFGIGT 
QTEKQNSSDL 



31 
I 

KKELIVNKKK 
TTD CNSLNGV 
KIWGTFKINE 
GYEWGSSSA 
YTLPCSSGYR 
SVIIRQNPST 
TVLLREEKYA 
MCPQNTSIPI 
QNYSINEVFL 
ILMSPFVPST 
ALSLLIADVW 
HHMAQHLMMA 
ALAIVAVNFV 
IVDSQNLAWH 
SAKPKFSKPF 



41 
I 

HLGPVEEYQL 
LQCTCEDSYT 
RFTNDLLNSS 
SELLSAIEHV 
GN I TAKCES S 
TVGNLASWS 
SSRLLETLEN 
RGRVLIGSDQ 
FFSKIESNLS 
IFPWKWITY 
FIVGATVDTT 
VGFCLGYGCP 
WLLVLTKLW 
VIFALLNAFQ 
NPLQNKGHYA 



51 
I 

LLQVTYRDSK 
WFPPSCLDPQ 
SAIYSKYANG 
AEKAKTALHK 
GWQVIRETCV 
ILSNISSLSL 
ISTLVPPTAL 
FQRSLPETII 
QPHCVFWDFS 
VGLGISIGSL 
VNPSGVCTAA 
LIISVITIAV 
RPTVGERLSR 
GFFILCFGIIi 
FSHTGD S SDN 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



Seq ID NO: 464 DNA sequence 
Nucleic Acid Accession #: AB035089.1 
55 Coding sequence: 9845.. 10219 



60 



65 



70 



75 



80 



85 
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I 

GGGCATGCAG 
CAGTTCTAGT 
CCAAGAGGAA 
TTGGTTTGAA 
AGGAGGAAGG 
GAGACTATTT 
AAGAAAGGAA 
GTAGACAGAA 
AATCTCCTCC 
AATAAAATGT 
AAGGAAGTGA 
GTAATGACAG 
GGATTCCTGG 
CAAGTGTTCA 
TCAGGATATG 
TAGGAGAACT 
CAAGCTTTCT 
TTCAACCTTC 
CCATAGATTG 
CCTCCATTCC 
TGGTACCCGA 
CTGGATGCAG 
TTTCCTCTTC 
CTATTCCCCG 
ATATATATTG 
GTGAGAAATT 
CCTATGTGTT 
AAAACAAACT 
TGTGGAGAAA 



11 
I 

CCATCGGGGA 
AAAAGGGAGA 
TTAGGGAGAG 
AGCATACAGT 
GAGGCAAGAG 
CCCTCTCTGC 
AGCTAGTTAG 
TCCTTGGGAA 
ACTAACCAGT 
TCTCTTGACT 
GGTCCAGGAA 
GATATTTCCT 
AGCCAATGAA 
TATGCAAAAA 
TCCAGTCTCA 
ATTTAGGAAC 
CTAAATTTAA 
AGGGCAAACC 
GTCCCCTGTA 
CAGGATGAGC 
GTAAAT CCAT 
ACTCAGCTGA 
TTTCTTTTTC 
ACTTCAATCA 
ATACAGGAGA 
TCCAGATTGG 
TCTGGCACCT 
CACGGCTGGT 
TCAGAACTCT 



21 

1 

AAATCCATAG 
ACATCAATAT 
AGTTATAAGA 
AAATATGATG 
ATAATATCAT 
TTTTCAAACC 
TCTTGTTCTG 
TACAGTAATT 
TTCCCTATAG 
TTGTTACTTA 
AATCTAGGAG 
GAAAGTGTAA 
GTTGGTGTAT 
CTTCTTGGAA 
CACACCAGGA 
AGAAAAAAAT 
GCAAACTCCT 
TCCGTGCCTC 
ACCCCGGTTT 
TTGTTGCTTC 
CCTACTCCAA 
GAAGACCATT 
TTCTATCTTT 
GGGAACTTAT 
CCTAAGAAGA 
AAACACAGCT 
TGTTGTAGAT 
GTTAAAAAGG 
ATTCACGGT C 



31 
I 

TG CAG AT AAA 
AGGATGTTTC 
GATCAGCAAG 
TCTGTCCCTG 
TTTCTCTGTG 
TTACTGGAGT 
AGGTTGTTCA 
GACATATATT 
ATTGCCACAA 
ACAATGCTGA 
ATATTTCTTA 
TTTCCCATTG 
GTTTATGAAA 
TTTCTGAGTT 
TATGTCCTTT 
GCCTGAAATG 
GGTCATTTTC 
AGACGTTTAG 
GTCTCAGCTT 
TGTCCTATGA 
TAGAGGAAGG 
ATTCATTTTT 
GGATTTTTAG 
ACCTCTTAAA 
GCATGTCTTG 
TCCTTTCTCC 
AAATCTCCCT 
GCCCATGACA 
GGTTGGAATG 



41 

1 

GCAAGGAGGA 
TTAGCAATAG 
GGGACAGGGT 
GCAGTGTTGG 
CTCCAACTGT 
TGTTTTCCCT 
ATGTATACAT 
CTGTTATTTG 
GCACATAATA 
GAAAACTTTA 
ACCAATCTAT 
AGGATTTGTT 
TATCAAGAGA 
CTCTGTGGCA 
CTAGCCTGTC 
ATTTCTCATT 
AGTTAGTACC 
CCATAGTCTG 
GTTATCCTGT 
GACATTAGAT 
TCCATTTTTG 
GGAATTCTTT 
TCCATCAACG 
CTCATTCAGA 
GGGGTTGAGG 
CATCCAGCCC 
TGACTTTGTG 
ATACCAAGTG 
CACACTTGTG 



51 

1 

AGAAGAAGGA 
AAAAAGAAGG 
TAGATTTGGT 
CAGAGTAGGA 
ACTTACATAT 
CATGAAAACC 
ATCTATATCT 
ATGCTTGAAA 
AGAAACAATA 
CAG CCTTCAT 
AAAGGCATTA 
TTTAATTTCT 
CATAAGTTGG 
ATATATGACA 
TATCACATGC 
TGAACTCATC 
TTTCCTTAAG 
AAATTCTCTT 
TTTTTTCTTC 
TCCTTTTCTT 
TCTTATAGCG 
ATCTCAGATA 
CCCCATTAGT 
GACTCAAAAC 
AAACAGGCAG 
CTACTTTCAG 
ATGTGCTGAG 
TTGGGGAGAA 
CAGAATTCTA 
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120 
180 
240 
300 
360 
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480 
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600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
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TGGAGAAGAG TCTGGCATTT CCTCAAAATG TTAACCTGGA TTTACCATAT GACCCAGCGA 1800 

TTTCATTCAT AGGTTTATAC TCAAAAGAAA TGAAGAAATA TGCCATGCAA AAAAATGTAC 1860 

ATGAAAGGTC ACAACATCAT TATTCATAAT AGTAAAAGGA TGGAAACAAC ACAAATGTCC 1920 

AT CAACTTAT GATTAAAGAA AATCTGGTCT ATT CATAGAA TGGAATATTA TTCGACCACA 1980 

5 AAAAGGAATG ATGTACTGAT CCATGCAATG ATGTGGACAA ACCATGAAAA TAACACTAGA 2 040 

TTAAAGAAGC CAGTCACAAA AGGACTTACT GTATGATTCC ATTTACCTGA AATGTTTGGA 2100 

ATAGGCAAAT CCATAGAAAC AGGAGGTAGA TTCCTGGTTT CCAGGGTCTC CAGGAAGGGA 2160 

AGAATGAAGT ACAAGATTTC TTTTGGAGGT AGTGAAATTG TTGTGGAATG AGATCATGAT 222 0 

GATGATAGCA CAACTTTGTG AATATAATAA AATCATTGAA TTGTACAGTT GAATTTATGG 2280 

10 TATATAAATT ATATGTTAAT AAAAAGGGGG TCCACAAAAC AAACAGCCCC CCACTCTGGT 2340 

TGTCAGGGAG ATATTGGATT AAATGGCCTT GGACAACAAC CCCTCTCCCT GGCCACAGAC 2400 

ATTCTTCAGA TTACAAGATA TTCCAGGGGA AACACTGGAA TGAGTCTGAA GCCAGGTGCT 2460 

AAACAGAAGG ACCATTGAGA AATGTTGTGA TCCTGACAGG TCAAGCAATT TATTTTTCGG 252 0 

CTTCATTTTT AAATGTAAAA TTAGAAAGCT GCCATTTAAA ATGGCCCGTC TGTTTCAATT 2580 

15 GCTCTTCTCA GTGTCAGCCT GTTAACTCAA TGTGTTAGTC TGTTTTCATG CTGCTGATAA 2640 

AAACATACCT GAGACTGGCA AGAAAAAGAG GTTTAATTGG GCTTAGAGTT CCACGTGATT 2700 

GGGGAGGCCT CAGAATCACA GTAGGAGGCA AAAGTTATTC TTACATGGTG GCTGCAAGAG 2 760 

AAGATGAGGA AGAAGCAAAA GAAGAAACCC CTGATAAACC CATCGGATCT CCTGAGGCTT 2 820 

ATTAACTATC ATGAGAATAG CACAAGAAAG ACCGGCCCCC ATGATTCAAT TACCTCTACC 2880 

20 TGGGTCCCTC CAATAACATG TGGAAATTCT GGTAGATACA ATT C AAGTTG AGATTTGGGT 2 940 

GGGAACACAG CCAAACCATA TCACTCAGCA AGGCAGATAA CTTTCTCACT GAGCCTATGC 30 00 

AACAGAAAAC CATCTGGGAT GGTTGTAAGG GGCACAGGAA GTGACTGGTA GGATCACTGC 30 60 

CAAAGCTGAG CACTCAGGAG AAGGCAATAG AATCCTATTC TCCATAGTAT GCTATAAGAT 312 0 

ACTGAAGTAC ACTTCTTCAC TATCTCTTTG GACTTAGAAT TAGCACTACA TTCCTTGTTA 3180 

25 TACAGAAAAA TTACTAAGGA AATTCATAGG ATGACAAAAA CTTTCAGAAC TGAAAAACAG 3240 

GAAATGTAAG CTTTTTAGTT CTTTGGTATT CGAAGTATGC CTAAAAGACA ATGCAAAATC 3300 

CAAGAAAAGA ATGGTGGGGT TTTTGTTTGT TTGGTTTTGT TTTTGTTTTA CAG CTGGAGT 3360 

AGAATACAAA GGGATGGAGT TGAAACAAAT " GAGAGGAAAT TGGAATTCTA AACTTATTCT 3420 

CATTGGCATT AGAAAGGCAC CTACATGTAT TTCACATGAG CCGGTGACTG CTGACTTGCA 3480 

30 TTCTTATTTT TTCCCTATAG ATTAAAAAGG AGGTACAATG GTAGAACTGT AATCCTGTCC 3540 

TTTGTCATAA ATTTTCATAT TCATAAAGGT GAGTGTTAGC CCGCTTGTGA AATCTGAAGT 3600 

TGAGTAACTT CAAATACTAA CCACAGAGGG AAAGGCAGCA AGAGGAGAGG CATAAATTTA 3660 

GGATCTCACC CTTCATTCCA CAGACACACA CAGCCTCTCT GCCCACCTCT GCTTCCTCTA 372 0 

GGAACACAGG TAAGAGCTTC AAGCCTCTCC AGCTTAATAA CATGAATTAT TTTTGAGAAT 3780 

35 AATAATGATA CTGTGTTCTA TAT C ATGCAT CTCCTGCATT CTGTCTGATT ATATTTTACT 3840 

TATTCTGCCA GAGCAAAATT AAAATACCTA TTTCATCTGA TTTGTCCTTT AT CTAAATTG 3900 

CTTAGTTCCA AGTAAACCAA GGCACTTTTA GGAACACAGA GGGAGAGTGC CTTGCAGCCA 3960 

GAGAGTCTTG AAGGAGATGT CAGGGACGCA TCTTAACAGC TGGTTGGATG TGATCCACAG 402 0 

AGGTCTCCTG TTAGCATTCA TTGTAAAGCC ATCCTACCTA GCTCTAGTGT AACCAGCAAT 4080 

40 GAAAGAAAGA TAAAGAGGGT CGATTACTTA TTTACAATAG TCTTTAAAAA CGTAGTTTTG 4140 

TAAGCCTTCT AATTAGGACA TTAATATATT TAATATATGC ACATTGTAGA AAGATTGAAG 4200 

CGTTAAAAAT AAGAGAAAAA CTTTAAATGT CAAAATCTCA CAACCCAGAT ATATCATTTC 4260 

TTTAAGAAAA TTGTACTACA AAATACCATT CCATTTATTA AAGTCATTCT GACAGGAATC 432 0 

TGATGCTTTT CCAGGAGTTC CAGATCACAT CGAGTTCACC ATGAAT TCAC TCAGTGAAGC 4380 

45 CAACACCAAG TTCATGTTCG ATCTGTTCCA ACAGTT CAGA AAATCAAAAG AGAACAACAT 4440 

CTTCTATTCC CCTATCAGCA TCACATCAGC ATTAGGGATG GTCCTCTTAG GAGCCAAAGA 4500 

CAACACTGCA CAACAAATTA GCAAGGTAGC TATCAGCATC ATTACGTTGT CCTGTTGCAG 4560 

TTTTTCTCTG GTTCCGTCGG CTAGCACGCA GATGGTAATA GATGTGGTGG TCTGATGGGT 4620 

AGCACAGGGG GCTGTGCAGG AATTCCCATA ACTGTGAGAC CACTGACTTA AACAGATCTT 4680 

50 TTGAGTAAAG TTTTCTTGTC CCGCTTCATG TCTCTTCCAG GTTCTTCACT TTGATCAAGT 4740 

CACAGAGAAC ACCACAGAAA AAGCTGCAAC AT AT CATGTG AGTCACAGAG CACTCTGATT 4800 

CAGCTTTAGA TCCCTGAACA GGT CATAGTT TAAACCTGGA ACTTCACAAA AACTAAGAAA 4860 

AGGCCAGTTT TAGGGAAAAT CTTGGACACA AAGATTGAGA CATACAGAGT GGGTTGGCAT 4920 

TTCATGGCAC ATAATTATTA TTCCTCATTT CTGCGTTACT AAAAGACAGT CAGCACTGTA 4980 

55 CCT CAGAGCA TAGGTCTGGA TCAGGATAGG CTGGGTT CAG ACTCCAGCTT TGCTCTTCAC 5040 

AAATGATGAA TAAGAGCAGG ACACAACTGC TCGGAGTCCC AGTGACCTCA TCCCAGAAAA 5100 

CTAAGGGTAA GAAAAAATCT GACTCAATAC ATGCAAATAC ATGCAAATGT TTACAACAGT 5160 

GCCTTGCCCA T AAAAGT CAT AATAAATGTT ATTATTATTA TAAAGTAGCT ATAATTATAC 522 0 

TAATCATAAT AATGTGAAAA TAATTTAATT TTCATTGAGT CATTAATGAG ATTCAGAGGA 5280 

60 ATAAGCACAA GTCCAAGTAT ATTTTGGAAA ATGATTGCTA TGGAATATAT TGGTTTAGAG 534 0 

CCTTAATAGT GCAAAATGCT TTGCTGGAAG GTAGAAAGTT CTAGATTTAA ACAGGCTTAG 5400 

GTTCAAAACT TGGCACTTCT AATTTATGTC TCTATAAACA GGGTTTTTTT CCCCATTCTC 5460 

TGAGCTTTCT TGTGTTCATC TGAATTGAAC TAAAGACTTA GAGTTACCCA TGTAAAGTCC 5520 

TTAGCCATGG ACCTGGCATA CACTCTTCTT ACGTGCAGAG AATGACCATC ATGAGGAAAG 5580 

65 AGCCACAGAT CAGT CAATGT GTCCTACAAG ATAATAGCAC CAACAGGTAT AACAGGGCTT 5640 

CCTGGCATAA TCTATTTAAA ATATCCAACC TTCAACATAC TCGTATCCTT GATGACTGTT 570 0 

AGAAGTGAAA TATGGTCCTT GCCCATAAGG AGCTGAGAGT TTAACTGGGA AGCTAAACCT 5760 

AACCCTTTAA ACCAACAAGG AGAAAATCTA CTGGTAGACA GCGCTGCATC TTTAGTT CAG 582 0 

AAGAGAAAAG ATTGCAGTAC GTTAGAGCAA GAAGAATTTT CTGGAAGAAG TCAAATATAA 5880 

70 GGTGGATTTT GAAGGGTATT TGAGGTGAAA TACACCAATT ATCAGGGAAT AACATCAAAG 5940 

GTCCTCAATG AGACTACCAG CATT TAGGG A CTGATCTAAC AGACTTAGCA TGGGTTTAGT 6000 

ATTTACATTG ATACAGCAAT TGAATGATCT CCTTTTTTGA TGTTTGAAGG TTGATAGGTC 6060 

AGGAAATGTT CATCACCAGT TTCAAAAGCT TCTGACTGAA TTCAACAAAT CCACTGATGC 6120 

ATATGAGCTG AAGAT CGCCA ACAAGCTCTT CGGAGAAAAG ACGTATCAAT TTTTACAGGT 6180 

75 AATTTCACCT GGCCTACCCA CATTTCATTT GCATCCTGAT GTCTGTGTCT CTGAGTGGCC 6240 

AAATGGAAGA AAGCAAGGCA GATGAGCCTG GCCGACCCAG GTGGAGAGCA TTTACTCAGA 63 00 

GTGCATTAGC TCCATTTCCA CAACTCTCCC CCACTGGAGT GTCCCAGACC CCAACGATAC 63 60 

ATCACTGAAG TGTGGATTTA GGGATAATCT TGTGATAAAA GAGGAGGTTG TGTAATAGAG 6420 

TGAGTAAGAG TAATAAGTAA TAAGATACCA TCGATAAACT GGCACTGACT CAGTCACATA 6480 

80 CGATACATCT TGGTGGGAAA TGTATGACTA ATGGGATATT ATTGGAATGG GCAGGCTTGG 6540 

GTGAGTTCCT GAGAATAGTT GAGGAAGTAC CAGGAAATAT TGAATGCACA GGATGAAAGA 6600 

CAAAAACAAA GATCAGAAAC ATCATGGTTA AAATTACTGG AGAGAAGTCT GAGAAGCAAT 6660 

GAATCTCCTT CAGGGAAGCC TGCTCTGCAG TTTGCAAACC ACAGCCTCTT CTGCTTCTGC 6720 

CTTTTGCCAA GATGATATTG ACCTTCAGTG ACCTCTTTCT TGTGCCAGCC CACATTCCCC 6780 

85 TTTTGCATTG CCTACATGAC ACCTGTATAA AAATATCCAT GGACAGGAGA TACTGCATCT 6840 

ATTCAGGGTC TGGATTCAGC TTACTGTTGT TACAAATAAG TAAGTTTGGT AATATATAGT 6900 

TACATAAATT ACTCCTAATT CCTACTTCTT CCTTCATATC TCAAAGGAAT ATTTAGATGC 6 960 
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CAT CAAGAAA TTTTACCAGA CCAGTGTGGA ATCTACTGAT TTTGCAAATG CTCCAGAAGA 702 0" 
AAGT CGAAAG AAGATTAACT CCTGGGTGGA AAGT CAAACG AATGGTAGGA GAGCCACCCA 70 80 
TTATAGAAAC ACCTTTGAGA AACCTATGCC AGTGAGCCTT GTGCTTGACA CTGCATGGGG 7140 
GAACAGGTGT GGGGATTGAG ATGGGTTTGC AGGGAGGGCT GAAGAGGGCA CTCCAGATGA 7200 
AGGATTTGTC CAAATGAATA TGAAGAGAGC CTAGGGGAGC CAAGGAGGAA AT CACAGGAA 7260 
G CCAATTAGA TGGAAACACA TCTGGAGAAT TATTTGCTTA TGGCCCTGCA TGACAATAGC 7320 
TTTGTGGATC CCCTGTCTCC GCTCAGACCT ATTTTGAGAT CATAT CCTTT ACTTTAAATC 73 80 
AGACTCAAAT TTTTATGATG AATATTTAAT AGAAAACATT AGAAAGCGTC TCTCGTCTCC 7440 
TTTACTAATT GGGAAACAAG CAGCTCTCTG GTAAATCACC CTTTTGTCTC TGAGCTGGAG 7500 
CTGCCTGGAT CACATCTGTA GCCAATGTGT TCTGCAGGGA TTATCACAGC TCTCTTCCCC 7560 
AT CAAGGGCA AAGAGCTTGA CAAAGTCTCC ATTCTACAGA CATCTTTCTT ACCTCCCACC 7620 
TCTCATTACA GGCCAAACTT ACAGCAACTC AACATGAGAG TGAATAGGAA GATACCCCCG 7680 
GAAGTAGTGT CTGACAGCAC AGGACATGCG TTTCATATTA CAGAGCT CAA GTCACTCATC 7740 
CTAAAATGCA ATCAGGGCCT CCTTCCTCTG AATGGGGACC CCGTAGTTAA AAAAAAATAA 7800 
AAGTAGGAAG AGGAGGGAGG GAGAAAGGAA AGACACATGT TGGAAGAGTA GACAAAATCA 7860 
GTTTATCAGT ATTCCAAATC AGATGATTGG AGACATTCAT ACACAGAGAA CGTGAACTCC 792 0 
TTCTCTATCA CAAGAAGTGA TGTCTCCATC AAGGGTAACT TTATACGACT GGAGCCTTGA 7980 
AGAAAGCTGC AT CTGGTGAA CCACTGGTCA GTGAGTCTAA CAATTCAAAG AT CAAAGTCA 8040 
GTGAGTCTCA AGCAGGGATT TGGGTCAATA ATTAACGATC AGTCACGAAC ATTTGCAAAG 8100 
CATCTTCCAG ACAAGCCATT TGTAGCTTGT GTAAAAGACT CTTTTATTCT TTCCCTTGCA 8160 
GAAAAAATTA AAAACCTATT TCCTGATGGG ACTATTGGCA ATGATACGAC ACTGGTTCTT 8220 
GTGAACGCAA TCTATTTCAA AGGGCAGTGG GAGAATAAAT TTAAAAAAGA AAACACTAAA 8280 
GAGGAAAAAT TTTGGCCAAA CAAGGTATTG TCTATATTTT ATTTATATAG TGTAATATGT 8340 
TAATACATGG AATGTTAAAC ATTTCTGATG GAATGTAACA TGATAAGTAA AAAATAAAAA 8400 
TTGTTCATGT CTGTTATTTT GTTGTTTTAC TCTTATAACT TTATTTAGTT AGGAATACCT 8460 
GAAAAACTAT TGTTTCTAAC TCATGGAATT CCTGGGTTAT TTCTTAGAAG AAGAAGGATG 852 0 
TGTTGCTATC TCAATAATAT TATCTTTTTT GTCTTGTGTT TCACGTGTTA TTTGTTGGAC 8580 
ACATTGATTT ATTGCAGAAT ACATACAAAT CTGTACAGAT GATGAGGCAA TACAATTCCT 8640 
TTAATTTTGC CTTGCTGGAG GATGTACAGG CCAAGGTCCT GGAAATACCA TACAAAGGCA 8700 
AAGATCTAAG CATGATTGTG CTGCTGCCAA ATGAAATCGA TGGTCTGCAG AAGGTAAGAA 87 60 
CTTGCAT CTA CAACTCTTCC TTCTACTGCC GGACATTTTT CCAAAGATAC C AAGT T T AAA 8820 
CAAGGTAAAA GCTTATGACC GAGTTGCCTC AAAATGATGA AAAATTCTAA ATGAGGAATG 8880 
ATGACTCACC TTCATATTAC AAATATTTGA GCATAGGGCC TGACACAAAC TGAAAGCTTA 8 940 
GTTTTTGTTT GTTTGTTTGT TTTTATTATT ATTATTATAA TACTTTAAGC TTTAGGGTAC 90 00 
ATGTGCACAA TGTGCAGGTT AGTTACATAT GTATACATGT GCCATGCTGG TGTGCTGCAC 90 60 
CCATTAACTC ATCATTTAGC GTTAGGTATA TCTCCTAATG CTATCCCTCC CCCCTCCCCC 912 0 
CACCCCACAA CAGTCCTCAG AGTGTGATGT TACCTTCCTG TGTCCAAGTG TTCTCATTGT 9180 
TCAATTCCCA TCTATGATTT AATTCCATCT ATGGCTTAGT TAATGATTAA TTTATTAGAG 9240 
TTACATGCAT TGGATAT CAA TTTGATGATA TTATTATGCA GCAATTTAAA CTTGACTGGG 93 00 
AGAAATATAT ACCAATGTGA GGAAAGTTTA CAAATAGGCC GAGTAGAAAA GGGAATACAA 93 60 
ATTTAGGAAT TTAGGGAATT ACAATTTAAT AATTGCAATG TGTACTAAAT AATGTATACA 9420 
GAAAAAT AT G ATGAGCCTAT TAAAAATTGA CACATGTAGT AGGCTGTTGG CACAAGAAAT 9480 
AGTGATACAT ACAGTTCATT GTGTACAAAA TAATGTAATC ATATTTTACA TGTGTATCAT 9540 
ACAGTTGTAT ACATACATAT GTACACATAT ACATATACGT AAAAACATGA TTCTGTTTTT 9600 
ACATACATGT ATATACATAT ACACATATAA CCCAATGTAT TTATATATTC AGGACTCATA 9660 
TTTTACCTAT TAGAATAATA ATGTCTATTA AAGTGAACCT TCTGTATTTC ACATTTATTG 9720 
CCAAAATAAC GAATCTCCAC ATAGTCAATT CATTGTTAAG GTGTATTAGA GATCGACAGT 9780 
TAGT CATAT C AGTTTCTTTT TTCCATTTGT ATAGCTTGAA GAGAAACTCA CTGCTGAGAA 9840 
ATTGATGGAA TGG AC AAGT T TGCAGAATAT GAGAGAGACA TGTGTCGATT TACACTTACC 9900 
TCGGTT CAAA ATGGAAGAGA GCTATGACCT CAAGGACACG TTGAGAACCA TGGGAATGGT 9960 
GAATATCTTC AATGGGGATG CAGACCTCTC AGGCATGACC TGGAGCCACG GTCTCTCAGT 10020 
ATCTAAAGTC CTACACAAGG CCTTTGTGGA GGTCACTGAG GAGGGAGTGG AAGCTGCAGC 10080 
TGCCACCGCT GTAGTAGTAG TCGAATTATC ATCTCCTTCA ACTAATGAAG AGTTCTGTTG 10140 
TAATCACCCT TTCCTATTCT T CAT AAGGCA AAATAAGACC AACAGCATCC TCTTCTATGG 10200 
CAGATTCTCA TCCCCATAGA TGCAATTAGT CTGTCACTCC ATTTAGAAAA TGTTCACCTA 10260 
GAGGTGTTCT GGTAAACTGA TTGCTGGCAA CAACAGATTC TCTTGGCTCA TATTTCTTTT 1032 0 
CTATCTCATC TTGATGATGA TAGTCATCAT CAAGAATTTA ATGATTAAAA TAGCATGCCT 10380 
TTCTCTCTTT CTCTTAATAA GCCCACATAT AAATGTACTT TTCCTTCCAG AAAAATTTCC 10440 
CTTGAGGAAA AATGT CCAAG ATAAGATGAA TCATTTAATA CCGTGTCTTC TAAATTTGAA 10500 
ATATAATTCT GTTTCTGACC TGTTTTAAAT GAACCAAACC AAATCATACT TTCTCTTCAA 10560 
ATTTAGCAAC CTAGAAACAC ACATTTCTTT GAATTTAGGT GATACCTAAA TCCTTCTTAT 10620 
GTTTCTAAAT TTTGTGATTC TATAAAACAC AT CAT CAATA AAATAATGAC ATAAAATCAT 10680 
TTTTGCTTTA CCTGTTTTCT CTCTGGAAAG GGCAAGTGTC CAGTTACACA TAGGAAAGAT 10740 
AATTTAGAGA TATATTAATC ATATATAAAG GAAAATTAAA AACAGAGTAG TTCATGATGA 10800 
GCCTGGAGTA GAAGGCATAT CCCAGAACAG GAGGAGCCTT GTAAACCACA TAGGAACTTC 10860 
CTATTTTATG CTAAAGGGAT AAGAAACTCA TTACAGGCTT TGATGGTTGT TTGTCAAAGA 10 920 
GGGG CAT AAA ATTATCATAT CCACATCTAG AAAATACATC TCTGGCTACG CTGATATCAA 10980 
TGGATGCGAG GAAAGAACAG TGTGGTTACC ATATATAAAT TAGGAAATCA TTAGAGTATT 11040 
GGGAGTGGAA ATGGAGAGAA AGAAAGAGCC TGGGGGAATT ATTTAGGAAA TAATAGTTAC 11100 
AGAAAGACAT CTAAGTTGCT GACCTATCTG ACTGGATGGA TGGAAGAATA TCTTGTTTCT 11160 
GAGAGAAAAA AAGACTTTGG GTTTAAATTT GTACTTGATG AATTAAGGTA CTTTTAATAT 11220 
TCAAATGGAT TTGCCTGGCA GGCACTTGAA GATATTAGTC TAAATCTCAG AAACAGAATA 11280 
TGATCTGAAG CTCTAAATTT GTGATATTCA ATATAAATAC TTTAGAGTCA TTGGGATAAA 11340 
TATGGTAGTT GTAGCTAAAA GCAAAAATAA GATACTAGGG AGAAAGGATA AAGTTAGAAG 11400 
AAAGAAGAAT CTAGAATTGA CCTTGAAGTA TATCAG CATG TGTAAAGATC AGGAATTGAT 11460 
CATTTTTATT TTCCAGAAAG TAGCTTTTCT TAGGGTTCCA TATTTACTCC CAT AG ATT CT 11520 
TCCC 

Seq ID NO: 465 Protein sequence 
Protein Accession #: BAB21525.1 

1 11 21 31 41 51 

I I I I I I 

MNSLSEANTK FMFDLFQQFR KSKENNIFYS PISITSALGM VLLGAKDNTA QQISKVLHFD 60 

QVTENTTEKA ATYHVDRSGN VHHQFQKLLT EFNKSTDAYE liKIANKLFGE KTYQFLQEYIi 120 

DAIKKFYQTS VES TDFANAP EESRKKINSW VESQTNEKIK NLFPDGTIGN DTTLVLVNAI 180 

YFKGQWENKF KKENTKEEKF WPNKNTYKSV QMMRQYNSFN FALLEDVQAK VLEIPYKGKD 240 
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LSMIVLLPNE IDGLQKLEEK LTAEKLMEWT SLQNMRETCV DLHLPRFKME ESYDLKDTLR 
TMGMVNIFNG DADLSGMTWS HGLSVSKVLH KAFVEVTEEG VEAAAATAW WELSSPSTN 
EEFCCNHPFL FFIRQNKTNS ILFYGRFSSP 

Seq ID NO: 466 DNA sequence 

Nucleic Acid Accession ft: NM_001910 . 1 

Coding sequence: SO.. 1240 



PCT/US02/12476 



300 
360 



GGAGAGAAGA 
CCTTCTTTTG 
GCCCCTCAGG 
GTTCTGGAAA 
GAGTGCCAAG 
TGGCTCCCCA 
CCCCTCTGTG 
GTCCAGCACA 
GTCCGGGATC 
GT TTGGAGAA 
TCTGGGCCTG 
GATGGCTCAG 
AGGTGGTGCG 
CCTGAATTGG 
GGTGGGAGGC 
TTCCCTCATC 
CCCCGTGGAT 
CTTCACCATT 
CGTGGATGGA 
TGGGCCCCTC 
TGGGAATAAC 
CCTGCCTGTC 
GTTATTTTCC 
ACATGAGAAT 
ACTCCCACCA 
TGATTATGAA 
GCAGAGATCA 
CACACGGCCA 
GTACCTGGAT 
TAACATCCTT 
CTGTTTTTGT 
TTTGAAATGT 
TTGAAATCCC 
AGTTAGATCT 
CTTGTTGCAT 
CTGCTGGGTG 



11 

I 

AAGGAGGGGG 
CTGCTGGTGC 
AGGCATCCGT 
TCCCATAATT 
GAACCCCTCA 
CCACAGAACT 
TACTGCACTA 
TACAGCCAGC 
ATTGGAGCCG 
AGTGTCACAG 
GGATACCCCT 
AACCTGGTGG 
GGGAGCGAGC 
GTCCCAGTCA 
ACTGTTATGT 
ACTGGCCCTT 
GGAGAATATG 
AACGGAGTCC 
ATGCAGTTCT 
TGGATCCTGG 
CGTGTGGGAC 
TGACAGACCT 
AGAGAATGTA 
ACACACACAC 
CCGTCATGAT 
AATCAAAAAT 
TGGTATAATA 
GGCCTGTTTA 
CATTCTGAAG 
AAATATACAA 
AGTTGGATTG 
CTGTAAGTCT 
GAGGTGTCAT 
GGAGGTCTGA 
CCTGTCAGGA 
GGTTTGGAGT 



21 

I 

CAAGGGAGAA 
TCCTGGAGCT 
CCCTCAAGAA 
TGGACATGAT 
TCAACTACTT 
TCACTGT CAT 
GCCCAGCCTG 
CAGGTCAATC 
ACCAAGTCTC 
AGCCAGGCCA 
CCTTGGCTGT 
ACTTGCCGAT 
TGATTTTTGG 
CCAAGCAAGC 
TCTGCTCCGA 
CCGACAAGAT 
CTGTGGAGTG 
CCTATACCCT 
GCAGCAGTGG 
GGGATGTCTT 
TGGCCCCAGC 
TGAATATGTT 
GCTGTTTCCA 
ACACACATAT 
GGAGGAATTA 
TTTCACATTT 
AATCCCTTTG 
TCTACACTGC 
CAAATTCCGA 
TCGGAATTCA 
TTTGTATTAG 
CTTTCCATCT 
TTGACATGGT 
TTTTGTGGCA 
GG CAGATAAT 
TCTTGGCTTT 



31 
I 

GCTGCTGGTC 
GGGAGAGGCC 
GAAGCTGCGG 
CCAGTTCACC 
GGATATGGAA 
CTTCGACACT 
CAAGACGCAC 
TTTCTCCATT 
TGTGGAAGGA 
GACCTTTGTG 
GGGAGGAGTG 
GTTTTCTGTC 
AGGCTACGAC 
TTACTGGCAG 
GGGCTGCCAG 
TAAGCAGCTG 
TGCCAACCTT 
CAGCCCAACT 
CTTTCAAGGA 
CAT TCGACAG 
AGTCCCCTAA 
AGGCTGGGGC 
GGGTTGCAAC 
ACACACACAC 
CGTTATACAT 
GATTATGAAA 
CAACTCCACT 
TGCCCACTCC 
GCATTACATC 
AGCATCTCCC 
G ATT CAAGCA 
ACAGAGTTTA 
TCTCTGAACT 
AAAATACTTC 
GCTGGTGCCT 
AATCATTCAT 



41 
I 

GG ACT CACAA 
CAAGGATCCC 
GCACGGAGCC 
GAGTCCTGCT 
TACTTCGGCA 
GGCTCCTCCA 
AGCAGGTTCC 
CAGTATGGAA 
CTAACCGTGG 
GATGCAGAGT 
ACTCCAGTAT 
TACATGAGCA 
CACTCCCATT 
ATTGCACTGG 
GCCATTGTGG 
CAAAACGCCA 
AACGTCATGC 
GCCTACACCC 
CTTGACATCC 
TTTTACTCAG 
GGAGGGGCCT 
ATTCTTTACA 
TTGAATTAAG 
ACACTTCACA 
TCATATTTTG 
ATCTCCAAAC 
CAGCCCTGAC 
TCTCTCCAGC 
ATTTTGTCCA 
ATTGTCCCAC 
AGGCCCATAT 
GCACATT TGA 
TATCTTTCCT 
CTAGGTGGTG 
CTCTATTGGT 
TACAAAGTTC 



51 
I 

TGAAAACGCT 
TTCACAGGGT 
AGCTCTCTGA 
CAATGGACCA 
CTATCTCCAT 
ACCTCTGGGT 
AGCCTTCCCA 
CCGGGAGCTT 
TTGGCCAGCA 
TTGATGGAAT 
TTGACAACAT 
GTAACCCAGA 
TCTCTGGGAG 
ATAACATCCA 
ACACAGGGAC 
TTGGGGCAGC 
CGGATGTCAC 
TACTGGACTT 
ACCCTCCAGC 
TCTTTGACCG 
TGTGTCTGTG 
CCTACAAAAA 
ACCAAACAGA 
CATACACACC 
TATTGATTTT 
ATATGCACAA 
AACCCATCCA 
TCCACATGCT 
TAAATATTTC 
AAATGTTTGG 
ATTGCATTTA 
ACGTTGCTGG 
ATAAAATGGT 
CTGGGTACTT 
AATGTTAAGA 
AGCATTTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 



50 
55 
60 
65 
70 
75 
80 
85 



Seq ID NO: 467 Protein sequence 
Protein Accession #: NP_001901.1 



1 
I 

MKTLLLLLLV 
SMDQSAKEPL 
QPSQSSTYSQ 
FDGILGLGYP 
FSGSLNWVPV 
IGAAPVDGEY 
HPPAGPLWIL 



11 
I 

LLELGEAQGS 
INYLDMEYFG 
PGQSFSIQYG 
SLAVGGVTPV 
TKQAYWQIAL 
AVE C ANLNVM 
GDVFIRQFYS 



21 
I 

LHRVPLRRHP 
TISIGSPPQN 
TGSLSGIIGA 
FDNMMAONLV 
DNIQVGGTVM 
PDVTFTINGV 
VFDRGNNRVG 



31 

i 

SLKKKLRARS 
FTVIFDTGSS 
DQVSVEGLTV 
DLPMFSVYMS 
FCSEGCQAIV 
PYTLSPTAYT 
LAPAVP 



41 
I 

QLSEFWKSHN 
NLWVPSVYCT 
VGQQFGESVT 
SNPEGGAGSE 
DTGTSLITGP 
LLDFVDGMQF 



51 
I 

LDMIQFTESC 
SPACKTHSRF 
EPGQTFVDAE 
LIFGGYDHSH 
SDKIKQLQNA 
CSSGFQGLDI 



Seq ID NO: 468 DNA sequence 

Nucleic Acid Accession #: NM_018058.1 

Coding sequence: 3 19.. 15 75 



TACGCGCTGC 
GACGGCCGGG 
TACACCGACA 
GTCAACGTGG 
AGAAAGGGCT 
CCTGATGCCC 
CTCAGAGATG 
GTGGGCCCCA 
AACTTCCTTT 
GTGGACGACC 
AAAGTGGACA 
ACCCATGGGA 
GTCCGCACGG 
AACATTGCCT 
GGAGACCCCC 
ACAGGGGGTG 
GGAGAGT CCA 
TGGCTGCGAG 
CTCTACACCA 
TGTGAGATGG 
GTGACGTGGC 



11 
I 

GGGACCGGCA 
AGGAGATCTA 
AGTTGTTCAA 
CCCGTGGTGT 
CTGGACGCTA 
TCATTGAAAT 
TGGCTGCTGA 
TCCTCAGCAG 
TCCACAACCG 
CCCACCAGCA 
TCGTCTATGG 
AGGTCCGCTT 
TCATCACCGC 
ACCGCAGCTC 
TCATCGAGGA 
TGGTGACCGA 
TGGCTCAGCC 
TGGTGCCACG 
AGAAGAGTGG 
AGCCCGTGGC 
CAGATGGCAA 



21 
I 

GGGGAACGCC 
CTTCCTCAAC 
GTTCCGCAAT 
GGCCAGCCTC 
CTCTATCTAC 
GGACCCTGAG 
GGCTGGGGTC 
CAGTGCCTCG 
GGGCGATGGC 
TGGGCGAGGT 
CAACTGGAAT 
CCGGGACATC 
CGACTTTGAC 
CTCAGCCAAC 
GCTCAATCCC 
CTTCGACGGA 
GCTGTCCGTC 
CACCCGGGTT 
GGCCCACCTG 
ACACTTTGGC 
GATGGTGAGC 



31 

I 

ATCGGGGTCA 
ACCAATAATG 
AACCGGTGGG 
TTTGCCGGAC 
ATTGCCAATT 
GCCAGTGACC 
AGCAAATATA 
GATATCTTCT 
ACCTTTGTGG 
GTCGCCCTGG 
GGCCCCCACC 
GCCTCACCCA 
AATGACCAGG 
CGCCTCTTCC 
GGCGACGCCT 
GACGGGATGC 
TTCCGGGGCA 
GGGGCCTTTG 
AGGATCATCG 
CTGGGGAAGG 
CGGAACGTGG 



41 
I 

CAGCCTGCGA 
CCTTCTCGGG 
AAGACATCCT 
GCTCTGTGGC 
ACGCCTACGG 
TCTCCCGGGG 
CAGGGGGCCG 
GCGACAATGA 
ACGCTGCGGC 
CTGACTTCAA 
GCCTCTATCT 
AGTTCTCCAT 
AGCTGGAGAT 
GCGTCATCCG 
TGGAGCCTGA 
TGGACCTCAT 
ATCAGGGCTT 
CCAGGGGAGC 
ACGGGGGCTC 
ATGAAG CCAG 
CCAGCGGGGA 



51 
I 

CATCGACGGG 
GGTGGCCACG 
GAGCGATGAG 
CTGTGTGGAC 
TAATGTGGGC 
CATTCTGGCG 
AGGCGTCAGC 
GAATGGGCCT 
CAGTGCTGGT 
CCGTGATGGC 
GCAAATGAGC 
GCCCTCCCCT 
CTTCTTCAAC 
TAGAGAGCAC 
GGGCCGGGGC 
CTTGTCCCAT 
CAACAACAAC 
TAAGGTCGTG 
AGGCTACCTG 
CAGTGTGGAG 
GATGAACTCA 



60 
120 
180 
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



365 



10 



15 



WO 02/086443 

GTGCTGGAGA TCCTCTACCC CCGGGATGAG GACACACTTC AGGACCCAGC CCCACTGGAG 1320 

ACACCAATGA ATGCATCCAG TTCCCATTCG TGTGCCCTCG AGACAAGCCC GTATGTGTCA 13 80 

ACACCTATGG AAGCTACAGG TGCCGGACCA ACAAGAAGTG CAGTCGGGGC TACGAGCCCA 1440 

ACGAGGATGG CACAGCCTGC GTGGGGACTC TCGGCCAGTC ACCGGGCCCC CGCCCCACCA 1500 

CCCCCACCGC TGCTGCTGCC ACTGCCGCTG CTGCTGCCGC TGCTGGAGCT GCCACTGCTG 1560 

CACCGGTCCT CGTAGATGGA GATCTCAATC TGGGGTCGGT GGTTAAGGAG AGCTGCGAGC 162 0 

CCAGCTGCTG AGCAGGGGTG GGACATGAAC CAGCGGATGG AGTCCAGCAG GGGAGTGGGA 1680 

AAGTGGGCTT GTGCTGCTGC CTAGACAGTA GGGATGTAAA GGCCTGGGAG CTAGACCCTC 1740 

CCCAAGCCCA TCCATGCACA TTACTTAGCT AACAATTAGG GAGACTCGTA AGGCCAGGCC 1800 

CTGTGCTGGG CACATAGCTG TGATCACAGC AGACAGGGTC GCTGCCCTGA TGGCGCTTAC 1860 

AT T CCAGTGG GTCTAATGAC CAT AT CTTAG GACACAGATG TGCCCAGGGA GGTGGTGTCA 1920 

CTGCACAGGA AGTATGAGGA CTTTAGTGTC CTGAGTTCAA ATCCTGATTC AGGAACTCAC 1980 

AAAGCTATGT GACCTTACAC CAGTCACTTA ACTTGTTAGC CATCCATTAT CGCATCTGCA 2 040 

AAATGGGGAT TAAGAATAGA ATCTTGGGGT TAGTGTGGAG ATTAGATTAA ATGTATGTAA 2100 

GACACTTGGC ACAAAACCTG GCACATAGTA AAGGCTCAAT AAAAACAAGT GCCTCTCACT 2160 
GGGCTTTGTC AACACGTG 



PCT/US02/12476 



20 



25 



30 



35 



Seq ID NO: 46 9 Protein sequence 
Protein Accession #: NP_060528.1 



MDPEASDLSR 
RGDGTFVDAA 
FRDIASPKFS 
ELNPGDALEP 
RTRVGAFARG 
KMVSRNVASG 
GAGPTRSAVG 



11 
I 

GILALRDVAA 
ASAGVDDPHQ 
MPSPVRTVIT 
EGRGTGGWT 
AKWLYTKKS 
EMNSVLEILY 
ATSPTRMAQP 



21 

I 

EAGVSKYTGG 
HGRGVALADF 
ADFDNDQELE 
DFDGDGMLDL 
GAHLRI IDGG 
PRDEDTLQDP 
AWGLSASHRA 



31 

1 

RGVSVGPILS 
NRDGKVDIVY 
IFFNNIAYRS 
ILSHGESMAQ 
SGYLCEMEPV 
APLETPMNAS 
PAPPPPPLLL 



41 

1 

SSASDIFCDN 
GNWNGPHRLY 
SSANRLFRVI 
PLSVFRGNQG 
AHFGLGKDEA 
SSHSCALETS 
PLPLLLPLLE 



51 

I 

ENGPNFLFHN 
LQMSTHGKVR 
RREHGDPIiIE 
FNNNWLRWP 
SSVEVTWPDG 
PYVSTPMEAT 
LPLLHRSS 



Seq ID NO: 470 DNA sequence 
Nucleic Acid Accession #: AJ279016 
Coding sequence: 1..1962 
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ATGT CCAGGA 
CAGCGGGCTG 
AGTAATCCCA 
TTTGAGATCG 
CAGAAGCGGC 
GACCGGCAGG 
GAGATCTACT 
TTGTTCAAGT 
CGTGGTGTGG 
GGACGCTACT 
ATTGAAATGG 
GCTGCTGAGG 
CTCAGCAGCA 
CACAACCGGG 
CACCAGCATG 
GTCTATGGCA 
GTCCGCTTCC 
ATCACCGCCG 
CGCAGCTCCT 
ATCGAGGAGC 
GTGACCGACT 
GCTCAGCCGC 
GTGCCACGCA 
AAGAGTGGGG 
CCCGTGGCAC 
GATGGCAAGA 
CTCTACCCCC 
TTCTCCCAGC 
GTGTGCCCTC 
AACAAGAAGT 
CTCGGCCAGT 
GCTGCTGCCG 
CTGGGGTCGG 
CCAGCGGATG 
AGGGATGTAA 
TAACAATTAG 
CAGACAGGGT 
GGACACAGAT 
CCTGAGTTCA 
AACTTGTTAG 
TTAGTGTGGA 
AAAGGCTCAA 



TGTTACCGTT 
AACCCATGTT 
CCCAGCTCAA 
TCGTGGCGGG 
TGGTGAACAT 
GGAACGCCAT 
TCCTCAACAC 
TCCGCAATAA 
CCAGCCTCTT 
CTATCTACAT 
ACCCTGAGGC 
CTGGGGTCAG 
GTGCCTCGGA 
GCGATGGCAC 
GGCGAGGTGT 
ACTGGAATGG 
GGGACATCGC 
ACTTTGACAA 
CAGCCAACCG 
TCAATCCCGG 
TCGACGGAGA 
TGTCCGTCTT 
CCCGGTTTGG 
CCCACCTGAG 
ACTTTGGCCT 
TGGTGAGCCG 
GGGATGAGGA 
AGGAAAATGG 
GAGACAAGCC 
GCAGTCGGGG 
CACCGGGCCC 
CTGCTGGAGC 
TGGTTAAGGA 
GAGTCCAGCA 
AGGCCTGGGA 
GGAGACTCGT 
CGCTGCCCTG 
GTGCCCAGGG 
AATCCTGATT 
CCATCCATTA 
GATTAGATTA 
TAAAAACAAG 



CCTGCTGCTG 
CACTGCAGTC 
CTATGGTGTG 
GTACAATGGA 
CGCGGTCGAT 
CGGGGTCACA 
CAATAATGCC 
CCGGTGGGAA 
TGCCGGACGC 
TGCCAATTAC 
CAGTGACCTC 
CAAATATACA 
TATCTTCTGC 
CTTTGTGGAC 
CGCCCTGGCT 
CCCCCACCGC 
CTCACCCAAG 
TGACCAGGAG 
CCTCTTCCGC 
CGACGCCTTG 
CGGGATGCTG 
CCGGGG CAAT 
GGCCTTTGCC 
GAT CATCG AC 
GGGGAAGGAT 
GAACGTGGCC 
CACACTTCAG 
CCATTGCATG 
CGTATGTGTC 
CTACGAGCCC 
CCGCCCCACC 
TGCCACTGCT 
GAGCTGCGAG 
GGGGAGTGGG 
GCTAGACCCT 
AAGGCCAGGC 
ATGGCGCTTA 
AGGTGGTGTC 
CAGGAACTCA 
TCGCATCTGC 
AATGTATGTA 
TGCCTCTCAC 



CTCTGGTTTC 
ACCAACTCAG 
GCAGTTACTG 
CCCAACCTGG 
GAGCGCAGCT 
GCCTGCGACA 
TTCTCGGGGG 
GACAT CCTGA 
TCTGTGGCCT 
GCCTACGGTA 
TCCCGGGGCA 
GGGGGCCGAG 
GACAATGAGA 
GCTGCGGCCA 
GACTTCAACC 
CTCTATCTGC 
TTCTCCATGC 
CTGGAGATCT 
GTCATCCGTA 
GAGCCTGAGG 
GACCTCATCT 
CAGGGCTTCA 
AGGGGAGCTA 
GGGGGCTCAG 
GAAGCCAGCA 
AGCGGGGAGA 
GACCCAGCCC 
GACACCAATG 
AACACCTATG 
AACGAGGATG 
ACCCCCACCG 
GCACCGGTCC 
CCCAGCTGCT 
AAAGTGGGCT 
CCCCAAGCCC 
CCTGTGCTGG 
CATTCCAGTG 
ACTGCACAGG 
CAAAGCTATG 
AAAATGGGGA 
AGACACTTGG 
TGGGCTTTGT 



TGCCCATCAC 
TTCTGCCTCC 
ATGTGGACCA 
TTCTGAAGTA 
CACCCTACTA 
TCGACGGGGA 
TGGCCACGTA 
GCGATGAGGT 
GTGTGGACAG 
ATGTGGGCCC 
TTCTGGCGCT 
GCGT CAGCGT 
ATGGGCCTAA 
GTGCTGGTGT 
GTGATGGCAA 
AAATGAGCAC 
CCTCCCCTGT 
TCTTCAACAA 
GAGAGCACGG 
GCCGGGGCAC 
TGTCCCATGG 
ACAACAACTG 
AGGTCGTGCT 
GCTACCTGTG 
GTGTGGAGGT 
TGAACTCAGT 
CACTGGAGTG 
AATGCATCCA 
GAAGCTACAG 
GCACAGCCTG 
CTGCTGCTGC 
TCGTAGATGG 
GAGCAGGGGT 
TGTGCTGCTG 
ATCCATGCAC 
GCACATAGCT 
GGTCTAATGA 
AAGTATGAGG 
TGACCTTACA 
TTAAGAATAG 
CACAAAACCT 
CAACACG 



TGAGGGGTCC 
TGACTATGAC 
TGATGGGGAC 
TGACCGGGCC 
CGCGCTGCGG 
CGGCCGGGAG 
CACCGACAAG 
CAACGTGGCC 
AAAGGGCTCT 
TGATGCCCTC 
CAGAGATGTG 
GGGCCCCATC 
CTTCCTTTTC 
GGACGACCCC 
AGTGGACATC 
CCATGGGAAG 
CCGCACGGTC 
CATTGCCTAC 
AGACCCCCTC 
AGGGGGTGTG 
AGAGTCCATG 
GCTGCGAGTG 
CTACACCAAG 
TGAGATGGAG 
GACGTGGCCA 
GCTGGAGATC 
TGGCCAAGGA 
GTTCCCATTC 
GTGCCGGACC 
CGTGGGGACT 
CACTGCCGCT 
AGATCTCAAT 
GGGACATGAA 
CCTAGACAGT 
ATTACTTAGC 
GTGATCACAG 
CCATATCTTA 
ACTTTAGTGT 
CCAGTCACTT 
AATCTTGGGG 
GGCACATAGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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1500 
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2100 
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Seq ID NO: 4 71 Protein sequence 
Protein Accession #: CAC08451 
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85 MSRMLPFLLL LWFLPITEGS QRAEPMFTAV TNSVLP PDYD SNPTQLNYGV AVTDVDHDGD 60 
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GRYSIYIANY AYGNVGPDAL IEMDPEASDL SRGILALRDV 
LSSSASDIFC DNENGPNFLF HNRGDGTFVD AAASAGVDDP 
VYGNWNGPHR LYLQMSTHGK VRFRDIASPK FSMPSPVRTV 
RSSSANRLFR VIRREHGDPL IEELNPGDAL EPEGRGTGGV 
AQPIiSVFRGN QGFNNNWLRV VPRTRFGAFA RGAKWIiYTK 
PVAHFGLGKD EASSVEVTWP DGKMVSRNVA SGEMNSVLEI 
FSQQENGHCM DTNECIQFPF VCPRDKPVCV NTYGSYRCRT 
LGQSPGPRPT TPTAAAATAA AAAAAGAATA APVLVDGDLN 

Seq ID NO: 472 DNA sequence 
Nucleic Acid Accession #: FGENESHH 
Coding sequence: 1..4794 



PCT/US02/12476 







240 


UOTT<*ZT? r'\ T ft T . a 
rty rl*jK.Li V Ai-iM. 


n etntd nfnrvn t 
jj Jr jn kjj v u ± 


3 00 


ITADFDNDQE 


LEIPFNNIAY 


360 


VTDFDGDGML 


DLILSHGESM 


420 


KSGAHLRIID 


GGSGYLCEME 


480 


LYPRDEDTLQ 


DPAPLECGQG 


540 


NKKCSRGYEP 


NEDGTACVGT 


600 


LGSWKESCE 


PSC 





ATGGCGTGTC 
AGCGGCTCCT 
GTTCTGAAGT 
TCACCCTACT 
ATCGACGGGG 
CACAGCAGCT 
CCACCTACAA 
TCCTCCCTGG 
TGTCGGGGTG 
GGGGTGGCCA 
CTGAGCGATG 
GCCTGTGTGG 
GGTAATGTGG 
GGCATTCTGG 
TTCTCCCACA 
GGAGGAGACC 
TGCCGGCTGG 
CAGAGGGAGG 
TCCAAAAGCC 
GCGCCTTCTC 
CCCCTTGTCA 
CCCCACCCCC 
CTGATGGCTG 
CTGAGAAGCT 
GAGCTGGGAG" 
CTGGGAGAAC 
CCCAAGGTCA 
GGCCCCGGGA 
CTCTCCCATC 
GTGCCGGGAG 
CTGGCGTGGA 
TTTAGGCTCA 
CTGCAGTTCC 
TCTGCCACTC 
ATCCTCAGCA 
TTCCACAACC 
GCCTTCATCG 
CTAGCAGAAA 
CCACATTGCC 
TTCTTGACGC 
CAGGGGGCCC 
ACTGCCTATT 
TTGTCCTCTG 
GCCCTGGCTG 
CCCCACCGCC 
TCACCCAAGT 
GACCAGGAGC 
CTCTTCCGAT 
GGTCAGGGAG 
AAGGT CAACA 
AGAGGCTGTG 
AAAGGGAAGG 
CCACACTACC 
GTCCAATCAC 
CGGGGTCCAA 
GCTACGGGCT 
AGGGGCTACG 
AGAAAGGGGC 
CCAGGAAAAG 
ACTACCAGGA 
AATCACTACC 
GTCCAATCAC 
CGGGGTCCAA 
GCTATGGGGT 
AGGGGCTATG 
AGAGAG CACG 
GGCCGGGGCA 
TTGTCCCATG 
AACAACAACT 
AAGGTCGTGC 
GGCTACCTGT 
AGTGTGGAGG 



11 
I 

CGGGAGGACT 
CCCCAGCATC 
ATGACCGGGC 
ACGCGCTGCG 
ACGGCCGGGA 
CAGCGCAGGT 
CCCCTGCAGG 
GTCAGGCTTC 
GACTGAGACC 
CGTACACCGA 
AGGTCAACGT 
ACAGAAAGGG 
GCCCTGATGC 
CGCTCAGAGA 
CTGCCTCTCC 
CAGAGGAGGC 
GCTGGAAGGA 
CTGGGGCAGC 
ATTTGGCTGA 
CAGCCCACCC 
CTCAGCTAAT 
GAGCCCCAGG 
AGGCTTTGGG 
GGGAGGAAAG 
GTCCCTGGAG 
CTCCCATTTT 
CACAGGAGTG 
GGGTGGCCAA 
CCCTGGTCCC 
CTGCCCTGCC 
ACCAGATGGA 
GGAAAGCACG 
CCTCAGGCCT 
ACTGTGGGTC 
GCAGTGCCTC 
GGGGCGATGG 
TTCACCTCAA 
CTGGTCCTTC 
ATCATGGTTT 
AAGGCTTGGC 
CACCCTGCCT 
ACATTGTCCT 
AAAGAGTCAA 
ACTTCAACCG 
TCTATCTGCA 
TCTCCATGCC 
TGGAGATCTT 
GCTCCATCCT 
AAGGTTTAAG 
CAGGTCCCCT 
GGAATGCAGG 
GAAATGTGG C 
ACAAAAAGGG 
TACCAGGAAA 
TCACTACCAG 
CCAATCACTA 
GGCTCCAATC 
TACGGGCTCC 
GGGCTACAGG 
AAAGGGGCTA 
AGGAAAAGGG 
TACCACAGAA 
TCACTACCAG 
CCAATCACTA 
GGGTCCAATC 
GAGACCCCCT 
CAGGGGGTGT 
GAGAGTCCAT 
GGCTGCGAGT 
TCTACACCAA 
GTGAGATGGA 
TGACGTGGCC 



21 
I 

CCCAGCCCGT 
CCCTCCCCAT 
CCAGAAGCGG 
GGACCGGCAG 
GGAGAT CTAC 
CCCTTCTGGG 
CCTCCTGGGT 
TCCGGACAGC 
TACCCATGAA 
CAAGTTGTTC 
GGCCCGTGGT 
CTCTGGACGC 
CCT CATTGAA 
TGTGGCTGCT 
AAGCATTGGT 
AGATGAGGAG 
CGGGCAGTTC 
TGGCGTGCCC 
CAAGAACCTA 
TTTCCCTGCC 
GACACATGGA 
AATGGACCCC 
CGCGTGGCCA 
CAGGCAGAAG 
CCAAGCCACA 
ACAAAGAACA 
CCATCTAGTG 
G CGAGAG ATT 
CAACTTCCCC 
TGGGAAT CCT 
AAAAGAGGAG 
GGAAGCAGAA 
CAGAGGCAGC 
GATGTCTTTT 
GGATATCTTC 

AT AT CACCT C 
CTCCTCCTGC 
GTCTATGAGC 
CTCCAGTGCC 
TCTGGCAAGA 
GTGGTCTGCC 
CGTGGGTGTG 
TGATGGCAAA 
AATGAGCACC 
CTCCCCTGTC 
CTTCAACAAC 
GGCTCGTGGC 
AATCAGAAGG 
GATGAAGAAA 
GCAAAGCCTG 
CCAAAGTGTG 
GCTACAGGGT 
AGGGGCTACG 
GAAAAGGGGC 
CCAGGAAAAG 
ACTACCAGGA 
AATCACTACC 
GTCCAATCAC 
CGGGGTCCAA 
GCTACAGGGT 
AGGGGCTACG 
GAAAAGGGGC 
CCAGGAAAAG 
ACTACCACAG 
CATCGAGGAG 
GGTGACCGAC 
GGCTCAGCCG 
GGTGCCACGC 
GAAGAGTGGG 
GCCCGTGGCA 
AGATGGCAAG 



31 
I 

TGCTCTGGTT 
TCCTCCTCCA 
CTGGTGAACA 
GGGAACGCCA 
TTCCTCAACA 
CTCCACAGAA 
CTGCCTCCAC 
AGGCAGGGAG 
CCAGAACCAT 
AAGTTCCGCA 
GTGGCCAGCC 
TACTCTATCT 
ATGGACCCTG 
GAGGCTGGGG 
GAGATATCTG 
CACAGTGGGG 
AAGGAAGAAG 
AGAGGACGTG 
TTTGGCCCAC 
CGCCAAGCCC 
CGTCTGGCTG 
AAATGTAAGG 
GCGCTCAGCA 
GGGCAGGCCA 
CAGCACCTGC 
GACGGAGATC 
GCCACCATGC 
GGGAGAGAGA 
AGCTGCTTGA 
GGGAACTGGG 
GGGAAGATTC 
TTCCCCCCAG 
CCTGTCCTCC 
CTAGGGGGCC 
TGCGACAATG 
GACGCTGCGG 
TGCAGAGATT 
TGCCCGTGGC 
TTTACAAGGA 
CACCGGAGGA 
GCTCCCTGTG 
ATCCCAGAGA 
GACGACCCCC 
GTGGACATCG 
CATGGGAAGG 
CGCACGGTCA 
ATTGCCTACC 
TCTTCATCCT 
GGAGGGTTCC 
CAGAAAGGAA 
GCCAAGGAGC 
CCCAGAACCC 
CCAATCACTA 
GGGTCCAATC 
TACGGGGTCC 
GGGCTACAGG 
AAAGGGGCTA 
AGGAAAAGGG 
TACCAGGAAA 
TCACTACCAG 
CCAATCACTA 
GGCTCCAATC 
TACGGGCTCC 
GGGCTACGGG 
AAAGGGGCTA 
CTCAATCCCG 
TTCGACGGAG 
CTGTCCGTCT 
ACCCGGTTTG 
GCCCACCTGA 
CACTTTGGCC 
ATGGTGAGCC 



41 

I 

GGATGGGACT 
GGTACAATGG 
TCGCGGTCGA 
TCGGGGTCAC 
CCAATAATGC 
ACAGGCCTGT 
TCAGCGGAAG 
AGAGGGTGCC 
TTCTTCTGAG 
ATAACCGGTG 
TCTTTGCCGG 
ACATTGCCAA 
AGGCCAGTGA 
TCAGCAAATA 
GCAGAACCGA 
ATGGAAGCAC 
CAGCAGCTTT 
TTCGAACAGC 
CATGTTACTA 
CCCAACACTA 
GAAAACTAGC 
GCCGCCATGC 
CCACTGTGGT 
TGTCCAGATG 
CTGCTAGAGA 
CAGGGAGGAG 
CAGCTCTCGG 
CTGGGGCAGT 
GGCCTCTTGA 
TTCTGGACAT 
ATGGAGACCA 
GCTCCTCTGA 
AGGTGGGCCT 
GAGGCGTCAG 
AGAATGGGCC 
CCAGTGCTGA 
TTCCTCACTC 
ATGCACGTCT 
CCGGGTCACG 
CACTCAGCCT 
TCCTGGGGTC 
GCCTGATGAC 
ACCAGCATGG 
TCTATGGCAA 
TCCGCTTCCG 
TCACCGCCGA 
GCAGCTCCTC 
TGACAGCTGG 
CAGGGCCAGG 
GGAAGGACGA 
CGGCCTCTGC 
AAGCGCCACA 
CCAGGAAAAG 
ACTACCAGGA 
AATCACTACC 
GTCCAATCAC 
CAGGGTCCAA 
GCTACGGGGT 
AGGGGCTACG 
GAAAAGGGGC 
CCAGGAAAAG 
ACTACCAGGA 
AATCACTACC 
CTCCAATCAC 
CGGGGTCCAA 
GCGACGCCTT 
ACGGGATGCT 
TCCGGGGCAA 
GGGCCTTTGC 
GGATCATCGA 
TGGGGAAGGA 
GGAACGTGGC 



51 
I 

GGGTGGGCCC 
ACCCAACCTG 
TGAGCGCAGC 
AGCCTGCGAC 
CTTCTCGGGC 
GCTGAAGCCT 
GGACTTTTCC 
GGTTCCCTGC 
ACCCAAATCA 
GGAAGACATC 
ACGCTCTGTG 
TTACGCCTAC 
CCTCTCCCGG 
TACAGAAGGC 
GGAGCGGGAA 
CAGCCAACTG 
GGTGGAGGAA 
TCTGCAGACT 
TTCTGTCTGC 
CCCTGTAGCC 
CCGGAGTGTC 
TGAGCCCGGC 
GCCAGGGGGC 
TGCACTCAGG 
GCTGTATGAC 
AAGGGACTCG 
GGGACTCGAG 
AGGAAGACCA 
AGCCGGGACA 
GGCCAAGGCC 
TGAGCCCAGA 
GGAGCCTCTG 
GGGGCTTGCT 
CGTGGGCCCC 
TAACTTCCTT 
ACGTCGTTTA 
CCTGTGCCAC 
TCTTCAGGCT 
GTTCTATTCA 
CCAGGGTTCT 
TCTGATCCCC 
CCACAGCTAT 
GCGAGGTGTC 
CTGGAATGGC 
GGACAT CGCC 
CTTTGACAAT 
AGCCAACCGC 
TGGGAGGAAC 
GGGTCAGGCC 
GGACTGGGCA 
TATTGCAGGG 
AGATACAAAG 
GGGCTACGGG 
AAAGGGGCTA 
AGGAAAAGGG 
TACCAGGAAA 
TCACTACCAC 
CCAATCACTA 
GGGTCCAATC 
TACGGGGTCC 
GGGCTACAGG 
AAAGGGGCTA 
AGGAAAAGAG 
TACCAGGAAA 
CGTCAT CCGT 
GGAGCCTGAG 
GGACCTCATC 
TCAGGGCTTC 
CAGGGGAGCT 
CGGGGGCTCA 
TGAAGCCAGC 
CAGCGGGGAG 
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WO 02/086443 

ATGAACTCAG TGCTGGAGAT CCTCTACCCC CGGGATGAGG 
CCACTGGAGT GTGGCCAAGG ATTCTCCCAG CAGGAAAATG 
GAATGCATCC AGTTCCCATT CGTGTGCCCT CGAGACAAGC 
GGAAGCTACA GGTGCCGGAC CAACAAGAAG TGCAGTCGGG 
GGCACAGCCT GCGTGGGTAC TGAGCTAGGC TCTAGGCATA 
CCCAAAAAGG AGCTGCAACT TTCCCAAGGC ATCTGCACCC 
CCGGGTTGCC GGCTGCTCCT CAAAAGAGCT CAGCTCCAGG 
CAGAAAGCTC CAGGTATTCC AGAAG CCCAA GTGTATGAAC 

Seq ID NO j 473 Protein sequence 
Protein Accession #: FGENESH predicted 



ACACACTTCA 
GCCATTGCAT 
CCGTATGTGT 
GCTACGAGCC 
CAATGACGTG 
CCGTCTGGTC 
CTGCTCCCAG 
AAGATCAGGA 



GGACCCAGCC 
GGACACCAAT 
CAACACCTAT 
CAACGAGGAT 
GAAACCAAGG 
CTTTTTCCTG 
CACCCTTCTC 
ATAA 



MACPGGLPAR 
SPYYALRDRQ 
PPTTPAGLLG 
GVATYTDKLF 
GNVGPDALIE 
GGDPEEADEE 
SKSHLADKNL 
PHPRAPGMDP 
ELGGPWSQAT 
GPGRVAKREI 
LAWNQMEKEE 
SATHCGSMSF 
AFIVHLKYHL 
FLTQGLASSA 
LSSERVNVGV 
SPKFSMPSPV 
GQGEGLRIRR 
KGKGNVAQSV 
RGPITTRKRG 
RKGLRAPITT 
NHYQEKGLQG 
AMGSNHYQEK 
GRGTGGWTD 
KWLYTKKSG 
MNSVLEILYP 
GSYRCRTNKK 
PGCRLLLKRA 



11 
I 

CSGWMGLGGP 
GNAI GVTACD 
LPPLSGRDFS 
KFRNNRWEDI 
MDPEASDLSR 
HSGDGSTSQL 
FGPPCYYSVC 
KCKGRHAEPG 
QHLPARELYD 
GRETGAVGRP 
GKIHGDHEPR 
LGGRGVSVGP 
CRDFPHSLCH 
HRRTJjSLQGS 
DDPHQHGRGV 
RTVITADFDN 
GGFPGPGGQA 
PRTQAPQDTK 
YGVQSLPGKG 
RKRGYGVQSL 
PITTRKRGYR 
GLRAPITTRK 
FDGDGMLDLI 
AHLRIIDGGS 
RDEDTLQDPA 
CSRGYEPNED 
QLQAAPSTLL 



21 
I 

SGSSPASPPH 
IDGDGREEIY 
SSLGQASPDS 
LSDEVNVARG 
GILALRDVAA 
CRLGWKDGQF 
APSPAHPFPA 
LMAEALGAWP 
LGEPPILQRT 
LSHPLVPNFP 
FRLRKAREAE 
ILSSSASDIF 
LAETGPSSSC 
QGAPPCLLAR 
ALADFNRDGK 
DQELEIFFNN 
KVNTGPLMKK 
PHYHKKGLQG 
ATGSNHYQEK 
PGKGATGSNH 
VQSLPQKGAT 
RGYGVQSLPQ 
LSHGESMAQP 
GYLCEMEPVA 
PIjECGQGFSQ 
GTACVGTELG 
QKAPGIPEAQ 



31 

I 

SSSRYNGPNL 
FLNTNNAFSG 
RQGERVPVPC 
VASLFAGRSV 
EAGVSKYTEG 
KEEAAALVEE 
RQAPQHYPVA 
ALSTTWPGG 
DGDPGRRRDS 
SCLRPLEAGT 
FPPGSSEEPL 
CDNENGPNFL 
CPWHARLLQA 
APCVLGSLIP 
VDIVYGNWNG 
IAYRSSSANR 
QKGRKDEDWA 
PITTRKRGYG 
GLQGPITTRK 
YQEKGLRGPI 
GSNHYQEKGL 
KGATGSNVIR 
LSVFRGNQGF 
HFGLGKDEAS 
QENGHCMDTN 
SRHTMTWKPR 
VYEQDQE 



41 
I 

VLKYDRAQKR 
HSSSAQVPSG 
CRGGLRPTHE 
ACVDRKGSGR 
FSHTASPSIG 
QREAGAAGVP 
PLVTQLMTHG 
LRSWEESRQK 
PKVTQECHLV 
VPGAALPGNP 
LQFPSGLRGS 
FHNRGDGT F V 
PHCHHGLSMS 
TAYYIVLWSA 
PHRLYLQMST 
LFRCSILARG 
RGCGNAGQSL 
VQSLPGKGAT 
RGYGLQSLPG 
TTRKRGYGLQ 
RGPITTRKRG 
REHGDPLIEE 
NNNWLRWPR 
SVEVTWPDGK 
ECIQFPFVCP 
PKKELQLSQG 



Seq ID NO: 474 DNA sequence 

Nucleic Acid Accession #: NMJ303661.1 

Coding sequence: 1..1152 



ATGAGTGCAC 
CAAAACGTTC 
GCTGCTGGCA 
AAGGAAAAAG 
GGATTCGTGG 
GACAACCTTG 
TACAGAAACT 
AGAAGGCTCC 
AATGTGGTGT 
CTGGCACCCT 
ATCACAGCCG 
ACACAAGCCC 
GAGTTTTTGG 
ACACGAGGCA 
GTACCGCATG 
GAACAGGTGG 
ACGGATGTGG 
TCAAAGCACT 
CAGGAGCTGG 
CAAGAACTGT 



11 
I 

TTTTCCTTGG 
CAAGTGGGAC 
CCATGGACCC 
TGAGCACACA 
CTGCTGCTGA 
CAAGACAAAT 
GGTTTCTGAA 
GTGCCCTTGC 
CTGGCT CTCT 
TCACAGAGGG 
CTTTGACCGG 
AAGCCCACGA 
GTGAGAACAT 
T TGGGAAGGA 
CCTCAGCCTC 
AGAGGGTTAA 
CCCCTGTAAG 
TACATGAGGG 
AGGAGAAGCT 
GA 



21 

I 

TGTGGGAGTG 
AGATACTGGA 
AGAGAGCAGT 
GAATCTGCTA 
ACTGCCCAGG 
GATCATGAAA 
AGAGTTTCCT 
AGATGGGGTT 
CAGCATTTCC 
AGGCAGCCTT 
GATTACCAGC 
CCTGGTCATC 
AT CCAACTTT 
CATCCGTGCC 
ACGCCCCCGG 
TGAACCCAGC 
CTTCTTTCTT 
GGCAAAGTCA 
AAACATTCTC 



31 
I 

AGGGCAGAGG 
GATCCTCAAA 
AT CTTTATTG 
CTCCTGCTGA 
AATGAGGCAG 
GACAAAAACT 
CGGTTGAAAA 
CAGAAGGTCC 
TCTGGCATCC 
GTACTCTTGG 
AGTACCATGG 
AAAAGCCTTG 
CTTTCCTTAG 
CT CAGACG AG 
GTCACTGAGC 
ATCCTGGAAA 
GTGCTGGATG 
GAGACAGCTG 
AACAATAATT 



41 
I 

AAGCTGGAGC 
GTAAGCCCCT 
AGGATG CCAT 
CTGATAATGA 
ATGAGCTCCG 
GGCACGATAA 
GTGAGCTTGA 
ACAAAGGCAC 
TGACCCTCGT 
AACCTGGGAT 
ACTACGGAAA 
ACAAATTGAA 
CTGGCAATAC 
CCAGAGCCAA 
CAATCTCAGC 
TGAG CAGAGG 
TAGTCTACCT 
AGGAGCTGAA 
ATAAGATTCT 



Seq ID NO: 475 Protein sequence 
Protein Accession #: NP_003652.1 



1 
I 

MSALFLGVGV 
KEKVSTQNLL 
YRNWFLKEFP 
LAPFTEGGSL 
EFLGENISNF 
EQVERVNEPS 
QELEEKLNIL 



11 

I 

RAEEAGARVQ 
LLLTDNEAWN 
RLKSELEDNI 
VLLEPGMELG 
LSLAGNTYQL 
ILEMSRGVKL 
NNNYKILQAD 



21 
I 

QNVPSGTDTG 
GFVAAAELPR 
RRLRAIiADGV 
ITAALTGITS 
TRGIGKDIRA 
TDVAPVSFFL 
QEL 



31 

f 

DPQSKPLGDW 
NEADELRKAL 
QKVHKGTTIA 
STMDYGKKWW 
LRRARANLQS 
VLDWYLVYE 



41 
I 

AAGTMDPESS 
DNLARQMIMK 
NWSGSLSIS 
TQAQAHDLVI 
VPHASASRPR 
SKHLHEGAKS 



51 
I 

IFIEDAIKYF 
DKNWHDKGQQ 
SGILTLVGMG 
KSLDKLKEVR 
VTEPISAESG 
ETAEELKKVA 



4380 
4440 
4500 
4560 
4620 
4680 
4740 
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51 
I 

LVNIAVDERS 
LHRNRPVLKP 
PEPFLLRPKS 
YSIYIANYAY 
EISGRTEERE 
RGRVRTALQT 
RLAGKLARSV 
GQAMSRCALR 
ATMPALGGLE 
GNWVLDMAKA 
PVLQVGLGLA 
DAAASAERRL 
FTRTGSRFYS 
IPESLMTHSY 
HGKVRFRD I A 
SSSLTAGGRN 
AKEPASAIAG 
GSNHYQEKGL 
KGATGSNHYH 
SLPGKGATGS 
YGLQSLPGKE 
LNPGDALEPE 
TRFGAFARGA 
MVSRNVASGE 
RDKPVCVNTY 
ICTPVWSFFL 



51 
I 

GAGGGTGCAA 
CGGTGACTGG 
TAAGTATTTC 
GGCCTGGAAC 
TAAAGCTCTG 
AGGCCAGCAG 
GGATAACATA 
CACCATCGCC 
CGGCATGGGT 
GGAGTTGGGA 
GAAGTGGTGG 
GGAGGTGAGG 
TTACCAACTC 
TCTTCAGTCA 
TGAAAGCGGT 
AGTCAAGCTC 
CGTGTACGAA 
GAAGGTGGCT 
GCAGGCGGAC 



60 
120 
180 
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300 
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Seq ID NO: 476 DNA sequence 

Nucleic Acid Accession #: NMJ)14452 . 

Coding sequence: 1..1968 
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ATGGGGACCT 
GCCACAGCCA 
GCTCAGCCAG 
ACCGGCCAGG 
ACCAACACAA 
AATGGCATAG 
TTACCTTGTG 
AACGCTACCT 
ACAGAGACTG 
TCTAGTGTGA 
AAGCCGGGGA 
ACCTCACCTT 
GTCCCTTCCT 
TCTGTTAGAC 
TCAGCAAGGG 
CAGCAAGGCC 
GGCGAGAAGT 
CACAAGCATT 
GTGCTTGTGG 
CCCCGGCAGG 
ACCCAGAACC 
CTTGTAGCAG 
AGTGAGAGGG 
GCAGCTCTGC 
GCCCTGCGCC 
ACCACCCAGC 
AGCCCCATCC 
TCCCCACAGG 
GACTCTACAT 
AAGAAGGACA 
GATGACATGC 
GCTGAGGACA 
CAGACCCTCC 



CTCCGAGCAG 
CGATGATCGC 
AACAGAAGGC 
TGCTAACCTG 
GCCTGCGCGT 
AGAAATGCCA 
CTGCCTTGAC 
GTGCCCCCCA 
AGGATGTGCG 
TGAAATGCAA 
CCAAGGAGAC 
CCCCTGGCAC 
CCACTTATGT 
CAAAGGTACT 
GGAAGGAAGA 
CCCACCACAG 
CCAGCACGCC 
TTGACATCAA 
TGATTGTGGT 
ATCCCAGTGC 
GGGAGAAATG 
CCCAAGTGGG 
AGGTTGCTGC 
AGCACTGGAC 
AGCACCGGAG 
TGGAAACTGA 
CCAGCCCCAA 
ACAAGAACAA 
CCAGCGGCTC 
CAGTGTTGCG 
TCCACTTTCT 
AACTAGACCG 
TGGACTCTGT 



CAGCACCGCC 
GGGCTCCCTT 
CTCGAATCTC 
TGACAAGTGT 
CTG CAGCAGT 
TGACTGTAGT 
TGACCGAGAA 
TACGGTGTGT 
GTGTAAGCAG 
AGCATACACA 
AGACAACGTC 
AGCCATCTTT 
TCCCAAAGGC 
GAGTAGCATC 
CGTGAACAAG 
ACACATCCTG 
CATCAAGGGC 
TGAGCATTTG 
GTGCAGTATC 
CATTGTGGAA 
GAT CTACTAC 
AAGCCAGTGG 
TTTCTCCAAT 
CATCCGGGGC 
AAACGATGTT 
CAAACTAGCT 
CGCGAAACTT 
GGGCTTCTTC 
CTCCGCGCTG 
GCAGGTACGC 
AAATCCTGAG 
GCTATTCGAA 
TTATAGCCAT 



CTCGCCTCCT 
CTCCTGCTTG 
ATTGGCACAT 
CCAGCAGGAA 
TGCCCTGTGG 
CAGCCATGCC 
TGCACTTGCC 
CCTGTGGGTT 
TGTGCTCGGG 
GACTGTCTGA 
TGTGGCACAC 
CCACGCCCTG 
ATGAACTCAA 
CAGGAAGGGA 
ACCCTCCCAA 
AAGCTGCTGC 
CCCAAGAGGG 
CCCTGGATGA 
CGGAAAAGCT 
AAGGCAGGGC 
TGCAATGGCC 
AAAGATATCT 
GGGTACACAG 
CCCGAGGCCA 
GTGGAGAAGA 
CTCCCGATGA 
GAGAATTCCG 
GTGGATGAGT 
AGCAGGAACG 
CTGGACCCCT 
GAGCTGCGGG 
ATTATTGGAG 
CTTCCTGACC 



Seq ID NO; 477 Protein sequence 
Protein Accession #: NP 055267.1 



MGTSPSSSTA 
TGQVLTCDKC 
LPCAALTDRE 
SSVMKCKAYT 
VPSSTYVPKG 
QQGPHHRHIL 
VLWIWCSI 
LVAAQVGSQW 
ALRQHRRNDV 
SPQDKNKGFF 
DDMLHFLNPE 



11 
1 

LASCSRIARR 
PAGTYVSEHC 
CTCPPGMFQS 
DCLSQNLWI 
MNSTESNSSA 
KLLPSMEATG 
RKSSRTDKKG 
KDIYQFLCNA 
VEKIRGLMED 
VDESEPLLRC 
ELRVIEEIPQ 



21 
1 

ATATMIAGSL 
TNTSLRVCSS 
NAT CAPHT VC 
KPGTKETDNV 
SVRPKVLSSI 
GEKSSTPIKG 
PRQDPSAIVE 
S ERE VAAFSN 
TTQLETDKLA 
DSTSSGSSAL 
AEDKLDRLFE 



31 
1 

LLLGFLSTTT 
CPVGTFTRHE 
PVGWGVRKKG 
CGTLPSFSSS 
QEGTVPDNTS 
PKRGHPRQNL 
KAGDKKSMTP 
GYTADHERAY 
LPMSPSPLSP 
SRNGSFITKE 
IIGVKSQEAS 



GCAGCCGCAT 
GATTCCTTAG 
ACCGCCATGT 
CCTATGTCTC 
GGACCTTTAC 
CATGGCCAAT 
CACCTGGCAT 
GGGGTGTGCG 
GTACCTTCTC 
GTCAGAACCT 
TCCCGTCCTT 
AG CACATGGA 
CAGAATCCAA 
CAGTCCCTGA 
ACCTTCAGGT 
CGTCCATGGA 
GACATCCTAG 
TTGTGCTTTT 
CGAGGACTCT 
TGAAGAAATC 
ATGGTATCGA 
ATCAGTTTCT 
CCGACCACGA 
GCCTCGCCCA 
TTCGTGGGCT 
GCCCCAGCCC 
CTGTCCTGAC 
CGGAGCCCCT 
GTTCCTTTAT 
GTGACTTGCA 
TGATTGAAGA 
TCAAGAGCCA 
TGCTGTAG 



41 
I 

AQPEQKASNL 
NGIEKCHDCS 
TETEDVRCKQ 
TSPSPGTAIF 
SARGKEDVNK 
HKHFDINEHL 
TQNREKWIYY 
AALQHWTIRG 
SPIPSPNAKL 
KKDTVLRQVR 
QTLLDSVYSH 



CGCCCGCCGA 

CACCACCACA 

TGACCGTGCC 

TGAGCATTGT * 

CAGGCATGAG 

GATTGAGAAA 

GTTCCAGTCT 

GAAGAAAGGG 

AGATGTGCCT 

GGTGGTGATC 

CTCCAGCTCC 

AACCCATGAA 

CTCTTCTGCC 

CAACACAAGC 

AGTCAACCAC 

GGCCACTGGG 

ACAGAACCTA 

CCTGCTGCTG 

GAAAAAGGGG 

CATGACTCCA 

TAT CCTGAAG 

TTGCAATGCC 

GCGGGCCTAC 

GCTAATTAGC 

GATGGAAGAC 

GCTTAGCCCG 

GGTGGAGCCT 

TCTCCGCTGT 

TACCAAAGAA 

GCCTATCTTT 

GATTCCCCAG 

GGAAGCCAGC 



51 
1 

IGTYRHVDRA 
QPCPWPMIEK 
CARGTFSDVP 
PRPEHMETHE 
TLPNLQWNH 
PWMIVLFLLL 
CNGHGIDILK 
PEASLAQLIS 
ENSALLTVEP 
LDPCDLQPIF 
LPDLL 



60 
120 
180 
240 
300 
'^360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



Seq ID NO: 478 DNA sequence 
Nucleic Acid Accession #: XM_044533 
55 Coding sequence: 238.. 2751 

1 11 21 31 " 41 51 

I I I I I i 

GCTCTGCCCA AGCCGAGGCT GCGGGGCCGG CGCCGGCGGG AGGACTGCGG TGCCCCGCGG 60 

60 AGGGGCTGAG TTTGCCAGGG CCCACTTGAC CCTGTTTCCC ACCTCCCGCC CCCCAGGTCC 12 0 

GGAGGCGGGG GCCCCCGGGG CGACTCGGGG GCGGACCGCG GGGCGGAGCT GCCGCCCGTG 180 

AGTCCGGCCG AGCCACCTGA GCCCGAGCCG CGGGACACCG TCGCTCCTGC TCTCCGAATG 240 

CTGCGCACCG CGATGGGCCT GAGGAGCTGG CTCGCCGCCC CATGGGGCGC GCTGCCGCCT 300 

CGGCCACCGC TGCTGCTGCT CCTGCTGCTG CTGCTCCTGC TGCAGCCGCC GCCTCCGACC 360 

65 TGGGCGCTCA GCCCCCGGAT CAGCCTGCCT CTGGGCTCTG AAGAGCGGCC ATTCCTCAGA 42 0 

TTCGAAGCTG AACACATCTC CAACTACACA GCCCTTCTGC TGAGCAGGGA TGGCAGGACC 480 

CTGTACGTGG GTGCTCGAGA GGCCCTCTTT GCACTCAGTA GCAACCTCAG CTTCCTGCCA 540 

GGCGGGGAGT ACCAGGAGCT GCTTTGGGGT GCAGACGCAG AGAAGAAACA GCAGTGCAGC 600 

TTCAAGGGCA AGGACCCACA GCGCGACTGT CAAAACTACA TCAAGATCCT CCTGCCGCTC 660 

70 AGCGGCAGTC ACCTGTT CAC CTGTGGCACA GCAGCCTTCA GCCCCATGTG TACCTACATC 720 

AACATGGAGA ACTTCACCCT GGCAAGGGAC GAGAAGGGGA ATGTCCTCCT GGAAGATGGC 7 80 

AAGGGCCGTT GTCCCTTCGA CCCGAATTTC AAGTCCACTG CCCTGGTGGT TGATGGCGAG 840 

CTCTACACTG GAACAGTCAG CAGCTTCCAA GGGAATGACC CGGCCATCTC GCGGAGCCAA 900 

AGCCTTCGCC CCACCAAGAC CGAGAGCTCC CTCAACTGGC TGCAAGACCC AGCTTTTGTG 960 

75 GCCTCAGCCT ACATTCCTGA GAGCCTGGGC AGCTTGCAAG G CGATG ATG A CAAGATCTAC 1020 

TTTTTCTTCA GCGAGACTGG CCAGGAATTT GAGTTCTTTG AGAACACCAT TGTGTCCCGC 1080 

ATTGCCCGCA TCTGCAAGGG CGATGAGGGT GGAGAGCGGG TGCTACAGCA GCGCTGGACC 1140 

TCCTTCCTCA AGGCCCAGCT GCTGTGCTCA CGGCCCGACG ATGGCTTCCC CTT CAACGTG 1200 

CTGCAGGATG TCTTCACGCT GAGCCCCAGC CCCCAGGACT GGCGTGACAC CCTTTTCTAT 1260 

80 GGGGTCTTCA CTTCCCAGTG GCACAGGGGA ACTACAGAAG GCTCTGCCGT CTGTGTCTTC 1320 

ACAATGAAGG ATGTGCAGAG AGTCTTCAGC GGCCTCTACA AGGAGGTGAA CCGTGAGACA 1380 

CAGCAGTGGT ACACCGTGAC CCACCCGGTG CCCACACCCC GGCCTGGAGC GTGCATCACC 1440 

AACAGTGCCC GGGAAAGGAA GATCAACTCA TCCCTGCAGC TCCCAGACCG CGTGCTGAAC 1500 

TTCCTCAAGG ACCACTTCCT GATGGACGGG CAGGTCCGAA GCCGCATGCT GCTGCTGCAG 1560 

85 CCCCAGGCTC GCTACCAGCG CGTGGCTGTA CACCGCGTCC CTGGCCTGCA CCACACCTAC 1620 

GATGTCCTCT TCCTGGGCAC TGGTGACGGC CGGCTCCACA AGGCAGTGAG CGTGGGCCCC 1680 

CGGGTGCACA TCATTGAGGA GCTGCAGATC TTCTCATCGG GACAGCCCGT GCAGAATCTG 1740 



369 
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CTCCTGGACA CCCACAGGGG GCTGCTGTAT GCGGCCTCAC ACTCGGGCGT AGTCCAGGTG 
CCCATGGCCA ACTGCAGCCT GTACAGGAGC TGTGGGGACT GCCTCCTCGC CCGGGACCCC 
TACTGTGCTT GGAGCGGCTC CAGCTGCAAG CACGTCAGCC TCTACCAGCC TCAGCTGGCC 
ACCAGGCCGT GGATCCAGGA CAT CGAGGGA GCCAGCGCCA AGGACCTTTG CAGCGCGTCT 
TCGGTTGTGT CCCCGTCTTT TGTACCAACA GGGGAGAAGC CATGTGAG CA AGTCCAGTTC 
CAGCCCAACA CAGTGAACAC TTTGGCCTGC CCGCTCCTCT CCAACCTGGC GACCCGACTC 
TGGCTACGCA ACGGGGCCCC CGTCAATGCC TCGGCCTCCT GCCACGTGCT ACCCACTGGG 
GACCTGCTGC TGGTGGGCAC CCAACAGCTG GGGGAGTTCC AGTGCTGGTC ACTAGAGGAG 
GGCTTCCAGC AGCTGGTAGC CAGCTACTGC CCAGAGGTGG TGGAGGACGG GGTGGCAGAC 
CAAACAGATG AGGGTGGCAG TGTACCCGTC ATTATCAGCA CATCGCGTGT GAGTGCACCA 
GCTGGTGGCA AGGCCAGCTG GGGTGCAGAC AGGTCCTACT GGAAGGAGTT CCTGGTGATG 
TGCACGCTCT TTGTGCTGGC CGTGCTGCTC CCAGTTTTAT TCTTGCTCTA CCGGCACCGG 
AACAGCATGA AAGTCTTCCT GAAGCAGGGG GAATGTGCCA GCGTGCACCC CAAGACCTGC 
CCTGTGGTGC TGCCCCCTGA GACCCGCCCA CTCAACGGCC TAGGGCCCCC TAGCACCCCG 
CTCGATCACC GAGGGTACCA GTCCCTGTCA GACAGCCCCC CGGGGTCCCG AGTCTTCACT 
GAGTCAGAGA AGAGGCCACT CAGCATCCAA GACAGCTTCG TGGAGGTATC CCCAGTGTGC 
CCCCGGCCCC GGGTCCGCCT TGGCTCGGAG AT CCGTG ACT CTGTGGTGTG AGAGCTGACT 
TCCAGAGGAC GCTGCCCTGG CTTCAGGGGC TGTGAATGCT CGGAGAGGGT CAACTGGACC 
TCCCCTCCGC TCTGCTCTTC GTGGAACACG ACCGTGGTGC CCGGCCCTTG GGAGCCTTGG 
GGCCAGCTGG CCTGCTGCTC TCCAGTCAAG TAGCGAAGCT CCTACCACCC AGACACCCAA 
ACAGCCGTGG CCCCAGAGGT CCTGGCCAAA TATGGGGGCC TGCCTAGGTT GGTGGAACAG 
TGCTCCTTAT GTAAACTGAG CCCTTTGTTT AAAAAACAAT TCCAAATGTG AAACTAGAAT 
GAGAGGGAAG AGATAGCATG GCATGCAGCA CACACGGCTG CTCCAGTTCA TGGCCTCCCA 
GGGGTGCTGG GGATGCATCC AAAGTGGTTG TCTGAGACAG AGTTGGAAAC CCTCACCAAC 
TGGCCTCTTC ACCTTCCACA TTATCCCGCT GCCACCGGCT GCCCTGTCTC ACTGCAGATT 
CAGGACCAGC TTGGGCTGCG TGCGTTCTGC CTTGCCAGTC AGCCGAGGAT GTAGTTGTTG 
CTGCCGTCGT CCCACCACCT CAGGGACCAG AGGGCTAGGT TGGCACTGCG GCCCTCACCA 
GGTCCTGGGC TCGGACCCAA CTCCTGGACC TTTCCAGCCT GTATCAGGCT GTGGCCACAC 
GAGAGGACAG CGCGAGCTCA GGAGAGATTT CGTGACAATG TACGCCTTTC CCTCAGAATT 
CAGGGAAGAG ACTGTCGCCT GCCTTCCTCC GTTGTTGCGT GAGAACCCGT GTGCCCCTTC 
CCACCATATC CACCCTCGCT CCATCTTTGA ACTCAAACAC GAGGAACTAA CTGCACCCTG 
GTCCTCTCCC CAGTCCCCAG TTCACCCTCC ATCCCTCACC TTCCTCCACT CTAAGGGATA 
TCAACACTGC CCAG CACAGG GGCCCTGAAT TTATGTGGTT TTTATACATT TTTTAATAAG 
ATGCACTTTA TGTCATTTTT TAATAAAGTC TGAAGAATTA CTGTTT 

Seq ID NO: 4 79 Protein sequence 
Protein Accession #: XP_044533.3 

1 11 21 31 41 51 

I I I I I I 

MLRTAMGIiRS WL.AAPWGALP PRPPLLLLLL LLLLLQPPPP TWALSPRISL PLGSEERPFL 

RFEAEHISNY TALLLSRDGR TLYVGAREAL FALSSNLSFL PGGEYQELLW GADAEKKQQC 12 0 

SFKGKDPQRD CQNYIKILLP LSGSHLFTCG TAAFSPMCTY INMENFTLAR DEKGNVLLED 180 

GKGRCPFDPN FKSTALWDG EliYTGTVSSF QGNDPAISRS QSLRPTKTES SLNWLQDPAF 240 

VASAYIPESIi GSLQGDDDKI YFFFSETGQE FEFFENTIVS RIARICKGDE GGERVLQQRW 300 

TSFLKAQIjLC SRPDDGFPFN VLQDVFTLSP SPQDWRDTLF YGVFTSQWHR GTTEGSAVCV 360 

FTMKDVQRVF SGLYKEVNRE TQQWYTVTHP VPTPRPGACI TNSARERKIN SSLQLPDRVL 42 0 

NFLKDHFLMD GQVRSRMLLL QPQARYQRVA VHRVPGLHHT YDVLFLGTGD GRLHKAVSVG 480 

PRVHIIEELQ IFSSGQPVQN LLLDTHRGLL YAASHSGWQ VPMANCSLYR SCGDCLLARD 540 

PYCAWSGSSC KHVSLYQPQIi ATRPWIQDIE GASAKDLCSA SSWSPSFVP TGEKPCEQVQ 600 

FQPNTVNTLA CPLLSNLATR LWLRNGAPVN ASASCHVLPT GDLLLVGTQQ LGEFQCWSLE 660 

EGFQQLVASY CPEWEDGVA DQTDEGGSVP VIISTSRVSA PAGGKASWGA DRSYWKEFLV 720 

MCTLFVLAVL LPVLFLLYRH RNSMKVFLKQ GECASVHPKT CPWLPPETR PLNGLGPPST 780 
PLDHRGYQSL SDSPPGSRVF TESEKRPLSI QDSFVEVSPV CPRPRVRLGS EIRDSW 

Seq ID NO: 480 DNA sequence 

Nucleic Acid Accession #: NM_004217.1 

Coding sequence: 58.. 1092 

1 11 21 31 41 51 

GGCCGGGAGA GTAGCAGTGC CTTGGACCCC AGCTCTCCTC CCCCTTTCTC TCTAAGGATG 60 

GCCCAGAAGG AGAACTCCTA CCCCTGGCCC TACGGCCGAC AGACGGCTCC ATCTGGCCTG 12 0 

AGCACCCTGC CCCAGCGAGT CCTCCGGAAA GAGCCTGTCA CCCCATCTGC ACTTGTCCTC 180 

ATGAGCCGCT CCAATGTCCA GCCCACAGCT GCCCCTGGCC AGAAGGTGAT GGAGAATAGC 240 

AGTGGGACAC CCGACATCTT AACGCGGCAC TTCACAATTG ATGACTTTGA GATTGGGCGT 300 

CCTCTGGGCA AAGGCAAGTT TGGAAACGTG TACTTGGCTC GGGAGAAGAA AAGCCATTTC 360 

ATCGTGGCGC TCAAGGTCCT CTTCAAGTCC CAGATAGAGA AGGAGGGCGT GGAGCATCAG 420 

CTGCGCAGAG AGAT CGAAAT CCAGGCCCAC CTGCACCATC CCAACATCCT GCGTCTCTAC 480 

AACTATTTTT ATGACCGGAG GAGGATCTAC TTGATTCTAG AGTATGCCCC CCGCGGGGAG 540 

CTCTACAAGG AGCTGCAGAA GAGCTGCACA TTTGACGAGC AGCGAACAGC CACGATCATG 600 

GAGGAGTTGG CAGATGCTCT AATGTACTGC CATGGGAAGA AGGTGATTCA CAGAGACATA 660 

AAG CCAG AAA ATCTGCTCTT AGGGCTCAAG GGAGAGCTGA AGATTGCTGA CTTCGGCTGG 72 0 

TCTGTGCATG CGCCCTCCCT GAGGAGGAAG ACAATGTGTG GCACCCTGGA CTACCTGCCC 780 

CCAGAGATGA TTGAGGGGCG CATGCACAAT GAGAAGGTGG ATCTGTGGTG CATTGGAGTG 840 

CTTTGCTATG AGCTGCTGGT GGGGAACCCA CCCTTTGAGA GTGCATCACA CAACGAGACC 900 

TATCGCCGCA TCGTCAAGGT GGACCTAAAG TTCCCCGCTT CTGTGCCCAC GGGAGCCCAG 960 

GACCTCATCT CCAAACTGCT CAGGCATAAC CCCTCGGAAC GGCTGCCCCT GGCCCAGGTC 102 0 

TCAGCCCACC CTTGGGTCCG GGCCAACTCT CGGAGGGTGC TGCCTCCCTC TGCCCTTCAA 1080 

TCTGTCGCCT GATGGTCCCT GTCATTCACT CGGGTGCGTG TGTTTGTATG TCTGTGTATG 1140 

TATAGGGGAA AGAAGGG AT C CCTAACTGTT CCCTTATCTG TTTTCTACCT CCTCCTTTGT 1200 
TTAATAAAGG CTGAAGCTTT TTGT 

Seq ID NO: 481 Protein sequence 
Protein Accession #: NP_004208 

1 11 21 31 41 51 
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GACCAGGGGG GCCGGGAGGG CCCCGTGCCC CAACCCCAGG CTCAGGGTCT GGCCGAGAAG 
ATCCGAGCCT GCTGCTACCT TGAGTGCTCA GCCTTGACGC AGAAGAACTT GAAGGAAGTA 
TTTGACTCGG CTATTCTCAG TGCCATTGAG CACAAAGCCC GGCTGGAGAA GAAACTGAAT 
GCCAAAGGTG TGCGCACCCT CTCCCGCTGC CGCTGGAAGA AGTTCTTCTG CTTCGTTTGA 

Seq ID NO: 485 Protein sequence 
Protein Accession it; PGENESH predicted 



PCT/US02/12476 



720 
780 
840 



1 
I 

MPPRELSEAE 
RPTALDTFSG 
LPNSGSPRPA 
FQNITEKWLP 
IRACCYLECS 



11 
I 

PPPLRAPTPP 
TYVQSPVRPR 
PAVQVLVDGA 
EIRTHNPQAP 
ALTQKNLKEV 



21 
I 

PRRRSAPPEL 
GCGGAVHRGA 
PVR I ELWDTA 
VLLVGTQADL 
FDSAILSAIE 



31 
I 

G I KCVLVGDG 
GAGVSAGGRR 
GQEDFDRLRS 
RDDVNVLIQL 
HKARLEKKLN 



41 
I 

AVGKSSLIVS 
GPRGGDWSRP 
LCYPDTDVFL 
DQGGREGPVP 
AKGVRTLSRC 



Seq ID NO: 486 DNA sequence 

Nucleic Acid Accession #: XM_063832.2 

Coding sequence: 1..711 



ATGCCGCCGC 
CCGCGGCGGC 
GCCGTGGGCA 
CGGCCCACTG 
ATTGAGCTCT 
CCGGATACCG 
ATCACAGAGA 
GTGGGCACCC 
GGCCGGGAGG 
TGCTGCTACC 
GCTATTCTCA 
GTGCGCACCC 



11 
I 

GGGAGCTGAG 
GTAGCGCGCC 
AGAGCAGCCT 
CGCTGGACAC 
GGGACACAGC 
ATGTCTT CCT 
AATGGCTGCC 
AGGCCGACCT 
GCCCCGTGCC 
TTGAGTGCTC 
GTGCCATTGA 
TCTCCCGCTG 



21 
I 

CGAGG CCGAG 
CCCAGAGCTG 
CATCGTCAGC 
CTTCTCTGTG 
GGGACAGGAG 
GGCGTGCTTC 
CGAGATCCGC 
GAGGGACGAT 
CCAACCCCAG 
AGCCTTGACG 
GCACAAAGCC 
CCGCTGGAAG 



31 

I 

CCGCCCCCGC 
GGCAT CAAGT 
TACACCTGCA 
CAAGT CCTGG 
GATTTTGACC 
AGCGTGGTGC 
ACGCACAACC 
GTCAACGTAC 
GCTCAGGGTC 
CAGAAGAACT 
CGGCTGGAGA 
AAGTTCTTCT 



41 
I 

TCCGGGCCCC 
GCGTGCTGGT 
ATGGGTACCC 
TGGATGGAGC 
GACTTCGTTC 
AGCCCAGCTC 
CCCAGGCGCC 
TAATTCAGCT 
TGGCCGAGAA 
TGAAGGAAGT 
AGAAACTGAA 
GCTTCGTTTG 



Seq ID NO: 487 Protein sequence 
Protein Accession ft: XP_063832.1 



11 



21 



31 



41 



51 
I 

YTCNGYPARY 
RGGAGAAQDA 
ACFSWQPSS 
QPQAQGLAEK 
RWKKFFCFV 



51 

I 

GACCCCTCCC 
GGGCGACGGC 
CGCGCGCTAC 
TCCGGTGCGC 
CCTTTGCTAC 
CTTTCAAAAC 
TGTGCTGCTG 
GGACCAGGGG 
GATCCGAGCC 
ATTTGACTCG 
TGCCAAAGGT 
A 



51 



I 

MPPRELSEAE PPPLRAPTPP PRRRSAPPEL GI KCVLVGDG AVGKSSLIVS YTCNGYPARY 

RPTALDTFSV QVLVDGAPVR IELWDTAGQE DFDRLRSLCY PDTDVFLACF SWQPSSFQN 

ITEKWLPEIR THNPQAPVLL VGTQADLRDD VNVLIQLDQG GREGPVPQPQ AQGLAEKIRA 

CCYLECSALT QKNLKEVFDS AILSAIEHKA RLEKKLNAKG VRTLSRCRWK KFFCFV 

Seq ID NO: 488 DNA sequence 

Nucleic Acid Accession #: NM_0143 98.1 

Coding sequence: 64.. 13 14 



GGCACCGATT 
ACCATGCCCC 
CACGATGGCA 
ACTGCAGCAG 
CCTCACCAAA 
ACAGTAAAAA 
ATTACCTACA 
GTTACTGAAG 
CCACCAGCTC 
ACTCAACCCA 
ACAACCGGTC 
AATACCACCC 
CCATCGTCAG 
GCAGAGATGG 
TACTTCAACA 
AACCTTCTGT 
TCATATTATA 
CAAGGAATCA 
GTGAGTGAAC 
CTT CAAGCCT 
TACACAATTG 
GGTGTCTATA 
CCCGGGGGGA 
TTGGGAAATT 
AATGAAGTGA 
GTTTATTTTA 
GCCACTCAAA 
AGCCTTCAAA 
ATTTTATTTT 
GCTTTTACTA 
ACTCCTTTTC 
CAGGCTGGAG 
TGATTCTCCT 
GCTAATTTTT 
CTCTTGACCT 
AGCCATTGCG 



11 
1 

CGGGGCCTGC 
GGCAGCTCAG 
GTCAAATGAG 
CAACAGTACA 
CTTTAGCAGC 
TTCCAACAAC 
CCCTGGTCAC 
TTACAGTCGG 
ATACAGCTGG 
GTAACCAGAC 
AGAAGCCTGA 
GCACAGCTGC 
TCAAGACTGG 
GGATACAGCT 
TCGACCCCAA 
TGAATTTTCA 
TCAGTGAAGT 
AACATG CGGT 
AGAGCCT CCA 
TTGATTTTGA 
TGCTTCCTGT 
AAATCCGCCT 
ATGAAAATAA 
CCCTCAGAGT 
GTCATGTGTG 
TGAAAGATAT 
GTCAACATTT 
TTATAAACCA 
ACCCTTGATC 
TCTGTGTTTT 
CACTTTAAAT 
TACAGTGGCA 
GCTTCAGCTT 
GTATTTTTAT 
CAGGTGATCC 
CCCGGCCTTA 



21 
I 

CCGGACTTCG 
CGCGGCGGCC 
AGCAAAAGCA 
GGACATAAAA 
AAGATTCATG 
TACCCCAGCA 
AACCCAGGCC 
CCCTAGCTTA 
AACCAGTTCA 
CACCCTTCCA 
TCAACCCACC 
ACCTGCCTCC 
AATTTAT CAG 
GATTGTT CAA 
CGCAACGCAA 
GGGCGGATTT 
GGGAGCCTAT 
GGTGATGTTC 
GTTGTCAGCC 
AGATGACCAC 
GATTGGGGCC 
AAGGTGTCAA 
TGGAATTTAG 
GTGGGTCCTT 
ATTTAAGTTC 
AGTGAGCTGT 
GAGATATGTT 
AGGGTCAATT 
TTAACAAAGC 
ATGGTTTCAT 
TTGTTTTTGT 
CGATCTCGGC 
CCCGAGTAGC 
TATAGACGGG 
ACCCACCTCA 
AATGTTTTTT 



31 
I 

CCGCACGCTG 
GCGCTCTTCG 
TTTCCAGAAA 
AAACCTGTCC 
GATGGTCATA 
ACTACAAAAA 
ACACCCAACA 
GCCCCTTATT 
TCAACCGTCA 
GCAACTTTAT 
CATGCCCCAG 
ACGGTTCCTG 
GTTCTAAACG 
GACAAGGAGT 
GCCTCTGGGA 
GTGAATCTCA 
TTGACCGTCT 
CAGACAGCAG 
CACCTGCAGG 
TTTGGAAATG 
ATCGTGGTTG 
TCATCTGGAT 
AGAACTCTTT 
CAAACAATGT 
AGGCAGCACA 
TTATTTTCTA 
GAATTAACAT 
GTAACTAATA 
CTTTGCTTTG 
GTAACATACA 
TTTTTGAGAC 
TTATGGCAAC 
TGGGATTACA 
TTT CACCATG 
GCCTCCCAAA 
TTAATCATCA 



41 
I 

CAGAACCTCG 
CGTCCCTGGC 
CCAGAGATTA 
AGCAACCAGC 
TCACCTTTCA 
ACACTGCAAC 
ACTCACACAC 
CACTGCCACC 
GCCACACAAC 
CGATAGCACT 
GAACAACGGC 
GGCCCACCCT 
GAAGCAGACT 
CGGTTTTTTC 
ACTGTGGCAC 
CATTTACCAA 
CAGATCCAGA 
TCGGGCATTC 
TGAAAACAAC 
TGGATGAGTG 
GTCTCTGCCT 
ACCAGAGAAT 
CATCCCTTCC 
AAACCACCAT 
TCAATTTCTA 
GTTTCCTTTA 
AATATATGTA 
CTACTGTGTG 
TTATCAAATG 
TATTCCTGGT 
GGAGTTTCAC 
CTCCGCCTCC 
GGCACACACT 
TTGGCCAGAC 
GTGCTGGGAT 
AAAAGAACAA 



51 
I 

CCCAGCGCCC 
CGTAATTTTG 
TTCTCAACCT 
TAAGCAAGCA 
AACAGCGGCC 
CACCAGCCCA 
AGCTCCTCCA 
CACCATCACC 
TGGGAACACC 
GCACAAAAGC 
AGCTGCCCAC 
TGCACCTCAG 
CTGTATAAAA 
ACCTCGGAGA 
CCGAAAATCC 
GGATGAAGAA 
GACAGTTTAC 
CTTCAAGTGC 
CGATGTCCAA 
CTCGTCTGAC 
TATGGGTATG 
CTAATTGTTG 
AGGATGGATG 
CTTCTATTCA 
AATACTTTTT 
GAATATTTTA 
AAGTAGAATA 
TGCATTGAAG 
GACTTTCAGT 
GTAG CACTT A 
TCTTGTCACC 
CGGGTTCAAG 
ACCACGCCTG 
TGGTCTTGAA 
TACAGGCATG 
CATATCT CAG 



60 
120 
180 
240 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



60 
120 
180 
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600 
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1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
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GTTGTCTAAG TGTTTTTATG TAAAACCAAC AAAAAGAACA 
CTTGATGACT CCTGCTCCAG AATTGCTAGA CTAAGAATTA 
CTAAACAATA AG CAAG AG AC AATAATAATG GCCCTTAATT 
TAGGCTAAGC ACTTTATCTA TATCTCATTT CATTCTCACA 
ACTGAGACTT AAGGGAACTG AATCACTTAA ATGTCACCTG 
GAGCTTGAAT TCATGTTGGT CTGACATCAA GGTCTTTGGT 
CCTACAAGAA CAATGACACC ACACTCTGCC TGAAGGCTCA 
TCACCTTACA GGGAAATGGG TTTATCCAGG AT C ATG AG AC 
AGCTTTGCAG ATAACAAAAT AGCCTATCCT TAATAAATCC 
TGAGGGGCTT TGTAAAACAT TAGTCAGTTG CTCATTTTTA 
GTAAAGATGA AGGCAT CAAA TAAACTCAAA GTATTTTTAA 
ACTTCGCTAA CCAACTGTTC TTTCTTGAGT GTATAGCCCC 
TTCTGCACTT CATATCCATA TTTCCTATTG TTCACTTTAT 
GAATTTTATT TCTGCTGTTT TTTTTGCTGC TAAAGAAAGG 
AGAAAAGTCC ACATAACCCT AGAATTCTTA GTCAAGGAAT 
CCATGTTGAC TTTCCTCATG TGTTTCCTTA TGACTCAGTA 
TAGTCTTAAT AAAACATTGA ATTGTAGTAA AGGTTTTTGC 

Seq ID NO: 489 Protein sequence 
Protein Accession #: NP_055213.1 



AATCAGCTTA 
GGTGGCTACA 
ATTAACAAAG 
ACTTATAAGT 
GCTAACTGAT 
CTTCTCCCTA 
CACCTCATAC 
ATTAGGGTAG 
TCCACTCTCT 
TGGGATTGCT 
ATTTTTTTGA 
ATCTTGTGGT 
TCTGTAGAGC 
AACTAAGTCA 
AATTCAAGTC 
AGTTGGCAAG 
AATAAAAACT 



TATTTTTTAT 
GATGGTAGAA 
TGCCAGAGTC 
GAATGAGTAA 
GGCAGAGCCA 
CACCAAGTTA 
CAGCATACGC 
ATGAAAGGAG 
GGAAGGAGAC 
TAGCTGGGCT 
TAATAGAGAA 
AACTTGCTGC 
AGCCTGCCAA 
GGATGTTAAC 
AGCCTAGAGA 
GTCCTGACTT 
TACTTTGG 



MPRQIi S AAAA 
HQTLAARFMD 
TEVTVGPSLA 
TGQKPDQPTH 
EMGIQLIVQD 
YYISEVGAYL 
QAFDFEDDHF 



11 
I 

LFASLAVILH 
GHITFQTAAT 
PYSLPPTITP 
APGTTAAAHN 
KESVFSPRRY 
TVSDPETVYQ 
GNVDECSSDY 



21 
I 

DGSQMRAKAF 
VKIPTTTPAT 
PAHTAGTSSS 
TTRTAAPAST 
FNIDPNATQA 
GIKHAWMFQ 
TIVLPVIGAI 



31 
I 

PETRDYSQPT 
TKNTATTSPI 
TVSHTTGNTT 
VPGPTLAPQP 
SGNCGTRKSN 
TAVGHSFKCV 
WGLCLMGMG 



41 

I 

AAATVQDIKK 
TYTLVTTQAT 
QPSNQTTLPA 
SSVKTGIYQV 
LLLNFQGGFV 
SEQSLQLSAH 
VYKIRLRCQS 



51 
I 

PVQQPAKQAP 
PNNSHTAPPV 
TL S I ALHKST 
LNGSRLCIKA 
NLTFTKDEES 
LQVKTTDVQL 
SGYQRI 



Seq ID NO: 490 DNA sequence 

Nucleic Acid Accession #: NM_005409.3 

Coding sequence: 94.. 3 78 



TTCCTTTCAT 
CAACAGCACC 
TTGGCTGTGA 
TGTCTTTGCA 
ATAATGTACC 
AAAGGACAAC 
GAAAGAAAGA 
AACCTAGAAC 
AGACTTTTCT 
GGGTGAAAGG 
CTGCCCAAAG 
GGTTACCATC 
GCATTTCTAG 
GAGAACATTT 
CTGTGGTTAC 
CAT CTATGTG 
CCAAATATCA 
TTTATAACCA 
TGGGATACTG 
GATGTTTTTC 
TGTACTTTTT 
TACAAAATGT 
AATCACTTTT 
TTGTTCATGC 
GTCATTTTTT 



11 

I 

GTTCAGCATT 
AGCAGCAACA 
TATTGTGTGC 
TAGGCCCTGG 
CAAGTAACAA 
GATGCCTAAA 
ATTTTTAAAA 
AAGTTTAACT 
ATGGTTTTGT 
ACCAAAAACA 
GAGTCCAGCA 
GGAGTTTACA 
GCTAGAGAAC 
CTGTCTCTAG 
AGTGGAGACA 
TCGTAAAGCA 
TGTAGCACAT 
ATTCATTAAA 
GCAACAGTGC 
AACTTTTATT 
GTTTTGATCC 
TTTTGTCTAC 
ACTTTTTGTA 
CTATATACTG 
TCTCTAATAA 



21 
1 

TCTACTCCTT 
GCAAAAAACA 
TACAGTTGTT 
GGTAAAAGCA 
CTGTGACAAA 
TCCCAAATCG 
ATATCAAAAC 
GTGACTACTG 
GACTTTCAAC 
GAAATACAGT 
AT T AAATGG A 
AAGTGCTTTC 
CTT CTAGATT 
AAGTTATCTG 
TTGACATTAT 
TTCCTCAAAC 
CAATATGTAG 
TGTAATTCAT 
ACATATTTCA 
CATTGAGATG 
GTTTGTATAA 
CAAAGAAAAA 
ATTCTGTCTC 
TAAAATTTAG 
ACTACCACAA 



31 

1 

CCAAGAAGAG 
AACATGAGTG 
CAAGGCTTCC 
GTGAAAGTGG 
ATAGAAGTGA 
AAGCAAGCAA 
AT ATGAAGT C 
AAATGACAAG 
TTTTGTACAG 
CTTCCTGAAT 
TTTCTAGGAA 
ACGTTCTTAC 
TGATGCTTAC 
TCTGTATTGA 
TACTGGAGTC 
ATTTTTTCAT 
GGAAACATTC 
AAAATGTACT 
TAACCAAATT 
TTTTGAAGCA 
ATGATAGCAA 
TGTTGAAAAA 
TTAGAAAAAT 
GTATACTCAA 
CCTTTCTTTT 



41 
I 

CAGCAAAGCT 
TGAAGGGCAT 
CCATGTTCAA 
CAGATATTGA 
TTATTACCCT 
GGCTTATAAT 
CTGGAAAAGG 
AATTCTACAG 
TTATGTGAAG 
GAATGACAAT 
AAGCTACCTT 
TTGTTGTATT 
AACTATTCTG 
TCTTTATGCT 
AAGCCCTTAT 
GCAAATACAC 
TTATGCATCA 
ATGAAAAAAA 
AGCAGCACCG 
ATTAGGATAT 
TATCTTGGAC 
TAAGCAAATG 
ACATAATCTA 
GACTAGTTTA 
TTAAAAAAAA 



51 

i 

GAAGTAGCAG 
GGCTATAGCC 
AAGAGGACGC 
GAAAGCCTCC 
GAAAGAAAAT 
CAAAAAAGTT 
GCATCTGAAA 
TAGGAAACTG 
GATGAAAGGT 
CAGAATTCCA 
AAGAAAGGCT 
ATACATTCAT 
TTGTGACTAT 
ATATTACTAT 
AAGTCAAAAG 
ACTTCTTTCC 
TTTGGTTTGT 
TTATACGCTA 
GTCTTAATTT 
GTGTGTTTAC 
ACATTTGAAA 
TATACCTAGC 
ATCAATTTCT 
AAGAATCAAA 
AAA 



Seq ID NO: 491 Protein sequence 
Protein Accession #: NP_005400.1 

1 11 21 



31 

I 



41 



51 



MSVKGMAIAL AVILCATWQ GFPMFKRGRC LCIGPGVKAV KVADIEKASI MYP SNNCDKI 
EVIITLKENK GQRCLNPKSK QARLIIKKVE RKNF 

Seq ID NO: 492 DNA sequence 

Nucleic Acid Accession #: NM_000577.1 

Coding sequence: 41.. 520 



GGCACGAGGG 
CCGACCCTCT 
GAAGACCTTC 
CAATTTAGAA 
CCATGGAGGG 
GGAGGCAGTT 
CATCCGCTCA 
CCTCTGCACA 
CGT CATGGTC 
TCCCATTCTT 



11 
I 

GAAGACCTCC 
GGGAGAAAAT 
TAT CTGAGG A 
GAAAAGATAG 
AAGATGTGCC 
AACATCACTG 
GACAGTGGCC 
GCGATGGAAG 
ACCAAATTCT 
GCATGGCAAG 



21 

1 

TGT CCTATCA 
CCAGCAAGAT 
ACAACCAACT 
ATGTGGTACC 
TGTCCTGTGT 
ACCTGAGCGA 
CCACCACCAG 
CTGACCAGCC 
ACT TCCAGGA 
GACTGCAGGG 



31 
1 

GGCCCTCCCC 
GCAAGCCTTC 
AGTTGCCGGA 
CATTGAGCCT 
CAAGTCTGGT 
GAACAGAAAG 
TTTTGAGTCT 
CGTCAGCCTC 
GGACGAGTAG 
ACTGCCAGTC 



41 
1 

ATGGCTTTAG 
AGAATCTGGG 
TACTTGCAAG 
CATGCTCTGT 
GATGAGACCA 
CAGGACAAGC 
GCCGCCTGCC 
ACCAATATGC 
TACTGCCCAG 
CCCCTGCCCC 



51 
I 

AGACGATCTG 
ATGTTAACCA 
GACCAAATGT 
TCTTGGGAAT 
GACTCCAGCT 
GCTTCGCCTT 
CCGGTTGGTT 
CTGACGAAGG 
GCCTGCCTGT 
AGGGCTCCCG 



2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
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60 
120 
180 
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



60 



60 
120 
180 
- 240 
300 
360 
420 
480 
540 
600 



373 



WO 02/086443 

GCTATGGGGG CACTGAGGAC CAGCCATTGA GGGGTGGACC CTCAGAAGGC GTCACAACAA 660 

CCTGGTCACA GGACTCTGCC TCCTCTTCAA CTGACCAGCC TCCATGCTGC CTCCAGAATG 720 

GTCTTTCTAA TGTGTGAATC AGAGCACAGC AGCCCCTGCA CAAAGCCCTT CCATGTCGCC 780 

TCTGCATTCA GGATCAAACC CCGACCACCT GCCCAACCTG CTCTCCTCTT GCCACTGCCT 840 

CTTCCTCCCT CATTCCACCT TCCCATGCCC TGGATCCATC AGGCCACTTG ATGACCCCCA 900 

ACCAAGTGGC TCCCACACCC TGTTTTACAA AAAAGAAAAG ACCAGTCCAT GAGGGAGGTT 960 

TTTAAGGGTT TGTGGAAAAT GAAAATTAGG ATTTCATGAT TTTTTTTTTT CAGTCCCCGT 102 0 

GAAGGAGAGC CCTTCATTTG GAGATTATGT TCTTTCGGGG AG AGG CTGAG GACTTAAAAT 1080 

ATTCCTGCAT TTGTGAAATG ATGGTGAAAG TAAGTGGTAG CTTTTCCCTT CTTTTTCTTC 1140 

TTTTTTTGTG ATGTCCCAAC TTGTAAAAAT TAAAAGTTAT GGTACTATGT TAGCCCCATA 120 0 

ATTTTTTTTT TCCTTTTAAA ACACTTCCAT AAT CTGG ACT CCTCTGTCCA GGCACTGCTG 1260 

CCCAGCCTCC AAGCTCCATC TCCACTCCAG ATTTTTTACA GCTGCCTGCA GTACTTTACC 132 0 

TCCTATCAGA AGTTTCTCAG CTCCCAAGGC TCTGAGCAAA TGTGGCTCCT GGGGGTTCTT 13 80 

TCTTCCTCTG CTGAAGGAAT AAATTGCTCC TTGACATTGT AGAGCTTCTG GCACTTGGAG 1440 

ACTTGTATGA AAGATGG CTG TGCCTCTGCC TGTCTCCCCC ACCAGGCTGG GAGCTCTGCA 150 0 

GAGCAGGAAA CATGACTCGT ATATGTCTCA GGTCCCTGCA GGGCCAAGCA CCTAGCCTCG 1560 

CTCTTGGCAG GTACTCAGCG AATGAATGCT GTATATGTTG GGTG CAAAGT TCCCTACTTC 1620 

CTGTGACTTC AGCTCTGTTT TACAATAAAA TCTTGAAAAT GCCTAAAAAA AAAAAAAAAA 1680 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 

Seq ID NO: 4 93 Protein sequence 
Protein Accession #: NP_000568.1 

1 11 21 31 41 51 

1 I I I I I . 

MALETICRPS GRKSSKMQAF RIWDVNQKTF YLRNNQLVAG YLQGPNVNLE EKIDWPIEP 60 

HALFLGIHGG KMCLSCVKSG DETRLQLEAV NITDLSENRK QDKRFAFIRS DSGPTTSFES 12 0 
AACPGWFLCT AMEADQPVSL TNMPD EGVMV TKFYFQEDE 

Seq ID NO: 494 DNA sequence 

Nucleic Acid Accession #: NM_002081.1 

Coding sequence: 222.. 1898 

1 11 21 31 41 51 

I I I I I 1 

GGCTGCCCGA GCGAGCGTTC GGACCTCGCA CCCCGCGCGC CCCGCGCCGC CGCCGCCGCC 60 

GGCTTTTGTT GTCTCCGCCT CCTCGGCCGC CGCCGCCTCT GGACCGCGAG CCGCGCGCGC 12 0 

CGGGACCTTG GCTCTGCCCT TCGCGGGCGG GAACTGCGCA GGACCCGGCC AGGATCCGAG 180 

AGAGGCGCGG GCGGGTGGCC GGGGGCGCCG CCGGCCCCGC CATGGAGCTC CGGGCCCGAG 2 40 

GCTGGTGGCT GCTATGTGCG GCCGCAGCGC TGGTCGCCTG CGCCCGCGGG GACCCGGCCA 300 

GCAAGAGCCG GAGCTGCGGC GAGGTCCGCC AGATCTACGG AGCCAAGGGC TTCAGCCTGA 3 60 

GCGACGTGCC CCAGGCGGAG ATCTCGGGTG AGCACCTGCG GATCTGTCCC CAGGGCTACA 42 0 

CCTGCTGCAC CAGCGAGATG GAGGAGAACC TGGCCAACCG CAGCCATGCC GAGCTGGAGA 4 80 

CCGCGCTCCG GGACAGCAGC CGCGTCCTGC AGGCCATGCT TGCCACCCAG CTGCGCAGCT 540 

TCGATGACCA CTTCCAGCAC CTGCTGAACG ACTCGGAGCG GACGCTGCAG GCCACCTTCC 600 

CCGGCGCCTT CGGAGAGCTG TACACGCAGA ACGCGAGGGC CTTCCGGGAC CTGTACTCAG 660 

AGCTGCGCCT GTACTACCGC GGTGCCAACC TGCACCTGGA GGAGACGCTG GCCGAGTTCT 720 

GGGCCCGCCT GCTCGAGCGC CTCTTCAAGC AGCTGCACCC CCAGCTGCTG CTGCCTGATG 780 

ACTACCTGGA CTGCCTGGGC AAGCAGGCCG AGGCGCTGCG GCCCTTCGGG GAGGCCCCGA 840 

GAGAGCTGCG CCTGCGGGCC ACCCGTGCCT TCGTGGCTGC TCGCTCCTTT GTGCAGGGCC 900 

TGGGCGTGGC CAGCGACGTG GTCCGGAAAG TGGCTCAGGT CCCCCTGGGC CCGGAGTGCT 960 

CGAGAGCTGT CATGAAGCTG GTCTACTGTG CTCACTGCCT GGGAGT CCCC GGCGCCAGGC 102 0 

CCTGCCCTGA CTATTGCCGA AATGTGCTCA AGGGCTGCCT TGCCAACCAG GCCGACCTGG 10 80 

ACGCCGAGTG GAGGAACCTC CTGGACTCCA TGGTGCTCAT CACCGACAAG TTCTGGGGTA 114 0 

CATCGGGTGT GGAGAGTGTC ATCGGCAGCG TGCACACGTG GCTGGCGGAG GCCATCAACG 12 00 

CCCTCCAGGA CAACAGGGAC ACGCTCACGG CCAAGGTCAT CCAGGGCTGC GGGAACCCCA 1260 

AGGTCAACCC CCAGGGCCCT GGGCCTGAGG AGAAGCGGCG CCGGGGCAAG CTGGCCCCGC 1320 

GGGAGAGGCC ACCTTCAGGC ACGCTGGAGA AGCTGGTCTC TGAAGCCAAG GCCCAGCTCC 13 80 

GCGACGT CCA GGACTTCTGG ATCAGCCTCC CAGGGACACT GTGCAGTGAG AAGATGGCCC 1440 

TGAGCACTGC CAGTGATGAC CGCTGCTGGA ACGGGATGGC CAGAGGCCGG TACCTCCCCG 15 00 

AGGTCATGGG TGACGGCCTG GCCAACCAGA TCAACAACCC CGAGGTGGAG GTGG AC AT C A 1560 

CCAAGCCGGA CATGACCATC CGGCAGCAGA TCATGCAGCT GAAGATCATG ACCAACCGGC 1620 

TGCGCAGCGC CTACAACGGC AACGACGTGG ACTT CCAGGA CGCCAGTGAC GACGGCAGCG 1680 

GCTCGGGCAG CGGTGATGGC TGT CTGGATG ACCTCTGCGG CCGGAAGGTC AGCAGGAAGA 1740 

GCTCCAGCTC CCGGACGCCC TTGACCCATG CCCTCCCAGG CCTGTCAGAG CAGGAAGGAC 1800 

AGAAGACCTC GGCTGCCAGC TGCCCCCAGC CCCCGACCTT CCTCCTGCCC CTCCTCCTCT 1860 

TCCTGGCCCT TACAGTAGCC AGGCCCCGGT GGCGGTAACT GCCCCAAGGC CCCAGGGACA 192 0 

GAGGCCAAGG ACTGACTTTG CCAAAAATAC AACACAGACG ATATTTAATT CACCTCAGCC 1980 

TGGAGAGGCC TGGGGTGGGA CAGGGAGGGC CGGCGGCTCT GAGCAGGGGC AGGCGCAGAG 2040 

GTCCCAGCCC CAGGCCTGGC CTCGCCTGCC TTTCTGCCTT TTAATTTTGT ATGAGGTCCT 2100 

CAGGTCAGCT GGGAGCCAGT GTGCCCAAAA GCCATGTATT TCAGGGACCT CAGGGGCACC 2160 

TCCGGCTGCC TAGCCCTCCC CCCAGCTCCC TGCACCGCCG CAGAAGCAGC CCCTCGAGGC 2220 

CTACAGAGGA GGCCTCAAAG CAACCCGCTG GAGCCCACAG CGAGCCTGTG CCTTCCTCCC 2280 

CGCCTCCTCC CACTGGGACT CCCAGCAGAG CCCACCAGCC AGCCCTGGCC CACCCCCCAG 2340 

CCTCCAGAGA AGCCCCGCAC GGGCTGTCTG GGTGTCCGCC AT CCAGGGTC TGGCAGAGCC 2400 

T CTGAGATG A TGCATGATGC CCTCCCCTCA GCGCAGGCTG CAGAGCCCGG CCCCACCTCC 2460 

CTGCGCCCTT GAGGGGCCCC AGCGTCTGCA GGGTGACGCC TGAGACAGCA CCACTGCTGA 2520 

GGAGTCTGAG GACTGTCCTC CCACAGACCC TGCAGTGAGG GGCCCTCCAT GCGCAGATGA 2 5 80 

GGGGCCACTG ACCCACCTGC GCTTCTGCTG GAGGAGGGGA AGCTGGGCCC AAAGGCCCAG 2640 

GGAGGCAGCG TGGGCT CTGC CAATGTGGGC TGCCCCTCGC ACACAGGGCT CACAGGGCAG 2700 

GCCTTGCTGG GGTCCAGGGC TGTTGGAGGA CCCCGAGGGC TGAGGAGCAG CCAGGACCCG 2 760 

CCTGCTCCCA TCCTCACCCA GAT CAGGAAC CAGGGCCTCC CTGTTCACGG TGACACAGGT 2 82 0 

CAGGGCTCAG AGTGACCCTC GGCTGTCACC TGCTCACAGG GATGCTGGTG GCTGGTGAGA 2880 

CCCCGCACTG CACACGGGAA TGCCTAGGTC CCTTCCCGAC CCAGCCAGCT GCACTGCAGG 2940 

GCACGGGGAC CTGGATAGTT AAGGGCTTTT CCAAACATGC ATCCATTTAC TGACACTTCC 3000 

TGTCCTTGTT CATGGAGAGC TGTTCGCTCC TCCCAGATGG CTTCGGAGGC CCGCAGGGCC 3 0 60 

CACCTTGGAC CCTGGTGACC TCCTGTCACT CACTGAGGCC AT CAGGGCCC TGCCCCAGGC 3120 



374 



10 



15 



20 



25 



WO 02/086443 

CTGGACGGGC CCTCCTTCCC TCCTGTGCCC CAGCTGCCAG GTGGCCCTGG GGAGGGGTGG 3180 

TGTGGTGTTG GGAAGGGGTC CTGCAGGGGG AGGAGGACTT GGAGGGTCTG GGGGCAGCTG 3240 

TCCTGAACCG ACTGACCCTG AGGAGGCCGC TTAGTGCTGC TTTGCTTTTC ATCACCGTCC 3300 

CGCACAGTGG ACGGAGGTCC CCGGTTGCTG GTCAGGTCCC CATGGCTTGT TCTCTGGAAC 3360 

CTGACTTTAG ATGTTTTGGG ATCAGGAGCC CCCAACACAG GCAAGTCCAC CCCATAATAA 342 0 

CCCTGCCAGT GCCAGGGTGG GCTGGGGACT CTGGCACAGT GATGCCGGGC GCCAGGACAG 3480 

CAGCACTCCC GCTGCACACA GACGGCCTAG GGGTGGCGCT CAGACCCCAC CCTACGCTCA 3540 

TCTCTGGAAG GGGCAGCCCT GAGTGGTCAC TGGTCAGGGC AGTGGCCAAG CCTGCTGTGT 3600 

CCTTCCTCCA CAAGGTCCCC CCACCGCTCA GTGTCAGCGG GTGACGTGTG TTCTTTTGAG 3 660 
TCCTTGTATG AATAAAAGGC TGGAAACCTA AA 



PCT/US02/12476 



Seq ID NO: 4 95 Protein sequence 
Protein Accession #: NP 002072.1 



I 

MELRARGWWL 
ICPQGYTCCT 
TLQATFPGAF 
QLLLPDDYLD 
PLGPECSRAV 
TDKFWGTSGV 
RGKLAPRERP 
RGRYLPEVMG 
ASDDGSGSGS 



11 
I 

LCAAAALVAC 
SEMEENLANR 
GELYTQNARA 
CLGKQAEALR 
MKLVYCAHCL 
ESVIGSVHTW 
PSGTLEKLVS 
DGLANQ INNP 



21 
I 

ARGDPASKSR 
SHAELETALR 
FRDLYSELRL 
PFGEAPRELR 
GVPGARPCPD 
LAEAINALQD 
EAKAQLRDVQ 
EVEVDITKPD 



31 
I 

SCGEVRQIYG 
DSSRVLQAML 
YYRGANLHLE 
LRATRAFVAA 
YCRNVLKGCL 
NRDTL»T AKV I 
DFWISLPGTL 
MTIRQQIMQL 



41 
I 

AKGFSLSDVP 
ATQLRSFDDH 
ETLAEFWARL 
RSFVQGLGVA 
ANQADLDAEW 
QGCGNPKVNP 
CSEKMALSTA 
KIMTNRLRSA 



51 
I 

QAEISGEHLR 
FQHLIiNDSER 
LERLFKQLHP 
SDWRKVAQV 
RNLLDSMVL I 
QGPGPEEKRR 
SDDRCWNGMA 
YNGNDVDFQD 



60 
120 
180 
240 
300 
360 
420 
480 



30 
35 
40 
45 
50 
55 



Seq ID NO: 496 DNA sequence 

Nucleic Acid Accession NM_001650.2 

Coding sequence: 40.1011 



GGGGCAGGCA 
AGGCGGTGGG 
GGGGTCTGGA 
TTTGTTCTCC 
GTCGACATGG 
TTTGGCCATA 
AGGAAGATCA 
ATTGGAGCAG 
ACCATGGTTC 
TTTCAATTGG 
TCAATAGCTT 
ACTGGTGCCA 
GAAAACCATT 
TATGAGTATG 
AAAGCTGCCC 
GAGACGGATG 
GAGGAGAAGA 
CGCACTGAAA 
GAAACAGATT 
GTCTAAACAA 
TCCAAATCTA 
TCTAGTTACC 
CCTGACAGAA 
AGTCAATTCT 



11 

i 

ATGAGAGCTG 
GTAAGTGTGG 
CTCAAGCTTT 
TCAGCCTGGG 
TTCTCATCTC 
TCAGCGGTGG 
GCATCGCCAA 
GAATCCTCTA 
ATGGAAATCT 
TGTTTACTAT 
TAGCAATTGG 
GCATGAATCC 
GGATATATTG 
TCTTCTGTCC 
AGCAAACAAA 
ACCTGATTCT 
AGGGGAAAGA 
GCAGACAAGA 
TGTTATAAAT 
TAAATATTTC 
AAAAAAGAAA 
TTTCATTAAC 
CTCAAAGACA 
TATTTGAATA 



21 
1 

CACTCTGGCT 
ACCTTTGTGT 
CTGGAAAGCA 
ATCCACCATC 
CCTTTGCTTT 
CCACATCAAC 
GTCTGTCTTC 
TCTGGTCACA 
TACCGCTGGT 
CTTTGCCAGC 
ATTTTCTGTT 
CGCCCGATCC 
GGTTGGGCCC 
AGATGTTGAA 
AGGAAGCTAC 
AAAACCTGGA 
CCAATCTGGA 
CTCCTTAGAA 
TAGAAATGTG 
ATAATTTACA 
TATTTTTAAG 
AACCAATTTT 
CGTCTAT CAG 
TTTATTCTAT 



31 

I 

GGGGAAGG CA 
ACCAGAGAGA 
GTCACAGCGG 
AACTGGGGTG 
GGACT CAGCA 
CCTGCAGTGA 
TACATCGCAG 
CCTCCCAGTG 
CATGGTCTCC 
TGTGATTCCA 
GCAATTGGAC 
TTTGGACCTG 
AT CAT AGGAG 
TTCAAACGTC 
ATGGAGGTGG 
GTGGTGCATG 
GAGGTATTGT 
CTGTCCTCAG 
CAGGTTTGTT 
AAGGAGGAAC 
ATGTTCTTAA 
AACCGTGTGT 
CTTATTCCTT 
TAAACTGAGT 



41 


51 




1 

TGAGTGACAG 


1 

ACCCACAGCA 


60 


ACATCATGGT 


GGCTTTCAAA 


120 


AATTTCTGGC 


CATGCTTATT 


180 


GAACAGAAAA 


GCCTTTACCG 


240 


TTGCAACCAT 


GGTGCAGTGC 


300 


CTGTGGCCAT 


GGTGTGCACC 


360 


CCCAGTGCCT 


GGGGGCCATC 


420 


TGGTGGGAGG 


CCTGGGAGTC 


480 


TGGTTGAGTT 


GATAATCACA 


540 


AACGGACTGA 


TGTCACTGGC 


600 


ATTTATTTGC 


AATCAATTAT 


660 


CAGTTATCAT 


GGGAAATTGG 


72 0 


CTGTCCTCGC 


TGGTGGCCTT 


780 


GTTTTAAAGA 


AGCCTTCAGC 


840 


AGGACAACAG 


GAGTCAGGTA 


900 


TGATTGACGT 


TGACCGGGGA 


960 


CTTCAGTATG 


ACTAGAAGAT 


1020 


ATTTCCTTCC 


ACCCATTAAG 


1080 


GTTTCATGTC 


ATATTACT CA 


1140 


GGAAGAAACC 


TATTGTGAAT 


1200 


GCAAATATAT 


ACCTATTTTA 


1260 


CAAGATTTGG 


TTAAGTCTTG 


1320 


CTCTACTGGA 


ATAT TGGTAT 


1380 


TTAACAATGG 


C 





60 
65 
70 
75 
80 
85 



Seq ID NO: 497 Protein sequence 
Protein Accession #: NP 001641.1 



I 

MSDRPTARRW 
GTEKPLPVDM 
AQCLGAI iga 
KRTDVTGSIA 
AVLAGGIiYBY 
VIDVDRGEEK 



11 

I 

GKCGPLCTRE 
VLISLCFGLS 
GILYLVTPPS 
LAIGFSVAIG 
VFCPDVEFKR 
KGKDQSGEVIi 



21 
I 

NIMVAFKGVW 
IATMVQCFGH 
WGGLGVTMV 
HLFAINYTGA 
RFKEAFSKAA 
SSV 



31 



41 



TQAFWKAVTA EFLAMLIFVL 
ISGGHINPAV TVAMVCTRKI 
HGNLTAGHGL LVELIITFQL 
SMNPARSFGP AVIMGNWENH 
QQTKGSYMEV EDNRSQVETD 



51 

t 

LSLGSTINWG 
SIAKSVFYIA 
VFTIFASCDS 
WIYWVGPIIG 
DLILKPGWH 



Seq ID NO: 498 DNA sequence 

Nucleic Acid Accession ft: AB020684. 

Coding sequence : 1 . . 1744 



CCCCCTTGTC 
TTGGTACCGG 
GACGGTTACC 
TGCTTGCTTT 
CATATATGGC 
CTTTTTCAAT 
CTCATATCCA 
ACTTTATAGA 
GCAGTTTGCT 
CGGGTACATT 
ACTTTGTTTT 
TTTGGTAATT 



11 

I 

ATTAATACAT 
ATTTATACCA 
AGAGGAGAAG 
TATGTTGCTG 
ACATATTTAA 
CATGGAGAGT 
TTTCTTGTTC 
GGAAGCTTGA 
CAGTTTGTAC 
GATATATGTA 
GTTTTGATGT 
ATTTGGGGTA 



21 
I 

TAAAAAGATT 
AAATAATGGA 
GACTCAGTCC 
TAATTTTTAT 
GTGGCAGCCG 
GTACCCGTGT 
TTCAGATGTT 
TTGCACTCTG 
TTCTTACTCA 
AATTACGGAA 
TTGGGAACTC 
TTCTGGCAAT 



31 
I 

CAATCTTTAC 
CTTGATTGGT 
TATTGAAAGC 
TTTAAATGGA 
ATTAGGAGGC 
AATGTGGACA 
GCTAGTGACT 
CATTTCCAAT 
GATTGCATCA 
GATCATTTAT 
AATGTTATTA 
GAAACCACAT 



41 

1 

CCTGAGGTAA 
ATTCAAACCA 
TGTGAAGGAT 
CTAATGATGG 
CTGGTTACAG 
CCACCTCTCC 
CATATTCTCA 
GTATTTTTCA 
TTATTTGCAG 
ATACACATGA 
ACTTCTTATT 
TTCCTGAAAA 



51 
I 

TTTTGGCCAG 
AGATATGTTG 
TGGGAGATCC 
CATTATTCTT 
TGTTGTGCTT 
GTGAAAGCTT 
GGGCTACAAA 
TGCTTCCTTG 
TATATGTTGT 
TTTCTCTTGC 
ATGCTTCTTC 
TAAATGTATC 



60 
12 0 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



375 



WO 02/086443 

TGAACTTAGT TTATGGGTTA TTCAAGGATG TTTTTGGTTA TTTGGAACTG T CAT ACTTAA 780 

ATACTTGACA TCTAAAATTT TTGGTATTGC AGATGACGCT CATATTGGCA ACTTACTAAC 840 

ATCAAAATTC TTTAGT TATA AGGATTTTGA TACTTTATTG TATACCTGTG CAGCGGAGTT 900 

TGACTTTATG GAAAAAGAGA CTCCACTGAG ATACACAAAG ACATTATTGC TTCCAGTTGT 960 

5 TCTTGTAGTG TTTGTTGCTA TTGTTAGAAA GATTATTAGT GATATGTGGG GTGTCTTAGC 102 0 

TAAACAACAG ACACATGTAA GAAAACACCA GTTTGATCAT GGAGAGCTGG TTTACCATGC 10 80 

ATTGCAATTG TTAGCATATA CAGCCCTTGG TATTTTAATT ATGAGACTAA AACTCTTCTT 1140 

GACACCACAC ATGTGTGTTA TGGCATCACT GATCTGCTCA AGACAGCTAT TTGGATGGCT 12 00 

CTTTTGCAAA GTACATCCTG GTGCTATTGT GTTTGCTATA TTAGCAGCAA TGTCAATACA 1260 

10 AGGTTCAGCA AATCTGCAAA CCCAGTGGAA TATTGTAGGG GAGTTCAGCA ATTTGCCCCA 13 20 

AGAAGAACTT ATAGAATGGA TCAAATATAG TACTAAACCA GATGCAGTGT TTGCGGGTGC 13 80 

CATGCCCACG ATGGCAAGTG TTAAGCTCTC TGCACTTCGG CCCATTGTGA ATCATCCACA 1440 

TTATGAAGAC GCAGGCTTGA GAGCCAGAAC AAAAATAGTA TACTCAATGT ATAGTCGGAA 1500 

AGCAGCCGAA GAAGTGAAGC GAGAACTGAT AAAGTTAAAA GTGAACTATT ACATTCTAGA 1560 

15 AGAGT CATGG TGTGTAAGAA GATCCAAGCC TGGTTGCAGT ATGCCTGAAA TTTGGGATGT 162 0 

AGAAGAT CCT GCCAATGCTG GGAAAACTCC CTTATGTAAC CTCTTGGTGA AGGATTCCAA 1680 

ACCTCACTTC ACCACTGTAT TCCAGAACAG TGTTTACAAA GTCCTAGAAG TTGTAAAAGA 1740 

ATGACTGCTA CATGACCTGC TGCCTACGGA GAACTACATC TGTAATGGTT TTAATGTTTT 1800 

GCTAAGTCAT GTGTTGTTCA TATCCCAAAA ACTTTTATAG GTAACTGTTT TCAAATAGAA 1860 

20 AACGTTTTAT TTGGTCAATT TGAATGTCAT TCTAATTATA AAAATGACTT ACACCTTTAT 192 0 

CAATTGGTTA CTATTTCAAT GCACCCTTTA AAATTTGCTA TGCAAATGAG TATATGCTTG 1980 

TACTTGACTT TAATATTTGT GCTAAAGTGA GCAAAGCTAC CTGTATAAAG AAAACACAGT 2040 

GGGTTGTGAC AAGGATGACA TGAAAATACA GGACAATTCT GACAATGTAG GGGCTGATTT 2100 

TATAGTGTAA GAACTATTAA TGCCCCTTGC TTCTTTTTTC TGCCTCTTGC TCTTGTCTTT 2160 

25 TGGACATTTC AGTGATTGTA AGTTCTTCGG TCATGTCAGC CCCTGTCATC AACTTGAGTT 2220 

ACAGTAGATG GGGCAGACAT GGAGTGTTTG CTATATAAAA CTATCTGTTT GTTTTACTTC 22 80 

CTTGTGCGCT TTTTGTTCTC TGTTCTCTTG TTAATGAAGC TTTTCCTGCC CATTATTAAT 2340 

CCAAACTCTT GGACCTTGTG GTTAGGAAAT TCCCTTAACT TCCAGCCATA TGGCATTATC 2400 

GTGTCTCTTT CTCTCTCTCT CTTGCTCTCT CTCTTCTCCT CTTCCCCATA TTTTCTGTCA 2460 

30 AATAAGTACT GTTTACTCAT TTAGTTGCTT AT CAAGTACT TATTCTTGGT TTTAAAAAAA 2520 

ATTAATGGTA ACTGTATTTT TCTCATTTTT AGCATTATTC AAATGTTTAT ATTTTAATAC 25 8 0 

CTTTAAACCA CTTTAAAGTT TTTTCATGTT TAATTATAGT TTTAAGAAAA ACT ATTT TGA 2 640 

ACAACCCCAA ATATAGTGCA TCTAGAAACT AATGTATATT TGATTAGACA TCATTTATAG 2700 

TGGAACAGTA GACTGTAGTA CATGGTAATT TTTCTTTTAC TATTAAGATA CAATAAAACA 2760 

35 TGACTAATTT TGCTGTCAAA AATGTAAAGA ATAATGATAA ATGGAGTTTT TTATATTTTA 2 820 

CTTTTAAGAT TGCCTGTCTT TAATAAGACA AAGCCTTAAG CCTTATGTTA TAATTTTGGT 2 880 

TCTAAAAACC ATCATTTCAG TATAAGGAAT AAGTATATTT CGTCCTCCTC TTTAGTTTTT 2 940 

TTCTTCCTAT TTATTT TTAT TTTGAAAAAT TTCTACACCT TCTTTGAATT CCTTGTATGA 3 00 0 

ATTTTTGTTT CTTAGAAGTT AATTTGTGTG AAATGAGATT CTTCAAAACG ATGAAACCTC 3 0 60 

40 ATAGCTCTGA GAAAAGGTTT TAGGGTTTTA AATTCTAAGC AAAGCGTGAC TATGGCTGAC 3120 

AGACTACACA TTTAATTATA CAGCTTCTCT TTCTTAACCA CAGGCAGATT AACCTCATTG 3180 

TGGATTGTCC TTCAGACCTT AGTCCTCAGG CATGGTTTCT GGTGCCCACT CCTGGAAGCC 3240 

GCTGTTCCCT TTCTACCTTC TTACCAGAGC CCAAGGGCAG GCCTGGTCCC GGGGAAGCAG 33 00 

CAGCTTGCTG ACATAAGTCA GCTGCAAAGG CTGAGGAGTG TGCCCTCAGA GAAGCACCGC 33 60 

45 CCCCCAGTCT TGTGCCAGCG CCTAGAGCCG CAGCTCCCAG GGATGCTCCT TCCCTGGAGG 3420 

' CAGCCCAGGA GAGGGACTCT GGCAGCGTTC TTCAGATTTG TGGCCACTGT TTCTCATTTG 3480 

CTGGTTGACT GTTTTTATTT CTTAGGCTTT TGCTAGTTTT AGAAAATAGG GAAGCAGCCC 3 540 

TTGATTTGTG GATTAAAAGC AACATTTGAG CGATGATGCA CAACAGTCCA GGAAAATGGG 3 600 

CGGTGGACAC TTGAGG CTGA GGATGGGAGT TGACATGAGC AGGGAGAGGG AGGTGCGCGC 3660 

50 TGCTTATCTG TGATTGTTGC TCACCTGAGT GTGGCTGATT GTGTACATCC AGCAGTTACA 3720 

ATTTTTAAAA ATTATACTTT TACATTTATT TTATATTTTT CTCACCCCCA GTAATTTCCT 3780 

TCCAAAGAAG TTCACATGTA ATAAGTAGAA ATTCTGTATA GGAAAAAAGC ATTAAAAATA 3 840 

CTATTATAAC TGCTTCATTT GCTGGGAACC ATTAAAAGTA ATATAAAT T A GCTTTTTCCA 3 900 

GAAGGATCCT TTTGTAGCAG TGTTTATGAA TGTAACCCCC AGCAAAATAT GGCTATATAT 3 960 

55 TAGGGGAGCC AGTTTGGAGC AGAGGCCTGA AGGTCCCTGC TATGCAGCCG TGGCCACAGC 4020 

TCGCAGCCCA AGCACTGTGG AGCAT CCACA CCTTTGATGG CAATGCAGAT TGGTAGCAGG 40 80 

TTCCATAGGC GTACAAAACA GTATTAAAGC TCAGTGTTTT GCATATTGTT AGCATTTACA 4140 

AATATTTTTG CTTTAGTATG AGGAAAGTAA GGATGGGCAA AGAAGCGATC AAAATAGCTA 4200 

TTGCTACAAC ATTTTCGAAA ACAAAGTTGG GGCTGTATTT CTTTAAAAAG ATAAGCCTCT 42 60 

60 AAAAATGCTT GGCAAAAAAA ATATAGTGTT AAAATAGGCC AGTGATATTA ATGAGAAAAT 4320 

GAAAGTATGT ATCAGGAATA AAGTGATATT GCATAGGAGT ATTGTATTTT TATGAATTTT 4380 
ATGCCAGTTG TTTACATGTA CTATATATGT TAAATTAAAA AAAATCATGA GAAATG 



PCT/US02/12476 



65 



Seq ID NO: 4 99 Protein sequence 
Protein Accession #: BAA74900.1 



70 
75 
80 



PLVINTLKRF 
ACFYVAVIFI 
SYPFLVLQML 
GYIDICKLRK 
ELSLWVIQGC 
DFMEKETPLR 
LQLLAYTALG 
GSANLQTQWN 
YEDAGLRART 
ED PANAGKTP 



11 
I 

NLYPEVILAS 
LNGLMMALFF 
LVTHI LRATK 
IIYIHMISLA 
FWLFGTVILK 
YTKTLLLPW 
ILIMRLKLFL 
IVGEFSNLPQ 
KIVYSMYSRK 
LCNLLVKDSK 



21 
I 

WYRIYTKIMD 
IYGTYLSGSR 
LYRGSIiIALC 
LCFVLMFGNS 
YLTSKIFGIA 
LWFVAIVRK 
TPHMCVMASL 
EELIEWIKYS 
AAEEVKRELI 
PHFTTVFQNS 



31 
I 

LIGIQTKICW 
LGGLVTVLCF 
ISNVFFMLPW 
MLLTSYYASS 
DD AH I GNLLT 
IISDMWGVLA 
ICSRQLFGWL 
TKPDAVFAGA 
KLKVNYYIIiE 
VYKVLEWKE 



41 
I 

TVTRGEGLSP 
FFNHGECTRV 
QFAQFVLLTQ 
LVIIWGILAM 
SKFFSYKDFD 
KQQTHVRKHQ 
FCKVHPGAIV 
MPTMASVKLS 
ESWCVRRSKP 



51 
I 

IESCEGLGDP 
MWTPPLRESF 
IASLFAVYW 
KPHFLKINVS 
TLLYTCAAEF 
FDHGELVYHA 
FAILAAMSIQ 
ADRPIVNHPH 
GCSMPEIWDV 



60 
120 
180 
240 
300 
360 
420 
480 
540 



Seq ID NO: 50 0 DNA sequence 

Nucleic Acid Accession ft: NM_001276. 

Coding sequence: 127.. 1278 



1 11 21 31 41 51 

85 j | j | | | 

AGTGGAGTGG GACAGGTATA TAAAGGAAGT ACAGGGCCTG GGGAAGAGGC CCTGTCTAGG 
TAGCTGGCAC CAGGAGCCGT GGGCAAGGGA AGAGGCCACA CCCTGCCCTG CTCTGCTGCA 



60 
120 



376 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 



WO 02/086443 

GCCAGAATGG GTGTGAAGGC GTCTCAAACA 
TGCTCTGCAT ACAAACTGGT CTGCTACTAC 
GGGAGCTGCT TCCCAGATGC CCTTGACCGC 
GCCAATATAA GCAACGATCA CATCGACACC 
ATGCTCAACA CACTCAAGAA CAGGAACCCC 
TGGAACTTTG GGTCTCAAAG ATTTTCCAAG 
TTCATCAAGT CAGTACCGCC ATTCCTGCGC 
TGGCTCTACC CTGGACGGAG AGACAAACAG 
GCCGAATTTA TAAAGGAAGC CCAGCCAGGG 
TCTGCGGGGA AGGTCACCAT TGACAGCAGC 
GATTTCATTA GCATCATGAC CTACGATTTT 
CACAGTCCCC TGTTCCGAGG TCAGGAGGAT 
TATGCTGTGG GGTACATGTT GAGGCTGGGG 
CCCACCTTCG GGAGGAGCTT CACTCTGGCT 
TCAGGACCGG GAATTCCAGG CCGGTTCACC 
ATCTGTGACT TCCTCCGCGG AGCCACAGTC 
GCCACCAAGG GCAACCAGTG GGTAGGATAC 
CAGTACCTGA AGGATAGGCA GCTGGCAGGC 
TTCCAGGGCT CCTTCTGCGG CCAGGATCTG 
GCACTCGCTG CAACGTAGCC CTCTGTTCTG 
CCCCCTCTGG CTCCAGCTGG CCGGGAGCCT 
GCCTCAGTCT CCCTCCCTTG GGGCCTATGC 
GCCCTGGTGG GCAGAGAGGT AGGGATGGGG 
GACTCGGGAT TAGTACACAC TTGTTGATGA 
TGGCAAGGGA ATTTCTTCAA CTCCCTGCCC 
TGGCAAGCTC TATCACCAAG GAGCCAAACA 
TACCCCCTGC AAAGCCAGCT TGAAACCTTC 
ACTTCCCCTT CCTAATTCCA CAGCTGCTCA 
CGCTTTGCTT TGGTCTATCT TTGAGCGCCC 
TCTTCTGGGT TCCTTCCTCT GAGCCTTGGG 
ATGTT 

Seq ID NO: 501 Protein sequence 
Protein Accession #: NP_0 01267.1 



PCT/US02/12476 



GGCTTTGTGG 
ACCAGCTGGT 
TTCCTCTGTA 
TGGGAGTGGA 
AACCTGAAGA 
ATAGCCTCCA 
ACCCATGGCT 
CATTTTACCA 
AAAAAGCAGC 
TATGACATTG 
CATGGAGCCT 
GCAAGTCCTG 
GCTCCTGCCA 
TCTTCTGAGA 
AAGGAGGCAG 
CATAGAACCC 
GACGACCAGG 
GCCATGGTAT 
CGCTTCCCTC 
CACACAGCAC 
GATCACCTGC 
AGAGGTCCAC 
CTGTGGGGAT 
TTAATGGAAA 
CCTAGCCCTC 
TCCTACAAGA 
ACTTAGGAAC 
ATAAAGTACA 
ACTAGACCCA 
ACCCCTGAGC 



TCCTGGTGCT 
CCCAGTACCG 
CCCACATCAT 
ATGATGTGAC 
CTCTCTTGTC 
ACACCCAGAG 
TTGATGGGCT 
CCCTAATCAA 
TCCTGCTCAG 
CCAAGATATC 
GGCGTGGGAC 
ACAGATTCAG 
GTAAGCTGGT 
CTGGTGTTGG 
GGACCCTTGC 
TCGGCCAGCA 
AAAGCGTCAA 
GGGCCCTGGA 
TCACCAATGC 
GGGGGCCAAG 
CCTGCTGAGT 
AACACACAGA 
AGTGAGGCAT 
TGTTTACAGA 
CTTAT CAAAG 
CACAGTGACC 
GTAATCGTGT 
AGAGTTTAAC 
CTGGACTCAC 
TTGCAGAGAT 



GCTCCAGTGC 
GGAAGGCGAT 
CTACAGCTTT 
GCTCTACGGC 
TGTCGGAGGA 
TCGCCGGACT 
GGACCTTGCC 
GGAAATGAAG 
CGCAGCACTG 
CCAACACCTG 
CACAGGCCAT 
CAACACTGAC 
GATGGGCATC 
AGCCCCAATC 
CTACTATGAG 
GGTCCCCTAT 
AAG CAAGGTG 
CCTGGATGAC 
CATCAAGGAT 
GATGCCCCGT 
CCCAGGCTGA 
TTTGAGCTCA 
CGCAATGTAA 
TCCCCAAGCC 
GACACCATTT 
ATACTAATTA 
CCCCTATCCT 
AGTGTGTTGG 
CTCCCCCATC 
GAAGGCCGCC 



MGVKASQTGF 
ISNDHIDTWE 
KSVPPFLRTH 
GKVTIDSSYD 
VGYMLRLGAP 
DFLRGATVHR 
GSFCGQDLRF 



11 
! 

WLVLLQCCS 
WNDVTLYGML 
GFDGLDLAWL 
IAKISQHLDF 
ASKIjVMGIPT 
TLGQQVPYAT 
PLTNAIKDAL 



21 
I 

AYKLVCYYTS 
NTLKNRNPNL 
YPGRRDKQHF 
ISIMTYDFHG 
FGRSFTLASS 
KGNQWVGYDD 
AAT 



31 
i 

WSQYREGDGS 
KTLLSVGGWN 
TTLIKEMKAE 
AWRGTTGHHS 
ETGVGAPISG 
QESVKSKVQY 



41 

! 

CFPDALDRFL 
FGSQRFSKIA 
FIKEAQPGKK 
PLFRGQEDAS 
PGIPGRFTKE 
LKDRQLAGAM 



51 
1 

CTHIIYSFAN 
SNTQSRRTFI 
QLLLSAALSA 
PDRFSNTDYA 
AGTLAYYEIC 
VWALDLDDFQ 



Seq ID NO: 502 DNA sequence 

Nucleic Acid Accession #: NM_006474.1 

Coding sequence: 181.. 669 



GCTGCCTAGG 
TCCGGCCCCC 
TTCCCCCAGC 
ATGTGGAAGG 
GAAGGAGCCA 
GTTGCCATGC 
AAGTCTGGCT 
GAGGATCTGC 
GCCTCAAACG 
GTTGAGAAAG 
GCCATCGGTT 
TCGCCCTAAA 
TTCTGACTCT 
CGGGCCCATT 
TCACCAGATT 



11 

i 

GTCTGGAAAG 
CCACCGTCGC 
TCAGAATCTT 
TGTCAGCTCT 
GCACAGGCCA 
CAGGTGCCGA 
TGACAACTCT 
CAACTTCAGA 
TGGCCACCAG 
ATGGTTTGTC 
TCATTGGTGG 
GAGCTGAAGG 
GTGGCCCTGT 
CAGATTCCAC 
TGGTTCTTAA 



21 
I 

CTCGGGCACC 
GCTCCTCCAG 
GCTGCTCGGC 
GCTCTTCGTT 
GCCAGAAGAT 
AGATGATGTG 
GGTGGCAACA 
AAGCACAGTC 
TCACTCCACG 
AACAGTGACC 
AATCAT CGTT 
GTTACGCCCT 
CCCTGAGCTC 
GGTGACTTTC 
ACTTT 



31 

i 

CTCCCTCTCC 
GCTGGGCCTG 
CCCCAGGAGA 
TTGGGAAGCG 
GACACTGAGA 
GTGACTCCAG 
AGTGTCAACA 
CACGCGCAAG 
GAGAAAGTGG 
CTGGTTGGAA 
GTGGTTATGC 
GCTTGCCAAC 
GTGGGGAGAA 
CGTTTGCCAA 



41 

I 

GGGGCTCCTG 
TGGCCGCGGT 
GCAACAACTC 
CGTCGCTCTG 
CTACAGGTTT 
GAACCAGCGA 
GTGTAACAGG 
AACAAAGTCC 
ATGGAGACAC 
TCATAGTTGG 
GAAAAATGTC 
GTGCTTTAAA 
GATGACCCTG 
ATTAACCGAG 



51 
I 

CTCCCACCCC 
GCTTTTAATT 
AACGGGAACG 
GGT CCTGGCA 
GGAAGGCGGC 
AGACCGCTAT 
CATTCGCATC 
AAGCGCCACA 
ACAGACAACA 
GGT CTT ACTA 
GGGAAGGTAC 
AAAAGACCGT 
GGAACATTTG 
GAAAGACCTT 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
114,0 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 



60 
120 
180 
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



Seq ID NO: 503 Protein sequence 
Protein Accession #: NP_006465.1 



11 



21 



31 



41 



51 



MWKVSALLFV LGSASLWVLA EGASTGQPED DTETTGLEGG VAMPGAEDDV VTPGTSEDRY 60 
KSGLTTLVAT SVNSVTGIRI EDLPTSESTV HAQEQSPSAT ASNVATSHST EKVDGDTQTT 12 0 
75 VEKDGLSTVT LVGIIVGVLD AIGFIGGIIV WMRKMSGRY SP 



80 



85 



Seq ID NO: 504 DNA sequence . 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 62.. 8 95 



1 
I 

CACTGCTCTG 
TATGATCATC 
CAAGGATGGA 
AGAAGCACGG 
TGAAGGCGGC 



11 

1 

AGAATTTGTG 
TTAATTTACT 
ATTTTTCATA 
TCTGGCAAAT 
CATCTCGCAA 



21 
1 

AGCAGCCCCT 
TATTTCTCTT 
ACTCCATATG 
ACAAGCTCAC 
CTTACAAGCA 



31 

I 

AACAGGCTGT 
GCTATGGGAA 
GCTTGAACGA 
CTACGCAGAA 
GCTAGAGGCA 



41 

I 

TACTTCACTA 
GACACTCAAG 
GCAGCCGGTG 
GCTAAGGCGG 
GCCAGAAAAA 



51 
I 

CAACTGACGA 
GATGGGGATT 
TGTACCACAG 
TGTGTGAATT 
TTGGATTTCA 



60 
120 
180 
240 
300 



377 



10 



15 



20 



WO 02/086443 

TGTCTGTGCT qctGGATGGA TGGCTAAGGG CAGAGTTGGA TACCCCATTG TGAAGCCAGG 360 

GCCCAACTGT GGATTTGGAA AAACTGGCAT TATTGATTAT GGAATCCGTC TCAATAGGAG 420 

TGAAAGATGG GATGCCTATT GCTACAACCC ACACGCAAAG GAGTGTGGTG GCGTCTTTAC 480 

AGAT CCAAAG CAAATTTTTA AATCTCCAGG CTTCCCAAAT GAGTACGAAG ATAACCAAAT 540 

CTGCTACTGG CACATTAGAC TCAAGTATGG TCAGCGTATT CACCTGAGTT TTTTAGATTT 60 0 

TGACCTTGAA GATGACCCAG GTTGCTTGGC TGATTATGTT GAAATATATG ACAGTTACGA 660 

TGATGT CCAT GGCTTTGTGG GAAGATACTG TGGAGATGAG CTTCCAGATG ACAT CATCAG 720 

TACAGGAAAT GTCATGACCT TGAAGTTTCT AAGTGATGCT TCAGTGACAG CTGGAGGTTT 780 

CCAAATCAAA TATGTTGCAA TGGATCCTGT ATCCAAATCC AGTCAAGGAA AAAATACAAG 840 

TACTACTTCT ACTGGAAATA AAAACTTTTT AGCTGGAAGA TTTAGCCACT TATAAAAAAA 900 

AAAAAAAGGA TGATCAAAAC ACACAGTGTT TATGTTGGAA TCTTTTGGAA CTCCTTTGAT 960 

CTCACTGTTA TTATTAACAT TTATTTATTA TTTTTCTAAA TGTGAAAGCA ATACATAATT 1020 

TAGGGAAAAT TGGAAAATAT AGGAAACTTT AAACGAGAAA ATGAAACCTC TCATAATCCC 1080 

ACTGCATAGA AATAACAAGC GTTAACATTT TCATATTTTT TTCTTTCAGT CATTTTTCTA 1140 

TTTGTGGTAT ATGTATATAT GTACCTATAT GTATTTGCAT TTGAAATTTT GGAATCCTGC 1200 

TCTATGTACA GTTTTGTATT ATACTTTTTA AATCTTGAAC TTTATAAACA TTTTCTGAAA 1260 

TCATTGATTA TTCTACAAAA ACATGATTTT AAACAGCTGT AAAATATTCT ATGATATGAA 132 0 

TGTTTTATGC ATTATTTAAG CCTGTCTCTA TTGTTGGAAT TTCAGGTCAT TTTCATAAAT 1380 
ATTGTTGCAA TAAATATCCT TGAACACACA AAAAAAAAAA AA 

Seq ID NO: 505 Protein sequence 
Protein Accession Eos sequence 



PCT/US02/12476 



25 



30 



i 

I 

MIILIYLFLL 
EGGHLATYKQ 
ERWDAYCYNP 
DLEDDPGCLA 
QIKYVAMDPV 



11 
) 

LWEDTQGWGF 
LEAARKI GFH 
HAKECGGVFT 
DYVEIYDSYD 
SKSSQGKNTS 



21 
I 

KDGIFHNSIW 
VCAAGWMAKG 
DPKQIFKSPG 
DVHGFVGRYC 
TTSTGNKNFL 



31 



41 



LERAAGVYHR EARSGKYKLT 
RVGYP I VKPG PNCGFGKTGI 
FPNEYEDNQI CYWHIRLKYG 
GDELPDDIIS TGNVMTLKFL 
AGRFSHL 



51 
I 

YAEAKAVCEF 
IDYGIRLNRS 
QRIHLSFLDF 
SDASVTAGGF 



60 
120 
180 
240 



35 



40 



45 



50 



55 



60 



Seq ID NO: 50 6 DNA sequence 

Nucleic Acid Accession #: NM_007115.1 

Coding sequence: 69.. 902 



GAATTCGCAC 
CTGACGATAT 
GGGGATTCAA 
ACCACAGAGA 
GTGAATTTGA 
GATTTCATGT 
AGCCAGGGCC 
ATAGGAGTGA 
TCTTTACAGA 
ACCAAATCTG 
TAGATTTTGA 
GTTACGATGA 
TCATCAGTAC 
GAGGTTTCCA 
ATACAAGTAC 
AAAAAAAAAA 
TTGATCTCAC 
TAATTTAGGG 
ATCCCACTGC 
TTGTATTTGT 
„ CCTGCTCTAT 
TGAAATCATT 
ATGAATGTTT 
TAAATATTGT 



11 
I 

TGCTCTGAGA 
GATCATCTTA 
GGATGGAATT 
AGCACGGTCT 
AGGCGGCCAT 
CTGTGCTGCT 
CAACTGATGA 
AAGATGGGAT 
TCCAAAGCGA 
CTACTGGCAC 
CCTTGAAGAT 
TGTCCATGGC 
AGGAAATGTC 
AATCAAATAT 
TACTTCTACT 
AAGGATGATC 
TGTTATTATT 
AAAATTGGAA 
ATAGAAATAA 
GGTATATGTA 
GTACAGTTTT 
GATTATTCTA 
TATGCATTAT 
TGCAATAAAT 



21 

i 

ATTTGTGAGC 
ATTTACTTAT 
TTTCATAACT 
GGCAAATACA 
CTCGCAACTT 
GGATGGATGG 
TTTGGAAAAA 
GCCTATTGCT 
ATTTTTAAAT 
ATTAGACTCA 
GACCCAGGTT 
TTTGTGGGAA 
ATGACCTTGA 
GTTGCAATGG 
GGAAATAAAA 
AAAACACACA 
AACATTTATT 
AATATAGGAA 
CAAGCGTTAA 
TATATGTACC 
GTATTATACT 
CAAAAACATG 
TTAAGCCTGT 
ATCCTTCGGA 



31 
I 

AGCCCCTAAC 
TTCTCTTGCT 
CCATATGGCT 
AGCTCACCTA 
ACAAGCAGCT 
CTAAGGGCAG 
CTGGCATTAT 
ACAACCCACA 
CTCCAGGCTT 
AGTATGGTCA 
GCTTGGCTGA 
GATACTGTGG 
AGTTTCTAAG 
ATCCTGTATC 
ACTTTTTAGC 
GTGTTTATGT 
TATTATTTTT 
ACTTTAAACG 
CATTTTCATA 
TATATGTATT 
TTTTAAATCT 
ATTTTAAACA 
CTCTATTGTT 
ATTC 



41 

I 

AGGCTGTTAC 
ATGGGAAGAC 
TGAACGAGCA 
CGCAGAAGCT 
AGAGGCAGCC 
AGTTGGATAC 
TGATTATGGA 
CGCAAAGGAG 
CCCAAATGAG 
GCGTATTCAC 
TTATGTTGAA 
AGATGAGCTT 
TGATGCTTCA 
CAAATCCAGT 
TGGAAGATTT 
TGGAATCTTT 
CTAAATGTGA 
AGAAAATGAA 
TTTTTTTCTT 
TGCATTTGAA 
TGAACTTTAT 
GCTGTAAAAT 
GGAATT TCAG 



51 
I 

TTCACTACAA 
ACTCAAGGAT 
GCCGGTGTGT 
AAGGCGGTGT 
AGAAAAATTG 
CCCATTGTGA 
ATCCGTCTCA 
TGTGGTGGCG 
TACGAAGATA 
CTGAGTTTTT 
ATATATGACA 
CCAGATGACA 
GTGACAGCTG 
CAAGGAAAAA 
AGCCACTTAT 
TGGAACTCCT 
AAGAAATACA 
ACCTCTCATA 
TCAGTCATTT 
ATTTTGGAAT 
GAACATTTTC 
ATTCTATGAT 
GTCATTTTCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



65 
70 



Seq ID NO: 507 Protein sequence 
Protein Accession #: NP 009046.1 



MIILIYLFLL 
EGGHLATYKQ 
ERWDAYCYNP 
DLEDDPGCLA 
QIKYVAMDPV 



11 
I 

LWEDTQGWGF 
LEAARKIGFH 
HAKECGGVFT 
DYVEIYDSYD 
SKSSQGKNTS 



21 



31 



41 



51 
i 



KDGIFHNSIW LERAAGVYHR EARSGKYKLT YAEAKAVCEF 60 

VCAAGWMAKG RVGYP I VKPG PNXXFGKTGI IDYGIRLNRS 12 0 

DPKRIFKSPG FPNEYEDNQI CYWHIRLKYG QRIHLSFLDF 180 

DVHGFVGRYC GDELPDDIIS TGNVMTLKFL SDASVTAGGF 240 
TTSTGNKNFL AGRFSHL 



75 



Seq ID NO: 508 DNA sequence 

Nucleic Acid Accession #: NM_0 01044,1 

Coding sequence: 129.. 1991 



80 



85 



i 
I 

ACCGCTCCGG 
AAAGCCCAGG 
GTGTGCCCAT 
CTAAGGAGCC 
ACGGAGTGCA 
AGGATCGGGA 
TGGACCTGGC 



11 

I 

AGCGGGAGGG 
CCCGGGCGGC 
GAGTAAGAGC 
CAATGCCGTG 
GCTCACCAGC 
GACCTGGGGC 
CAACGTCTGG 



21 

I 

GAGGCTTCGC 
CAGACCAAGA 
AAATGCTCCG 
GGCCCGAAGG 
TCCACCCTCA 
AAGAAGATCG 
CGGTTCCCCT 



31 
I 

GGAACGCTCT 
GGGAAGAAGC 
TGGGACTCAT 
AGGTGGAGCT 
CCAACCCGCG 
ACTTTCT CCT 
ACCTGTGCTA 



41 
I 

CGGCGCCAGG 
ACAGAATTCC 
GTCTTCCGTG 
CATCCTTGTC 
GCAGAGCCCC 
GTCCGTCATT 
CAAAAATGGT 



51 

I 

ACTCGCGTGC 
TCAACTCCCA 
GTGGCCCCGG 
AAGGAGCAGA 
GTGGAGGCCC 
GGCTTTGCTG 
GGCGGTGCCT 



60 
120 
180 
240 
300 
360 
420 
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TCCTGGTCCC CTACCTGCTC TTCATGGTCA TTGCTGGGAT GCCACTTTTC TACATGGAGC 480 

TGGCCCTCGG CCAGTTCAAC AGGGAAGGGG CCGCTGGTGT CTGGAAGATC TGCCCCATAC 540 

TGAAAGGTGT GGGCTTCACG GTCATCCTCA TCTCACTGTA TGTCGGCTTC TTCTACAACG 60 0 

TCATCATCGC CTGGGCGCTG CACTATCTCT TCTCCTCCTT CACCACGGAG CTCCCCTGGA 660 

TCCACTGCAA CAACTCCTGG AACAGCCCCA ACTGCTCGGA TGCCCATCCT GGTGACTCCA 720 

GTGGAGACAG CTCGGGCCTC AACGACACTT TTGGGACCAC ACCTGCTGCC GAGTACTTTG 780 

AACGTGGCGT GCTGCACCTC CACCAGAGCC ATGGCATCGA CGACCTGGGG CCTCCGCGGT 840 

GGCAGCT CAC AGCCTGCCTG GTGCTGGTCA TCGTGCTGCT CTACTTCAGC CTCTGGAAGG 900 

GCGTGAAGAC CTCAGGGAAG GTGGTATGGA TCACAGCCAC CATGCCATAC GTGGTCCTCA 960 

CTGCCCTGCT CCTGCGTGGG GTCACCCTCC CTGGAGCCAT AGACGGCATC AGAGCATACC 102 0 

TGAGCGTTGA CTTCTACCGG CTCTGCGAGG CGTCTGTTTG GATTGACGCG GCCACCCAGG 10 80 

TGTGCTTCTC CCTGGGCGTG GGGTTCGGGG TGCTGATCGC CTTCTCCAGC TACAACAAGT 1140 

TCACCAACAA CTGCTACAGG GACGCGATTG TCACCACCTC CATCAACTCC CTGACGAGCT 1200 

TCTCCTCCGG CTTCGTCGTC TTCTCCTTCC TGGGGTACAT GGCACAGAAG CACAGTGTGC 1260 

CCATCGGGGA CGTGGCCAAG GACGGGCCAG GGCTGATCTT CATCATCTAC CCGGAAGCCA 1320 

TCGCCACGCT CCCTCTGTCC TCAGCCTGGG CCGTGGT CTT CTTCATCATG CTGCTCACCC 13 80 

TGGGTATCGA CAGCGCCATG GGTGGTATGG AGT CAGTGAT CACCGGGCTC ATCGATGAGT 1440 

TCCAGCTGCT GCACAGACAC CGTGAGCTCT TCACGCTCTT CATCGTCCTG GCGACCTTCC 1500 

TCCTGTCCCT GTTCTGCGTC ACCAACGGTG GCATCTACGT CTTCACGCTC CTGGACCATT 1560 

TTGCAGCCGG CACGTCCATC CTCTTTGGAG TGCTCATCGA AGCCATCGGA GTGGCCTGGT 162 0 

TCTATGGTGT TGGGCAGTTC AGCGACGACA TCCAGCAGAT GACCGGGCAG CGGCCCAGCC 1680 

TGTACTGGCG GCTGTGCTGG AAGCTGGTCA GCCCCTGCTT TCTCCTGTTC GTGGTCGTGG 1740 

TCAGCATTGT GACCTTCAGA CCCCCCCACT ACGGAGCCTA CATCTTCCCC GACTGGGCCA 1800 

ACGCGCTGGG CTGGGTCATC GCCACATCCT CCATGGCCAT GGTGCCCATC TATGCGGCCT 1860 

ACAAGTTCTG CAGCCTGCCT GGGTCCTTTC GAGAGAAACT GGCCTACGCC ATTGCACCCG 1920 

AGAAGGACCG TGAGCTGGTG GACAGAGGGG AGGTGCGCCA GTTCACGCTC CGCCACTGGC 1980 

TCAAGGTGTA GAGGGAGCAG AGACGAAGAC CCCAGGAAGT CATCCTGCAA TGGGAGAGAC 2 040 

ACGAACAAAC CAAGGAAATC TAAGTTTCGA GAGAAAGGAG GGCAACTTCT ACTCTTCAAC 2100 

CTCTACTGAA AACACAAACA ACAAAGCAGA AGACTCCTCT CTTCTGACTG TTTACACCTT 2160 

TCCGTGCCGG GAGCGCACCT CGCCGTGTCT TGTGTTGCTG TAATAACGAC GTAGATCTGT 2220 

GCAGCGAGGT CCACCCCGTT GTTGTCCCTG CAGGGCAGAA AAACGTCTAA CTTCATGCTG 2280 

TCTGTGTGAG GCTCCCTCCC TCCCTGCTCC CTGCTCCCGG CTCTGAGGCT GCCCCAGGGG 2340 

CACTGTGTTC TCAGGCGGGG AT CACGAT CC TTGTAGACGC ACCTGCTGAG AATCCCCGTG 2400 

CTCACAGTAG CTTCCTAGAC CATTTACTTT GCCCATATTA AAAAGCCAAG TGTCCTGCTT 2460 

GGTTTAGCTG TGCAGAAGGT GAAATGGAGG AAACCACAAA TTCATGCAAA GTCCTTTCCC 2 52 0 

GATGCGTGGC TCCCAGCAGA GGCCGTAAAT TGAGCGTTGA. GTTGACACAT TGCACACACA 2 580 

GTCTGTTCAG AGGCAT TGGA GGATGGGGGT CCTGGTATGT CTCACCAGGA AATTCTGTTT 2640 

ATGTTCTTGC AGCAGAGAGA AATAAAACTC CTTGAAACCA GCTCAGGCTA CTGCCACTCA 270 0 

GGCAGCCTGT GGGTCCTTGT GGTGTAGGGA ACGGCCTGAG AGGAGCGTGT CCTATCCCCG 2 760 

GACGCATGCA GGGCCCCCAC AGGAGCGTGT CCTATCCCCG GACGCATGCA GGGCCCCCAC 2 82 0 

AGGAGCATGT CCTATCCCTG GACGCATGCA GGGCCCCCAC AGGAGCGTGT ACTACCCCAG 2880 

AACGCATGCA GGGCCCCCAC AGGAGCGTGT ACTACCCCAG GACGCATGCA GGGCCCCCAC 2 940 

TGGAGCGTGT ACTACCCCAG GACGCATGCA GGGCCCCCAC AGGAGCGTGT CCTATCCCCG 3000 

GACCGGACGC ATGCAGGGCC CCCACAGGAG CGTGTACTAC CCCAGGACGC ATGCAGGGCC 30 60 

CCCACAGGAG CGTGTACTAC CCCAGGATGC ATGCAGGGCC CCCACAGGAG CGTGTACTAC 3120 

CCCAGGACGC ATGCAGGGCC CCCATGCAGG CAGCCTGCAG ACCAACACTC TGCCTGGCCT 3180 

TGAGCCGTGA CCTCCAGGAA GGGACCCCAC TGGAATTTTA TTTCTCTCAG GTGCGTGCCA 3240 

CATCAATAAC AACAGTTTTT ATGTTTGCGA ATGGCTTTTT AAAATCATAT TTACCTGTGA 33 00 

ATCAAAACAA ATTCAAGAAT GCAGT AT CCG CGAGCCTGCT TGCTGATATT GCAGTTTTTG 33 60 

TTTACAAGAA TAATTAGCAA TACTGAGTGA AGGATGTTGG CCAAAAGCTG CTTTCCATGG 342 0 

CACACTGCCC TCTGCCACTG ACAGGAAAGT GGATGCCATA GTTTGAATTC ATGCCTCAAG 3 480 

TCGGTGGGCC TGCCTACGTG CTGCCCGAGG GCAGGGGCCG TGCAGGGCCA GTCATGGCTG 3540 

TCCCCTGCAA GTGGACGTGG GCTCCAGGGA CTGGAGTGTA ATGCTCGGTG GGAGCCGTCA 3 60 0 

GCCTGTGAAC TGCCAGGCAG CTGCAGTTAG CACAGAGGAT GGCTTCCCCA TTGCCTTCTG 3 660 

GGGAGGGACA CAGAGGACGG CTTCCCCATC GCCTTCTGGC CGCTGCAGTC AGCAGAGAGA 3720 

GCGGCTTCCC CATTGCCTTC TGGGGAGGGA CACAGAGGAC AGTTTCCCCA TCGCCTTCTG 3 780 

GTTGTTGAAG ACAGCACAGA GAGCGGCTTC CCCATCGCCT TCTGGGGAGG GGCTCCGTGT 3 840 

AGCAACCCAG GTGTTGTCCG TGTCTGTTGA CCAATCTCTA TTCAGCATCG TGTGGGTCCC 3 900 
TAAGCACAAT AAAAGACATC CACAATGGAA AAAAAAAAAG GAATTC 

Seq ID NO: 50 9 Protein sequence 
Protein Accession #i NP_001035.1 

1 11 21 31 41 51 

I I I I I I 

MSKSKCSVGL MSSWAPAKE PNAVGPKEVE LILVKEQNGV QLTSSTLTNP RQSPVEAQDR 60 

ETWGKKIDFL LSVIGFAVDL ANVWRPPYLC YKNGGGAFLV PYLLFMVIAG MPLFYMELAL 120 

GQFNREGAAG VWKICPILKG VGFTVILISL YVGFFYNVI I AWALHYLFSS FTTELPWIHC 180 

NNSWNSPNCS DAHPGDSSGD SSGLNDTFGT TPAAEYFERG VLHLHQSHGI DDLGPPRWQL 240 

TACLVLVIVL LYFSLWKGVK TSGKWWITA TMPYWLTAL LLRGVTLPGA IDGIRAYLSV 3 00 

DFYRLCEASV WIDAATQVCF SLGVGFGVDI AFSSYNKFTN NCYRDAIVTT S1NSLTSFSS 3 60 

GFWFSFLGY MAQKHSVPIG DVAKDGPGLI FIIYPEAIAT LPLSSAWAW FFIMLLTLGI 42 0 

DSAMGGMESV ITGLIDEFQL LHRHRELFTL FIVLATFLLS LFCVTNGGIY VFTLLDHFAA 480 

GTSILFGVLI EAIGVAWFYG VGQFSDDIQQ MTGQRPSLYW RLCWKLVSPC FLLFWWSI 540 

VTFRPPHYGA YIFPDWANAL GWVIATSSMA MVP I YAAYKF CSLPGSFREK LAYAIAPEKD 600 
REDVDRGEVR QFTLRHWLKV 

Seq ID NO: 510 DNA sequence 

Nucleic Acid Accession #: NM_001216.1 

Coding sequence: 43. , 1422 

1 11 21 31 41 51 

I I 1 1 1 I 

GCCCGTACAC ACCGTGTGCT GGGACACCCC ACAGTCAGCC GCATGGCTCC CCTGTGCCCC 60 

AGCCCCTGGC TCCCTCTGTT GATCCCGGCC CCTGCTCCAG GCCTCACTGT GCAACTGCTG 120 

CTGTCACTGC TGCTTCTGAT GCCTGTCCAT CCCCAGAGGT TGCCCCGGAT GCAGGAGGAT 180 

TCCCCCTTGG GAGGAGGCTC TTCTGGGGAA GATGACCCAC TGGGCGAGGA GGATCTGCCC 240 
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AGTGAAGAGG ATTCACCCAG AGAGGAGGAT CCACCCGGAG AGGAGGATCT ACCTGGAGAG 3 00 

GAGGATCTAC CTGGAGAGGA GGATCTACCT GAAGTTAAGC CTAAATCAGA AGAAGAGGGC 3 60 

TCCCTGAAGT TAGAGGATCT ACCTACTGTT GAGGCTCCTG GAGATCCTCA AGAACCCCAG 420 

AATAATGCCC ACAGGGACAA AGAAGGGGAT GACCAGAGTC ATTGGCGCTA TGGAGGCGAC 480 

CCGCCCTGGC CCCGGGTGTC CCCAGCCTGC GCGGGCCGCT TCCAGTCCCC GGTGGATATC 540 

CGCCCCCAGC TCGCCGCCTT CTGCCCGGCC CTGCGCCCCC TGGAACTCCT GGGCTTCCAG 60 0 

CTCCCGCCGC TCCCAGAACT GCGCCTGCGC AACAATGGCC ACAGTGTGCA ACTGACCCTG 660 

CCTCCTGGGC TAGAGATGGC TCTGGGTCCC GGGCGGGAGT ACCGGGCTCT GCAGCTGCAT 720 

CTGCACTGGG GGGCTGCAGG TCGTCCGGGC TCGGAGCACA CTGTGGAAGG CCACCGTTTC 780 

CCTGCCGAGA TCCACGTGGT TCACCTCAGC ACCGCCTTTG CCAGAGTTGA CGAGGCCTTG 840 

GGGCGCCCGG GAGGCCTGGC CGTGTTGGCC GCCTTTCTGG AGGAGGGCCC GGAAGAAAAC 900 

AGTGCCTATG AGCAGTTGCT GTCTCGCTTG GAAGAAATCG CTGAGGAAGG CTCAGAGACT 960 

CAGGTCCCAG GACTGGACAT ATCTGCACTC CTGCCCTCTG ACTTCAGCCG CTACTTCCAA 102 0 

TATGAGGGGT CTCTGACTAC ACCGCCCTGT GCCCAGGGTG TCATCTGGAC TGTGTTTAAC 10 80 

CAGACAGTGA TGCTGAGTGC TAAGCAGCTC CACACCCTCT CTGACACCCT GTGGGGACCT 1140 

GGTGACTCTC GGCTACAGCT GAACTTCCGA GCGACGCAGC CTTTGAATGG GCGAGTGATT 1200 

GAGGCCTCCT TCCCTGCTGG AGTGGACAGC AGTCCTCGGG CTGCTGAGCC AGTCCAGCTG 12 60 

AATTCCTGCC TGGCTGCTGG TGACATCCTA GCCCTGGTTT TTGGCCTCCT TTTTGCTGTC 13 2 0 

ACCAGCGTCG CGTTCCTTGT GCAGATGAGA AGGCAGCACA GAAGGGGAAC CAAAGGGGGT 13 80 

GTGAGCTACC GCCCAGCAGA GGTAGCCGAG ACTGGAGCCT AGAGGCTGGA TCTTGGAGAA 1440 

TGTGAGAAGC CAGCCAGAGG CATCTGAGGG GGAGCCGGTA ACTGTCCTGT CCTGCTCATT 1500 
ATGCCACTTC CTTTTAACTG CCAAGAAATT TTTTAAAATA AATATTTATA AT 



Seq ID NO: 511 Protein sequence 
Protein Accession #: NP_001207.1 



1 11 21 31 41 51 

r r i i i i 

MAPLCPSPWL PLLIPAPAPG LTVQLLLSLL LLMPVHPQRL PRMQEDSPLG GGSSGEDDPL 60 

GEEDLPSEED SPREEDPPGE EDLPGEEDLP GEEDLPEVKP KSEEEGSLKL EDLPTVEAPG 12 0 

DPQEPQNNAH RDKEGDDQSH WRYGGDPPWP RVSPACAGRF QSPVDIRPQL AAFCPALRPL 180 

ELLGFQLPPL PELRLRNNGH SVQLTLPPGL EMALGPGREY RALQLHLHWG AAGRPGSEHT 240 

VEGHRFPAEI HWHLSTAFA RVDEALGRPG GLAVLAAFLE EGPEENSAYE QLLSRLEEIA 300 

EEGSETQVPG LD IS ALL PSD FSRYFQYEGS LTTPPCAQGV IWTVFNQTVM LSAKQLHTLS 360 

DTLWGPGDSR LQLNFRATQP LNGRVIEASF PAGVDSSPRA AEPVQLNSCL AAGDILALVF 420 

GLLFAVTSVA FLVQMRRQHR RGTKGGVSYR PAEVAETGA 



Seq ID NO: 512 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..3 978 

1 11 21 31 41 51 

1 I 1 I I 1 

ATGGTGGGTG AAGGACCCTA CCTTATCTCA GATCTGGACC AGCGAGGCCG GCGGAGATCC 60 

TTTGCAGAAA GATATGACCC CAGCCTGAAG ACCATGATCC CAGTGCGACC CTGTGCAAGG 120 

TTAGCACCCA ACCCGGTGGA TGATGCCGGG CTACTCTCCT TCGCCACATT TTCCTGGCTC 180 

ACGCCGGTGA TGGTGAAAGG CTACCGGCAA AGGCTGACCG TAGACACCCT GCCCCCATTG 240 

TCGACATATG ACTCATCTGA CACCAATGCC AAAAGATTTC GAGTCCTTTG GGATGAAGAG 300 

GTAGCAAGGG TGGGT CCTG A GAAGGCCTCT CTGAGCCACG TGGTGTGGAA ATTCCAGAGG 3 60 

ACACGCGTGT TGATGGACAT CGTGGCCAAC ATCCTGTGCA TCATCATGGC AGCCATAGGG 420 

CCGACAGTTC TCATTCACCA AATCCTCCAG CAGACTGAGA GGACCTCTGG GAAAGTCTGG 480 

GTTGGCATTG GACTGTGCAT AGCCCTTTTT GCCACCGAGT TTACCAAAGT CTTCTTTTGG 540 

GCCCTTGCCT GGGCCATCAA CTACCGCACG GCCATCCGGT TGAAGGTGGC GCTCTCCACC 600 

TTGGTTTTTG AAAACCTAGT GTCCTTCAAG ACATTGACCC ACATCTCTGT TGGCGAGGTG 660 

CTCAATATAC TGTCAAGTGA TAGCTATTCT TTGTTTGAAG CTGCCTTGTT TTGTCCTTTG 720 

CCAGCCACCA TCCCGATCCT AATGGTCTTT TGTGCGGCGT ACGCCTTTTT CATTCTGGGG 780 

CCCACAGCTC TCATCGGGAT ATCAGTGTAT GTCATATTCA TACCCGTCCA GATGTTTATG 840 

GCCAAGCTCA ATTCAGCTTT CCGAAGGTCA GCAATTTTGG TGACAGACAA GCGAGTTCAG 90 0 

ACAATGAATG AGTTTCTGAC CTGCATCAGG CTGATCAAAA TGTATGCCTG GGAGAAATCT 960 

TTTACCAACA CTATCCAAGA TATAAGAAGG AGGGAAAGAA AATTACTGGA AAAAGCTGGA 102 0 

TTTGTCCAAA QTGGAAACTC TGCCCTGGCC CCCATCGTGT CCACCATAGC CATCGTGCTG 1080 

AGATTATCCT GCCACATCCT CCTGAGACGC AAACTCACCG CACCCGTGGC ATTTAGTGTG 1140 

ATTGCCATGT TTAATGTAAT GAAGTTTTCC ATTGCAATCT TGCCCTTCTC CATCAAAGCA 120 0 

ATGGCTGAAG CGAATGTCTC TCTAAGGAGA ATGAAGAAAA TTCTCATAGA TAAAAGCCCC 12 60 

CCATCTTACA TCACCCAACC AGAAGACCCA GATACTGTCT TGCTTTTAGC AAATGCCACC 132 0 

TTGACATGGG AGCATGAAGC CAGCAGGAAA AGTACCCCAA AGAAATTGCA GAACCAGAAA 1380 

AGGCATTTAT GCAAGAAACA GAGGTCAGAG GCATACAGTG AGAGGAGTCC ACCAGCCAAG 1440 

GGAGCCACTG GCCCAGAGGA GCAAAGTGAC AGCCTCAAAT CGGTTCTGCA CAGCATAAGC 1500 

TTTGTGGTGA GAAAGTTATG TCGTTATCCC GAAGCCCAGC TCCTGGCTTG GAGGTGGCCA 1560 

GCAGTGTTTG TTGGGAGAAT CATCAGAGGA TACAGGCCTC ATGGATTTTC TGCTAAAGAC 1620 

AAGGATGAAT CTAGAAGGCT TCTTACTTGG CCCCAAGAAG TGGATAGGAC TCAAAGGGCA 168 0 

GCCAAATACC TGGGGAAGAT CTTGGGAATA TGTGGGAATG TGGGAAGTGG AAAGAGCTCC 1740 

CTCCTTGCAG CTCTCCTAGG ACAGATGCAG CTGCAGAAAG GGGTGGTGGC AGTCAATGGA 1800 

ACTTTGGCCT ACGTTTCACA GCAGGCATGG ATCTTTCATG GAAATGTGAG AGAAAACATA 1860 

CTCTTTGGAG AAAAGTATGA TCACCAAAGG TATCAGCACA CAGTCCGCGT CTGTGGCCTC 1920 

CAGAAGGACC TGAGCAACCT CCCCTATGGA GACCTGACTG AGATTGGGGA GCGGGGCCTC 1980 

AACCTCTCTG GGGGGCAGAG GCAGAGGATT AGCCTGGCCC GCGCTGTCTA CTCCGACCGT 2040 

CAGCTCTACC TGCTGGACGA CCCCCTGTCG GCCGTGGACG CCCACGTGGG GAAGCACGTC 2100 

TTTGAGGAGT GCATTAAGAA GACG CTCAGG GGAAAGACAG TCGTCCTGGT GACCCACCAG 2160 

CTACAGTTCT TAGAGTCTTG TGATGAAGTT ATTTTATTAG AAGATGGAGA GATTTGTGAA 2220 

AAGGGAACCC ACAAGGAGTT AATGGAGGAG AGAGGGCGCT ATGCAAAACT GATTCACAAC 2280 

CTGCGAGGAT TGCAGTTCAA GGATCCTGAA CACCTTTACA ATGCAGCAAT GGTGGAAGCC 2340 

TTCAAGGAGA GCCCTGCTGA GAGAGAGGAA GATGCTGGTA TAATCGGGTA CCTCCTTTCT 240 0 

CTCTTCACTG TGTTCCTCTT CCTCCTGATG ATTGGCAGCG CTGCCTTCAG CAACTGGTGG 2460 

CTGGGTCTCT GGT TGGACAA GGGCTCACGG ATGACCTGTG GGCCCCAGGG CAACAGGACC 252 0 

ATGTGTGAGG TCGGCGCGGT GCTGGCAGAC ATCGGTCAGC ATGTGTACCA GTGGGTGTAC 2580 

ACTGCAAGCA TGGTGTTCAT GCTGGTGTTT GGCGT CACCA AAGGCTTCGT CTTCACCAAG 2640 



380 



WO 02/086443 PCT/US02/12476 

ACCACACTGA TGGCATCCTC CTCTCTGCAT GACACGGTGT TTGATAAGAT CTTAAAGAGC 2700 

CCAATGAGTT TCTTTGACAC GACTCCCACT GGCAGGCTAA TGAACCGTTT TTCCAAGGAT 2760 

ATGGACGAGC TGGATGTGAG GCTGCCGTTT CACGCAGAGA ACTTTCTGCA GCAGTTTTTT 2820 

ATGGTGGTGT TTATTCTCGT GATCTTGGCT GCTGTGTTTC CTGCTGTCCT TTTAGTCGTG 2 880 

5 GCCAGCCTTG CTGTAGGCTT CTTCATTCTG TTACGCATTT TCCACAGAGG AGTCCAGGAG 2 940 

CTCAAGAAGG TGGAGAATGT CAGCCGGTCA CCCTGGTTCA CCCACATCAC CTCCTCCATG 3000 

CAGGGCCTGG G CATC ATT CA CGCCTATGGC AAGAAGGAGA GCTGCATCAC CTATACTTCA 3 0 60 

TCCAAAGGCC TGTCATTGTC AT AC AT CAT C CAGCTGAGCG GACTGCTCCA AGTGTGTGTG 3120 

CGAACGGGAA CAGAGACGCA AGCCAAATTC ACCTCCGTGG AGCTGCTCAG GGAATACATT 3180 

10 TCGACCTGTG TTCCTGAATG CACTCATCCC CTCAAAGTGG GGACCTGTCC CAAGGACTGG 3240 

CCCAGCTGTG GGGAG AT CAC CTTCAGAGAC TATCAGATGA GATACAGAGA CAACACCCCC 3 3 00 

CTTGTTCTCG ACAGCCTGAA CTTGAACATA CAAAGTGGGC AGACAGTCGG GATTGTTGGA 3360 

AGAACAGGTT CCGGAAAGTC ATCGTTAGGA ATGGCTTTGT TTCGTCTGGT GGAGCCAGCC 342 0 

AGTGGCACAA TCTTTATTGA TGAGGTGGAT ATCTGCATTC TCAGCTTGGA AGACCTCAGA 3480 

15 ACCAAGCTGA CTGTGATCCC ACAGGAT CCT GTCCTGTTTG TAGGTACAGT AAGGTACAAC 3540 

TTGGATCCCT TTGAGAGTCA CACCGATGAG ATGCTCTGGC AGGTTCTGGA GAGAACATTC 3600 

ATGAGAGACA CAATAATGAA ACT CCCAGAA AAATTACAGG CAGAAGTCAC AGAAAATGGA 3660 

GAAAACTTCT CAGTAGGGGA ACGTCAGCTG CTTTGTGTGG CCCGAGCTCT TCTCCGTAAT 3720 

TCAAAGATCA TTCTCCTTGA TGAAGCCACC GCCTCTATGG ACTCCAAGAC TGACACCCTG 3780 

20 GTTCAGAACA CCATCAAAGA TGCCTTCAAG GGCTGCACTG TGCTGACCAT CGCCCACCGC 3 840 

CTCAACACAG TTCT CAACTG CGAT CACGTC CTGGTTATGG AAAATGGGAA GGTGATTGAG 3 900 

TTTGACAAGC CTGAAGTCCT TGCAGAGAAG CCAGATT CTG CATTTGCGAT GTTACTAGCA 3 960 

GCAGAAGTCA GATTGTAG 

25 Seq ID NO: 513 Protein sequence 

Protein Accession #: Eos sequence 

1 11 21 31 41 51 

JU MVGEGPYLIS DLDQRGRRRS FAERYDPSLK TMIPVRPCAR LAPNPVDDAG LLSFATFSWL 60 

TPVMVKGYRQ RLTVDTLPPL STYDSSDTNA KRFRVLWDEE VARVGPEKAS LSHWWKFQR 120 

TRVLMD I VAN ILCIIMAAIG PTVLIHQILQ QTERTSGKVW VGIGLCIALF ATEFTKVFFW 180 

' ALAW A INYRT AIRLKVALST LVFENLVSFK TLTHISVGEV LNILSSDSYS LFEAALFCPL 240 

PATIPILMVF CAAYAFFILG PTALIGISVY VIFIPVQMFM AKLNSAFRRS AILVTDKRVQ 3 00 

35 TMNEFLTCIR LIKMYAWEKS FTNTIQDIRR RERKLLEKAG FVQSGNSALA PIVSTIAIVLi 360 

TLSCHILLRR KLTAPVAFSV IAMFNVMKFS IAILPFSIKA MAEANVSLRR MKKILIDKSP 420 

PSYITQPEDP DTVLLLANAT LTWEHEASRK STPKKLQNQK RHLCKKQRSE AYSERSPPAK 480 

GATGPEEQSD SLKSVLHSIS FWRKLCRYP EAQLLAWRWP AVFVGRIIRG YRPHGFSAKD 540 

KDESRRLLTW PQEVDRTQRA AKYLGKILGI CGNVGSGKSS LLAALLGQMQ LQKGWAVNG 60 0 

40 TLAYVSQQAW IFHGNVRENI LFGEKYDHQR YQHTVRVCGL QKDLSNLPYG DLTEIGERGL 660 

NLSGGQRQRI SLARAVYSDR QLYLLDDPLS AVDAHVGKHV FEECIKKTLR GKTWLVTHQ 72 0 

LQFLESCDEV ILLEDGEICE KGTHKELMEE RGRYAKLIHN LRGLQFKDPE HLYNAAMVEA 780 

FKESPAEREE DAGIIGYLLS LFTVFLFLLM IGSAAFSNWW LGLWLDKGSR MTCGPQGNRT 840 

MCEVGAVLAD IGQHVYQWVY TASMVFMLVF GVTKGFVFTK TTLMASSSLH DTVFDKILKS 900 

45 PMSFFDTTPT GRLMNRFSKD MDELDVRLPF HAENFLQQFF MWFILVILA AVFPAVLLW 960 

ASLAVGFFIL LRIFHRGVQE LKKVENVSRS PWFTHITSSM QGLGIIHAYG KKESCITYTS 1020 

SKGLSLSYII QLSGLLQVCV RTGTETQAKF TSVELLREYI STCVPECTHP LKVGTCPKDW 1080 

PSCGEITFRD YQMRYRDNTP LVLDSLNLNI QSGQTVGIVG RTGSGKS SLG MALFRLVEPA 1140 

SGTIFIDEVD ICILSLEDLR TKLTVIPQDP VLFVGTVRYN LDPFESHTDE MLWQVLERTF 1200 

50 MRDTIMKLPE KLQAEVTENG ENFSVGERQL LCVARALLRN SKIILLDEAT ASMDSKTDTL 1260 

VQNTIKDAFK GCTVLTIAHR LNTVLNCDHV LVMENGKVIE FDKPEVLAEK PDSAFAMLLA 1320 

AEVRL 

Seq ID NO: 514 DNA sequence 
55 Nucleic Acid Accession #: Z31560 
Coding sequence: 1-966 

1 11 21 31 41 51 

, n I I I I I I 

OU CACAGCGCCC GCATGTACAA CATGATGGAG ACGGAGCTGA AGCCGCCGGG CCCGCAGCAA 60 

ACTTCGGGGG GCGGCGGCGG CAACTCCACC GCGGCGGCGG CCGGCGGCAA CCAGAAAAAC 12 0 

AGCCCGGACC GCGTCAAGCG GCCCATGAAT GCCTTCATGG TGTGGTCCCG CGGGCAGCGG 180 

CGCAAGATGG CCCAGGAGAA CCCCAAGATG CACAACT CGG AGAT CAGCAA GCGCCTGGGC 240 

GCCGAGTGGA AACTTTTGTC GGAGACGGAG AAGCGGCCGT TCATCGACGA GGCTAAGCGG 3 00 

65 CTGCGAGCGC TGCACATGAA GGAGCACCCG GATTATAAAT ACCGGCCCCG GCGGAAAACC 3 60 

AAGACGCTCA TGAAGAAGGA TAAGTACACG CTGCCCGGCG GGCTGCTGGC CCCCGGCGGC 42 0 

AATAGCATGG CGAGCGGGGT CGGGGTGGGC GCCGGCCTGG GCGCGGGCGT GAACCAGCGC 480 

ATGGACAGTT ACGCGCACAT GAACGGCTGG AGCAACGGCA GCTACAGCAT GATGCAGGAC 540 

CAGCTGGGCT ACCCGCAGCA CCCGGGCCTC AATGCGCACG GCGCAGCGCA GATGCAGCCC 600 

70 ATGCACCGCT ACGACGTGAG CGCCCTGCAG TACAACTCCA TGACCAGCTC GCAGACCTAC 660 

ATGAACGGCT CGCCCACCTA CAGCATGTCC TACT CGCAGC AGGGCACCCC TGGCATGGCT 720 

CTTGGCTCCA TGGGTTCGGT GGT CAAGTCC GAGGCCAGCT CCAGCCCCCC TGTGGTTACC 780 

TCTTCCTCCC ACTCC AGGG C GCCCTGCCAG GCCGGGGACC TCCGGGACAT GATCAGCATG 840 

TATCTCCCCG GCGCCGAGGT GCCGGAACCC GCCGCCCCCA GCAGACTTCA CATGT CCCAG 90 0 

75 CACTACCAGA GCGGCCCGGT GCCCGGCACG GCCATTAACG GCACACTGCC CCTCTCACAC 960 

ATGTGAGGGC CGGACAGCGA ACTGGAGGGG GGAGAAATTT TCAAAGAAAA ACGAGGGAAA 1020 

TGGGAGGGGT GCAAAAGAGG AGAGTAAGAA ACAGCATGGA GAAAACCCGG TACGCTCAAA 1080 

AAAAA 

80 Seq ID NO: 515 Protein sequence 
Protein Accession #: CAA83435 

1 11 21 31 41 51 

Q , I 1 I I I I 

OJ HSARMYNMME TELKPPGPQQ TSGGGGGNST AAAAGGNQKN SPDRVKRPMN AFMVWSRGQR 60 

RKMAQENPKM HNSEISKRLG AEWKLLSETE KRPFIDEAKR LRALHMKEHP DYKYRPRRKT 120 

KTLMKKDKYT LPGGLLAPGG NSMASGVGVG AGLGAGVNQR MDSYAHMNGW SNGSYSMMQD 180 
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QLGYPQHPGL NAHGAAQMQP MHRYDVSALQ YNSMTSSQTY MNGSPTYSMS YSQQGTPGMA 
LGSMGSWKS EASSSPPWT SSSHSRAPCQ AGDLRDMISM YLPGAEVPEP AAPSRLHMSQ 
HYQSGPVPGT AINGTLPLSH M 

Seq ID NO: 516 DNA sequence 
Nucleic Acid Accession #: U91618 
Coding sequence: 29.. 541 



PCT/US02/12476 



CGGACTTGGC 
CATGCTACTC 
AGCATTAGAA 
TCCCTCTTGG 
AGCTGAGGAA 
TGCTTTAGAT 
TCACAGCAGG 
TGACAAAAAT 
GCTGTATGAG 
AGAGAATAAA 
ATTATATTTG 
ATTGAATGTG 
TCTTCAAAAA 



11 

1 

TTGTTAGAAG 
CTGGCTTTCA 
GCAGATTTCT 
AAGATGACTC 
ACAGGAGAAG 
GGCTTTAGCT 
GCTTTTCAAC 
GGAAAGGAAG 
AATAAACCCA 
TCATTTATTT 
TGTGAAAATG 
TTTTTCTGCA 
AAAAAAAAAA 



21 

I 

GCTGAAAGAT 
GCTCCTGGAG 
TGACCAATAT 
TGCTAAATGT 
TTCATGAAGA 
TGGAAGCAAT 
ACTGGGAGTT 
AAGT CAT AAA 
GAAGACCCTA 
ACATGTGATT 
TGACAAACAC 
CTAATAGAAA 
AAATGGGGCC 



31 
I 

GATGGCAGGA 
TCTGTGCTCA 
GCATACATCA 
TTGCAGTCTT 
GGAGCTTGTT 
GTTGACAATA 
AATCCAGGAA 
GAGAAAAATT 
CATACTCAAA 
GTG ATT CATC 
ACTTATCTGT 
TTAGACTAAG 
GCAATT 



41 
I 

ATGAAAATCC 
GATTCAGAAG 
AAGATTAGTA 
GTAAATAATT 
GCAAGAAGGA 
TACCAGCTCC 
GATATTCTTG 
CCTTATATTC 
AGAGATTCTT 
ATCCCTTAAT 
CTCTTCTACA 
TGTTTTCAAA 



51 
I 

AGCTTGTATG 
AGGAAATGAA 
AAGCACATGT 
TGAACAGCCC 
AACTTCCTAC 
ACAAAATCTG 
ATACTGGAAA 
TGAAACGGCA 
ACTATTACTG 
TAAATAT CAA 
ATTGTGGTTT 
TAAATCTAAA 



240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



Seq ID NO: 517 Protein sequence 
Protein Accession AAB50564 



21 



31 



51 



30 MMAGMKIQLV CMLLLAFSSW SLCSDSEEEM KALEADFLTN MHTSKISKAH VPSWKMTLLN 60 

VCSLVNNLNS PAEETGEVHE EELVARRKLP TALDGFSLEA MLTIYQLHKI CHSRAFQHWE 12 0 
LIQEDILDTG NDKNGKEEVI KRKI PYILKR QLYENKPRRP YILKRDSYYY 

Seq ID NO: 518 DNA sequence 
35 Nucleic Acid Accession #: NM_006536.2 
'Coding sequence: 10 9.. 2 940 



40 



45 



50 



55 



60 



65 



70 



75 



80 



85 



ACCTAAAACC 
ATGTATGCAG 
AGCATTGCAG 
GAACTCCCAT 
ATTGCAATTA 
ATAACTGAAG 
ATAAAGATTT 
TCATATGAAA 
TACACCCTAC 
TTCCTACTGA 
GAATGGGCCC 
ATAAATGGGC 
GTGTGTGAAA 
GGATGCACCT 
AGTTTATCTT 
CTACAGAACC 
TTTCACCACA 
GTACAGGCTG 
GCTGACAGAC 
AT T CAT ACCT 
CACCAAATTA 
TCAGCTAAAA 
AAACTGAATG 
CTTCTTGGCA 
CTGGGTTCAT 
TTCTTTGTTC 
TCTGGAACTG 
AAACCTCACC 
ATGTTTCTAG 
GGACGAAAAT 
TGGATTCCAG 
TCTCTGCAAG 
GCCACTGTGG 
TATGCCAATG 
GAGCCAGAGA 
GTTATAAAAA 
TATAGCTTGA 
CCAGGGAGTC 
GCTCCAAGGA 
AGCTCAGGAG 
CCACCATGCA 
TGGACAGCAC 
AGTAAAAGTC 
AAGCGAAATC 
ACGAATGGAC 
GCAATACGAG 
CCTCTGTTTA 
GGAGTTTTAA 



11 
I 

TTGCAAGTTC 
CAGGCT CAGT 
GTCCTATTTG 
TCCTGGGAGC 
ATCCTCAGGT 
CTTCATTTTA 
TAATACCTGC 
AGGCAAATGT 
AATACAGAGG 
ATGATAACTT 
ACCTCCGTTG 
AAAATCAAAT 
AAGGTCCTTG 
TTATCTACAA 
CTGTGGTTGA 
AGATGTGCAG 
GCTTTCCCAT 
GTGACAAAGT 
TCCTTCAACT 
TCGTGGGCAT 
ACAGCAATGA 
CAG AC AT CAG 
GAAAAGCTTA 
ATTGCTTACC 
CTGCAGCCCC 
CAGATATATC 
GAGACATTTT 
ATCAATTGAA 
TTACGTGGCA 
ACTACACAAA 
GAACAGCTAA 
CCCTGAAAGT 
AAGCCTTTGT 
TGAAACAGGG 
CTGGAGATCC 
ATGATGGAAT 
AAGTG CATGT 
ATGCTATGTA 
AATCAGTAGG 
GCTCCTTTTC 
AAATTATTGA 
CTGGAGAAGA 
TACAGAATAT 
CTCAGCAAGC 
CTGAACATCA 
CAATGGATAG 
TTCCCCCCAA 
CAGCAATGGG 



21 
I 

AGGAAGAAAC 
GTGAGTGAAC 
CAACCTGAAG 
TGGAGTACAG 
ACCTGAGAAT 
CCTATTTAAT 
CACATGGAAA 
CATAGTGACT 
GTGTGGAAAA 
AACAGCTGGC 
GGGTGTGTTC 
TAAAGTGACA 
CCCCCAAGAA 
TAGCACCCAA 
ATTTTGTAAT 
CCTCAGAAGT 
GAATGGGACT 
GGTCTGTTTA 
ACAACAAGCC 
TGCCAGTTTC 
TG AT CGAAAG 
CATTTGTTCA 
TGGCTCTGTG 
CACTGTGCTC 
AAAT CTGG AG 
AAACTCCAAT 
C CAG CAA CAT 
AAACACAGTG 
GGCCAGTGGT 
TAATTTTATC 
GCCTGGGCAC 
GACAGTGACC 
GGAAAGAGAC 
ATTTTATCCC 
TGTTACGCTG 
TTACTCGAGG 
CAATCACTCT 
TGTACCAGGT 
CAGAAATGAG 
AGTGCTGGGA 
CCTGGAAGCT 
CTTTGATCAG 
CCAAGATGAC 
TGGCATCAGG 
GCCAAATGGA 
GAACTCCTTA 
TTCTGATCCT 
TTTGATAGGA 



31 
I 

CATCTGCATC 
TGGAGGCTTC 
TTTGTGACTC 
CTTCAAGACA 
CAGAACCTCA 
GCTACCAAGA 
GCTAATAATA 
GACTGGTATG 
GAGGGAAAAT 
TACGGATCAC 
GATGAGTATA 
AGGTGTT CAT 
AACTGTATTA 
AATG CAACTG 
GCAAGTACCC 
GCATGGGATG 
GAGCTTCCAC 
GTGCTGGATG 
GCAGAATTTT 
G AC AG C AAAG 
TTGCTGGTTT 
GGGCTTAAGA 
ATGATATTAG 
AGCAGTGGTT 
GAAT TATCAC 
AGCATGATTG 
ATTCAGCTTG 
ACTGTGGATA 
CCTCCTGAGA 
ACCAATCTAA 
TGGACTTACA 
TCTCGCGCCT 
AGCCTCCATT 
ATTCTTAATG 
AGACTCCTTG 
TATTTTTTCT 
CCCAGCATAA 
TACACAGCAA 
GAGGAGCGAA 
GTTCCAGCTG 
GTAAAAGTAG 
GGCCAGGCTA 
TTTAACAATG 
GAGATATTTA 
GAAACACATG 
CAGTCTGCTG 
GTACCTGCCA 
ATCATTTGCC 



41 
I 

CATATTGAAA 
TCTACAACAT 
TCCTGGTTGC 
ATGGGTATAA 
TCTCAAACAT 
GAAGAGTATT 
ACAGCAAAAT 
GGGCACATGG 
ACATTCATTT 
GAGGCCGAGT 
ACAATGACAA 
CTGACATCAC 
TTAGTAAGCT 
CATCAATAAT 
ACAACCAAGA 
TAATCACAGA 
CTCCTCCCAC 
TGT CCAGCAA 
ATTTGATGCA 
GAGAGATCAG 
CATATCTGCC 
AAGGATTTGA 
TGACCAGCGG 
CAACAATTCA 
GTCTTACAGG 
ATGCTTTCAG 
AAAGTACAGG 
ATACTGTGGG 
TTATATTATT 
CTTTTCGGAC 
CCCTGAACAA 
CCAACTCAGC 
TTCCTCATCC 
CCACTGTCAC 
ATGATGGAGC 
CCTTTGCTGC 
GCACCCCAGC 
ACGGTAATAT 
AGTGGGGCTT 
GCCCCCACCC 
AAGAGGAATT 
CAAGCTATGA 
CTATTTTAGT 
CGTTCTCACC 
AAAGCCACAG 
TATCTAACAT 
GAGATTATCT 
TTATTATAGT 



51 
I 

ACCTGACACA 
GACCCAAAGG 
CTTAAGTTCA 
TGGATTGCTC 
TAAGGAAATG 
TTTCAGAAAT 
AAAACAAGAA 
AGATGATCCA 
CACACCTAAT 
GTTTGTCCAT 
ACCTTTCTAC 
AGGCATTTTT 
TTTTAAAGAA 
GTTCATGCAA 
AGCACCAAAC 
CTCTGCTGAC 
ATTCTCGCTT 
GATGGCAGAG 
GATTGTTGAA 
AGCCCAGCTA 
CACCACTGTA 
GGTGGTTGAA 
AGATGATAAG 
CTCCATTGCC 
AGGTTTAAAG 
TAGAATTTCC 
TGAAAATGTC 
CAACGACACT 
TGATCCTGAT 
AGCTAGTCTT 
TACCCATCAT 
TGTGCCCCCA 
TGTGATGATT 
TGCCACAGTT 
AGGTGCTGAT 
AAATGGTAGA 
CCACTCTATT 
TCAGATGAAT 
TAGCCGAGTC 
TGATGTGTTT 
GACCCTATCT 
AATAAGAATG 
AAAT ACAT CA 
CCAGATTTCC 
AATTTATGTT 
TGCCCAGGCG 
TATATTGAAA 
TGTGACACAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 



382 
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WO 02/086443 

CATACTTTAA GCAGGAAAAA GAGAGCAGAC AAGAAAGAGA ATGGAACAAA ATTATTATAA 
ATAAATATCC AAAGTGTCTT CCTT CTTAGA TATAAGACCC ATGGCCTTCG ACTACAAAAA 
CATACTAACA AAGTCAAATT AACATCAAAA CTGTATTAAA ATGCATTGAG TTTTTGTACA 
ATACAGATAA GATTTTTACA TGGT AG AT CA ACAATTCTTT TTGGGGGTAG ATTAGAAAAC 
CCTTACACTT TGGCTATGAA CAAATAATAA AAATTATTCT TTAAAGTAAT GTCTTTAAAG 
GCAAAGGGAA GGGTAAAGTC GGACCAGTGT CAAGGAAAGT TTGTTTTATT GAGGTGGAAA 
AATAGCCCCA AGCAGAGAAA AGGAGGGTAG GTCTGCATTA TAACTGTCTG TGTGAAGCAA 
TCATTTAGTT ACTTTGATTA ATTTTTCTTT TCTCCTTATC TGTGCAGTAC AGGTTGCTTG 
TTTACATGAA GATCATGCTA TATTTTATAT ATGTAGCCCC TAATGCAAAG CTCTTTACCT 
CTTGCTATTT TGTTATATAT ATTT CAGATG ACATCTCCCT GCTAATGCTC AGAGATCTTT 
TTTCACTGTA AGAGGTAACC TTTAACAATA TGGGTATTAC CTTTGTCTCT TCATACCGGT 
TTTATGACAA AGGTCTATTG AATTTATTTG TNTGTAAGTT TCTACTCCCA TCAAAGCAGC 
TTTCTAAGTT TATTGCCTTG GGTTATTATG GAATGATAGT TATAGCCCCN TATAATGCCT 
T AC CTAGGAA A 

Seq ID NO: 519 Protein sequence 
Protein Accession #: NP__006527 . 1 
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2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 



I 

MTQRSIAGPI 
IKEMITEASF 
GDDPYTLQYR 
KPFYINGQNQ 
MFMQSLSSW 
TFSLVQAGDK 
RAQLHQINSN 
GDDKL.LGNCL 
SRISSGTGDI 
FDPDGRKYYT 
AVPPATVEAF 
AGADVI KNDG 
IQMNAPRKSV 
LTLSWTAPGE 
PQISTNGPEH 
IilLKGVLTAM 



11 
1 

CNLKFVTLLV 
YLFNATKRRV 
GCGKEGKYIH 
IKVTRCSSDI 
EFCNASTHNQ 
WCLVLDVSS 
DDRKLLVSYL 
PTVLSSGSTI 
FQQHIQLEST 
NNFITNIjTFR 
VERDSLHFPH 
IYSRYFFSFA 
GRNEEERKWG 
DFDQGQATSY 
QPNGETHESH 
GLIGIICLII 



21 
I 

ALSSELPFLG 
FFRNIKILIP 
FTPNFLLNDN 
TGIFVCEKGP 
EAPNLQNQMC 
KMAEADRLLQ 
PTTVSAKTDI 
HSIALGSSAA 
GENVKPHHQL 
TASLWIPGTA 
PVMIYANVKQ 
ANGRYSLKVH 
FSRVSSGGSF 
EIRMSKSLQN 
RIYVAIRAMD 
WTHHTLSRK 



31 

I 

AGVQLQDNGY 
AT WKANNNS K 
LTAGYGSRGR 
CPQENCIISK 
SLRSAWDVIT 
LQQAAEFYLM 
SICSGLKKGF 
PNIaEELSRLT 
KNTVTVDNTV 
KPGHWTYTLN 
GFYPILNATV 
VNHSPSISTP 
SVLGVPAGPH 
IQDDFNNAIL 
RNSLQSAVSN 
KRADKKENGT 



41 
I 

NGLLIAINPQ 
IKQESYEKAN 
VFVHEWAHLR 
LFKEGCTFIY 
DSADFHHSFP 
QIVEIHTFVG 
EWEKLNGKA 
GGLKFFVPDI 
GNDTMFLVTW 
NTHHSLQALK 
TATVEPETGD 
AHSIPGSHAM 
PDVFPPCKII 
VNTSKRNPQQ 
IAQAPLFIPP 
KLL 



51 
I 

VPENQNLISN 
VIVTDWYGAH 
WGVFDEYNND 
NSTQNATASI 
MNGTELPPPP 
IASFDSKGEI 
YGSVMILVTS 
SNSNSMIDAF 
QASGPPEIIL 
VTVTSRASNS 
PVTLRLLDDG 
YVPGYTANGN 
DLEAVKVEEE 
AGIREIFTFS 
NSDPVPARDY 



Seq ID NO: 520 DNA sequence 

Nucleic Acid Accession NM_000228.1 

Coding sequence: 82.. 3600 



GCTTTCAGGC 
GGATCACCCC 
CTCCTGCATG 
CTTGTTGGGA 
ACCTACTGCA 
CCTCACAACT 
TGGTGGCAGT 
TTCCAGCTTC 
GAGCGCTCCT 
ACCTCCACCT 
CAGTCCCTGC 
ATGGATTTAG 
ATCACAAACT 
CCTCCCAGCG 
GGCCATGCTG 
CAGGTCCACG 
GCACCCTTCT 
CAAAGGTGCG 
GCCAGCCAGG 
AACTGTGAGC 
GAGACCTGCA 
CCAGTGACCG 
AAGCCGGGCT 
AACATCCTGG 
CTGCCCAACG 
AGTGGCCAGG 
CAACCAGTTC 
GCAGCCATCC 
TGTGACTGTG 
CTCTGCCGCC 
CGCTACCCGG 
GAGCAGGCCC 
GGGCTGGAGG 
ATCCGAGCAG 
GCCATCCTCT 
GAGACGTTGT 
ACTATGTATC 
GCCTTCCGGA 
GACAGCTCGC 
CGGCAGGCGG 
ATGTCTTCGT 
ATGGCTTGCA 
TGTGGCTCCC 
CAGGTGGCTG 



11 

I 

GATCTGGAGA 
ATTGG CTGAA 
CCCAACAAGC 
GGACCCGGTT 
CCCAGTATGG 
ACTACAGTCA 
CCCAGAATGA 
AAGAAGT CAT 
CAGACTT CGG 
TCCCTCGGGT 
CTCAGAGGCC 
TGTCTGGGAT 
TGAGAGTCAA 
CCTACTATGC 
ATCGCTGCGC 
ATGTCTGTGT 
ACAACAACCG 
ACTGCAATGG 
GGGCATATGG 
GGTGTCAGCT 
TCTCCTGCGA 
GGCAGTGTGT 
TCACTGGACT 
GGTCCCGGAG 
TGGTGGGTCC 
GCTGTGAACC 
ACAGGGCAGT 
GCCAGTGTCC 
ATTTCCGGGG 
CTGGCTTGAC 
TGTGCGTGGC 
TGCGCTTTGG 
ACCGTGGCCT 
TTCTCAGCAG 
CCCTCAGGCG 
CCCTTCCGAG 
AGAGGAAGAG 
TGCTGAGCAC 
GCCTTTTGGA 
GAGGAGGAGG 
TGCCTGACCT 
CCCCAATATC 
GCTGCAGGGG 
AGCAGCTGCG 



21 
I 

AAGAACGGCA 
GATGAGACCA 
CTGCTCCCGT 
TCTCCGAGCT 
CGAGTGGCAG 
CCGAGTAGAG 
TGTGAACCCT 
GATGGAGTTC 
TAAGACCTGG 
CCGCCAGGGT 
TAATGCACGC 
TCCAGCAACT 
TTTCACCAGG 
TGTGTCCCAG 
ACCCAAGCCT 
CTGCCAGCAC 
GCCCTGGAGA 
GCACTCAGAG 
AGGTGTGTGT 
GCACTATTTC 
GTGTGATCCG 
GTGCAAGGAG 
CACCTACGCC 
GGACATGCCG 
CAAATGTGAC 
GTGTGCCTGC 
GCCCTGTCGG 
AGACCGGACC 
AACAGAGGGC 
CGGGCCCCGC 
CTGCCACCCT 
TAGACTCCGC 
GGCCTCCCGG 
CCCCGCAGTC 
AACTCTCCAG 
AGACCTGGAG 
GGAGCAGTTT 
AGCCTACGAG 
CCAGCTCAGG 
AGGCACCGGC 
GACACCCACC 
ATGCCCTGGT 
TGTCCTTCCC 
GGGCTTCAAT 



31 
I 

GAACACACAG 
TTCTTCCTCT 
GGGGCCTGCT 
TCATCTACCT 
ATGAAATGCT 
AATGTGGCTT 
GTCTCTCTGC 
CAGGGGCCCA 
CGAGTGTACC 
CGGCCTCAGA 
CTAAATGGGG 
CAAAGTCAAA 
CTGGCCCCTG 
CTCCGTCTGC 
GGGGCCTCTG 
AACACTGCCG 
CCGGCGGAGG 
ACATGTCACT 
GACAATTGCC 
CGGAACCGGC 
GATGGGGCAG 
CATGTGCAGG 
AACCCGCAGG 
TGTGACGAGG 
CAGTGTGCTC 
GACCCGCACA 
GAAGGCTTTG 
TATGGAGACG 
CCGGGCTGCG 
TGTGACCAGT 
TGCTTCCAGA 
AATGCCACCG 
ATCCTAGATG 
ACAGAGCAGG 
GGCCTGCAGC 
AGTCTTGACA 
GAAAAAATAA 
CAGTCAGCCC 
GACAGCCGGA 
AGCCCCAAGC 
TTCAACAAGC 
GAGCTATGTC 
AGGGCCGGTG 
GCCCAGCTCC 



41 

i 

CAAGGAAAGG 
TGTGTTTTGC 
ATCCACCTGT 
GTGGACTGAC 
GCAAGTGTGA 
CATCCTCCGG 
AGCTGGACCT 
TGCCCGCCGG 
AGTACCTGGC 
GCTGGCAGGA 
GGAAGGTCCA 
AAATTCAAGA 
TGCCCCAAAG 
AGGGGAGCTG 
CAGGCCCCTC 
GCCCAAATTG 
GCCAGGACGC 
TTGACCCCGC 
GGGACCACAC 
GCCCGGGAGC 
TGCCAGGGGC 
GAGAGCGCTG 
GCTGCCACCG 
AGAGTGGGCG 
CCTACCACTG 
ACTCCCCTCA 
GTGGCCTGAT 
TGGCCACAGG 
ACAAGGCATC 
GCCAGCGAGG 
CCTATGATGC 
CCAGCCTGTG 
CAAAGAGTAA 
AGGTGGCTCA 
TGGATCTGCC 
GAAGCTTCAA 
GCAGTGCTGA 
AGGCTGCTCA 
GAGAGGCAGA 
TTGTGGCCCT 
TCTGTGGCAA 
CCCAAGACAA 
GGGCCTTCTT 
AGCGGACCAG 



51 
I 

TCCTTTCTGG 
CCTGCCTGGC 
TGGGGACCTG 
CAAGCCTGAG 
CTCCAGGCAG 
CCCCATGCGC 
GGACAGGAGA 
CATGCTGATT 
TGCCGACTGC 
TGTTCGGTGC 
ACTTAACCTT 
GGTGGGGGAG 
GGGCTACCAC 
CTTCTGTCAC 
CACCGCTGTG 
TGAGCGCTGT 
CCATGAATGC 
TGTGTTTGCC 
CGAAGGCAAG 
TTCCATTCAG 
TCCCTGTGAC 
TGACCTATGC 
CTGTGACTGC 
CTGCCTTTGT 
GAAGCTGGCC 
GCCCACAGTG 
GTGCAGCGCT 
ATGCCGAGCC 
AGGCCGCTGC 
CTACTGCAAT 
GGACCTCCGG 
GTCAGGGCCT 
GATTGAGCAG 
GGTGGCCAGT 
CCTGGAGGAG 
TGGTCTCCTT 
TCCTTCAGGA 
GCAGGTCTCC 
GAGGCTGGTG 
GAGGCTGGAG 
CTCCAGGCAG 
TGGCACAGCC 
GATGGCGGGG 
GCAGATGATT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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720 
780 
840 
900 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 



383 



10 



15 



20 



25 



WO 02/086443 

AGGGCAGCCG AGGAATCTGC CTCACAGATT CAATCCAGTG CCCAGCGCTT GGAGACCCAG 2 700 

GTGAGCGCCA GCCGCTCCCA GATGGAGGAA GATGTCAGAC GCACACGGCT CCTAAT CCAG 2 760 

CAGGTCCGGG ACTTCCTAAC AGACCCCGAC ACTGATGCAG CCACTATCCA GGAGGT CAGC 2820 

GAGGCCGTGC TGGCCCTGTG GCTGCCCACA GACTCAGCTA CTGTTCTGCA GAAGATGAAT 2 880 

GAGATCCAGG CCATTGCAGC CAGGCTCCCC AACGTGGACT TGGTGCTGTC CCAGACCAAG 2 940 

CAGGACATTG CGCGTGCCCG CCGGTTGCAG GCTGAGGCTG AGGAAGCCAG GAGCCGAGCC 3000 

CATGCAGTGG AGGGCCAGGT GGAAGATGTG GTTGGGAACC TGCGGCAGGG GACAGTGGCA 3060 

CTGCAGGAAG CTCAGGACAC CATGCAAGGC ACCAGCCGCT CCCTTCGGCT TATCCAGGAC 3120 

AGGGTTGCTG AGGTTCAGCA GGTACTGCGG CCAGCAGAAA AGCTGGTGAC AAGCATGACC 3180 

AAG CAGCTGG GTGACTTCTG GACACGGATG GAGGAGCTCC GCCACCAAGC CCGG CAGCAG 3240 

GGGGCAGAGG CAGTCCAGGC CCAGCAGCTT GCGGAAGGTG CCAGCGAGCA GGCATTGAGT 3300 

GCCCAAGAGG GATTTGAGAG AATAAAACAA AAGTATGCTG AGTTGAAGGA CCGGTTGGGT 3 360 

CAGAGTT CCA TGCTGGGTGA GCAGGGTGCC CGGATCCAGA GTGTGAAGAC AGAGGCAGAG 3420 

GAGCTGTTTG GGGAGACCAT GGAGATGATG GACAGGATGA AAGACATGGA GTTGGAGCTG 3480 

CTGCGGGGCA GCCAGGCCAT CATGCTGCGC TCGGCGGACC TGACAGGACT GGAGAAGCGT 3 540 

GTGGAGCAGA TCCGTGACCA CATCAATGGG CGCGTGCTCT ACTATGCCAC CTGCAAGTGA 3 600 

TGCTACAGCT TCCAGCCCGT TGCCCCACTC ATCTGCCGCC TTTGCTTTTG GTTGGGGGCA 3 660 

GATTGGGTTG GAATGCTTTC CATCTCCAGG AGACTTTCAT GCAGCCTAAA GTACAGCCTG 3720 

GACCACCCCT GGTGTGTAGC TAGTAAGATT ACCCTGAGCT GCAGCTGAGC CTGAGCCAAT 3780 

GGGACAGTTA CACTTGACAG ACAAAGATGG TGGAGATTGG CATGCCATTG AAACTAAGAG 3 840 

CTCTCAAGTC AAGGAAG CTG GGCTGGGCAG TATCCCCCGC CTTTAGTTCT CCACTGGGGA 3 900 

GGAATCCTGG ACCAAGCACA AAAACTTAAC AAAAGTGATG TAAAAATGAA AAGCCAAATA 3 960 
AAAATCTTTG G 

Seq ID NO: 521 Protein sequence 
Protein Accession #: NP 000219.1 
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MRPFFLLCFA 
EWQMKCCKCD 
MEFQGPMPAG 
NARLNGGKVQ 
VSQLRLQGSC 
PWRPAEGQDA 
HYFRNRRPGA 
TYANPQGCHR 
CACDPHNSPQ 
TEGPGCDKAS 
RLRNATASLW 
TLQGLQLDLP 
AYEQSAQAAQ 
TPTFNKLCGN 
GFNAQLQRTR 
DPDTDAATIQ 
RLQAEAEEAR 
VLRPAEKLVT 
IKQKYAELKD 
MLRSADLTGL 



11 

I 

LPGLLHAQQA 
SRQPHNYYSH 
MLIERSSDFG 
LNLMDLVSGI 
FCHGHADRCA 
HECQRCDCNG 
SIQETCISCE 
CDCNILGSRR 
PTVQPVHRAV 
GRCLCRPGLT 
SGPGLEDRGL 
IiEEETLSLPR 
QVSDSSRLLD 
SRQMACTPIS 
QMIRAAEESA 
EVSEAVLALW 
SRAHAVEGQV 
SMTKQLGDFW 
RLGQSSMLGE 
EKRVEQ I RDH 



21 
I 

CSRGACYPPV 
RVENVASSSG 
KTWRVYQYLA 
PATQSQKIQE 
PKPGASAGPS 
HSETCHFDPA 
CDPDGAVPGA 
DMPCDEESGR 
PCREGFGGLM 
GPRCDQCQRG 
ASRILDAKSK 
DLESLDRSFN 
QLRDSRREAE 
CPGELCPQDN 
SQIQSSAQRL 
LPTDSATVLQ 
EDWGNLRQG 
TRMEELRHQA 
QGARIQSVKT 
I NGRVL Y YAT 



31 
I 

GDLLVGRTRF 
PMRWWQSQND 
ADCTSTFPRV 
VGEITNLRVN 
TAVQVHDVCV 
VFAASQGAYG 
PCDPVTGQCV 
CLCLPNWGP 
CSAAAIRQCP 
YCNRYPVCVA 
IEQIRAVLSS 
GLLTMYQRKR 
RLVRQAGGGG 
GTACGSRCRG 
ETQVSASRSQ 
KMNEIQAIAA 
TVALQEAQDT 
RQQGAEAVQA 
EAEELFGETM 
CK 



41 
I 

LRASSTCGLT 
VNPVSLQLDL 
RQGRPQSWQD 
FTRLAPVPQR 
CQHNTAGPNC 
GVCDNCRDHT 
CKEHVQGERC 
KCDQCAPYHW 
DRTYGDVATG 
CHPCFQTYDA 
PAVTEQEVAQ 
EQFEKISSAD 
GTGSPKLVAL 
VLPRAGGAFL 
MEEDVRRTRL 
RLPNVDLVLS 
MQGTSRSLRL 
QQLAEGASEQ 
EMMDRMKDME 



51 

KPETYCTQYG 
DRRFQLQEVM 
VRCQSLPQRP 
GYHPPSAYYA 
ERCAPFYNNR 
EGKNCERCQL 
DLCKPGFTGL 
KLASGQGCEP 
CRACDCDFRG 
DLREQALRFG 
VASAILSLRR 
PSGAFRMLST 
RLEMSSLPDL 
MAGQVAEQLR 
LIQQVRDFLT 
QTKQDIARAR 
IQDRVAEVQQ 
ALSAQEGFER 
LELLRGSQAI 



Seq ID NO: 522 DNA sequence 
Nucleic Acid Accession #: NM_001944. 
Coding sequence: 84.. 3083 



TTTTCTTAGA 
TTTCACCAGG 
CCATCTTCGT 
ATGATGAAGA 
AATTTGCCAA 
TTACTTCAGA 
ATCAGCCGCC 
CTATAGT CG A 
AAGGACTAGA 
ATCCTCCAGT 
ACT CACTGGT 
AAATTGCCTT 
GAAACACTGG 
ATCGTCTGGT 
GTAATATTAA 
CAG CACGT AT 
TGGATGAAGA 
GAAATTGGTT 
AGGCTCTAGA 
CTGAATTTCA 
AGGTAATAAA 
AAAAAGGCAT 
ATGAGGACAC 
GATACCTAAT 
ATTCTACTTT 
CGGGTAAAAC 
CAACAGCTGT 
CTAGAACACT 
TAAAGTTGCC 
CCCAGGAACA 
ACAATCGGTG 



11 

I 

CATTAACTGC 
GAAATCAGAG 
GGTGGTCATA 
AGAGATGACT 
ACCCTGCAGA 
TTACCAAGCA 
TTTTGGAATC 
CCGGGAGGAA 
TGTAGAGAAA 
ATTTTCACAA 
GATGATACTA 
CAAAATTGTC 
GGAAGTCCGT 
TGTGAGTGGT 
AGTGAAAGAT 
TGAAGAAAAT 
GTACACAGAT 
TGAAATACAA 
TTATGAACAA 
CCAATCAGTT 
TGTAAGAGAA 
AAGTAGCAAA 
TAACAAAGCT 
GATTGATTCA 
CATAGTTAAC 
TTCTACAGGC 
CCTCGAAAAA 
GAATAATAGA 
TGCCGTATGG 
GATACCTCCT 
TGAGATGCCA 



21 
I 

AGACGGCTGG 
ACAATGATGG 
TTGGTTCATG 
ATGCAACAAG 
GAAGGAGAAG 
ACCCAGAAAA 
TTTGTTGTTG 
ACTCCAAGCT 
CC AC T TAT AC 
CAAATTT TCA 
AATGCCACAG 
TCTCAGGAAC 
ACTTTGACCA 
G CAG ACAAAG 
GTCAACGATA 
ATTTTAAGTT 
AATTGGCTTG 
ACTGATCCTA 
CTACAAAGCG 
ATCTCTCGAT 
GGAATTGCAT 
AAATTGGTGG 
GCCTCAAATG 
AAAACTGCTG 
AAAACAATCA 
ACGGTATATG 
GATGCAGTTT 
TACACTGGCC 
AGTATCACAA 
GGAGTATACC 
CGCAGCTTGA 



31 
I 

CAGGATAGAA 
GGCTCTTCCC 
GAGAATTGCG 
CTAAAAGAAG 
ATAACTCAAA 
TCACCTACCG 
ACAAAAACAC 
TCCTGATCAC 
TAACGGTTAA 
TGGGTGAAAT 
ATGCAGATGA 
CAGCAGGCAC 
ATTCTCTTGA 
ATGGAGAAGG 
ACTTCCCAAT 
CTGAATTACT 
CAGTATATTT 
GAACTAATGA 
TGAAACTTAG 
ACCGAGTTCA 
TCCGTCCTGC 
ATTATATCCT 
TCAAATATGT 
AAATCAAATT 
CAGCTGAGGT 
TTAGAGTACC 
GCAGTTCTTC 
CCTATACATT 
CCCTCAATGC 
ACATCTCCCT 
CACTGGAAGT 



41 
I 

GCAGCGGCTC 
CAGAACTACA 
AATAGAGACT 
GCAAAAACGT 
AAGAAACCCA 
AATCTCTGGA 
TGGAGATATT 
ATGTCGGGCT 
AATTTTGGAT 
TGAAGAAAAT 
ACCAAACCAC 
ACCCATGTTC 
CCGAGAGCAA 
ACTATCAACT 
GTTTAGAGAC 
TCGATTTCAA 
CTTTACCTCT 
AGGCATCCTG 
TATTGCTGTC 
GTCAACCCCA 
TTCCAAGACA 
GGGAACATAT 
CATGGGACGT 
TGTCAAAAAT 
TCTGGCCATA 
CGATTTCAAT 
ACCTTCCGTG 
TGCACTGGAA 
TACCTCGGCC 
GGTACTTACA 
CTGTCAGTGT 



51 
I 

ACTTGGACTT 

GGGGCTCTGG 

AAAGGTCAAT 

GAATGGGTGA 

ATTGCCAAGA 

GTGGGAATCG 

AACATAACAG 

CTAAATGCCC 

ATTAATGATA 

AGTGCCTCAA 

TTGAATTCTA 

CTCCTAAGCA 

GCTAGCAGCT 

CAATGTGAAT 

TCTCAGTATT 

GTAACAGATT 

GGGAATGAAG 

AAAGTGGTGA 

AAAAACAAAG 

GTCACAATTC 

TTTACTGTGC 

CAAGCCATCG 

AACGATGGTG 

ATGAACCGAG 

GATGAATACA 

GACAATTGTC . 

GTTGTCTCCG 

GATCAACCTG 

CTCCTCAGAG 

GACAGTCAGA 

GACAACAGGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



60 
- 120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
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GCATCTGTGG AACTTCTTAC CCAACCACAA GCCCTGGGAC CAGGTATGGC AGGCCGCACT 192 0 

CAGGGAGGCT GGGGCCTGCC GCCATCGGCC TGCTGCTCCT TGGTCTCCTG CTGCTGCTGT 1980 

TGGCCCCCCT TCTGCTGTTG ACCTGTGACT GTGGGGCAGG TTCTACTGGG GGAGTGACAG 2 040 

GTGGTTTTAT CCCAGTTCCT GATGGCTCAG AAGGAACAAT TCATCAGTGG GGAATTGAAG 2100 

GAGCCCATCC TGAAGACAAG GAAATCACAA ATATTTGTGT GCCTCCTGTA ACAGCCAATG 2160 

GAGCCGATTT CATGGAAAGT TCTGAAGTTT GTACAAATAC GTATGCCAGA GGCACAGCGG 222 0 

TGGAAGGCAC TTCAGGAATG GAAATGACCA CTAAGCTTGG AGCAGCCACT GAATCTGGAG 2280 

GTGCTGCAGG CTTTGCAACA GGGACAGTGT CAGGAGCTGC TTCAGGATTC GGAGCAGCCA 2340 

CTGGAGTTGG CAT CTGTTCC TCAGGGCAGT CTGGAACCAT GAGAACAAGG CATTCCACTG 2400 

GAGGAACCAA TAAGGACTAC GCTGATGGGG CGATAAGCAT GAATTTTCTG GACTCCTACT 2460 

TTTCTCAGAA AGCATTTGCC TGTGCGGAGG AAGACGATGG CCAGGAAGCA AATGACTGCT 2520 

TGTTGATCTA TGATAATGAA GGCGCAGATG CCACTGGTTC TCCTGTGGGC TCCGTGGGTT 2580 

GTTGCAGTTT TATTGCTGAT GACCTGGATG ACAGCTTCTT GGACTCACTT GGACCCAAAT 2640 

TTAAAAAACT TGCAGAGATA AGCCTTGGTG TTGATGGTGA AGGCAAAGAA GTTCAGCCAC 2 700 

CCTCTAAAGA CAGCGGTTAT GGGATTGAAT CCTGTGGCCA TCCCATAGAA GTCCAGCAGA 2 760 

CAGGATTTGT TAAGTGCCAG ACTTTGTCAG GAAGTCAAGG AGCTTCTGCT TTGT CCGCCT 2 820 

CTGGGTCTGT CCAGCCAGCT GTTTCCATCC CTGACCCTCT GCAGCATGGT AACTATTTAG 2880 

TAACGGAGAC TTACTCGGCT TCTGGTTCCC TCGTGCAACC TTCCACTGCA GGCTTTGATC 2 940 

CACTTCTCAC ACAAAATGTG ATAGTGACAG AAAGGGTGAT CTGTCCGATT TCCAGTGTTC 3 000 

CTGGCAACCT AGCTGGCCCA ACGCAGCTAC GAGGGTCACA TACTATGCTC TGTACAGAGG 3 0 60 

ATCCTTGCTC CCGTCTAATA TGACCAGAAT GAGCTGGAAT ACCACACTGA CCAAATCTGG 3120 

ATCTTTGGAC TAAAGTATTC AAAATAGCAT AGCAAAGCTC ACTGTATTGG GCTAATAATT 3180 

TGGCACTTAT TAGCTTCTCT CATAAACTGA TCACGATTAT AAATTAAATG TTTGGGTTCA 3240 

TACCCCAAAA GCAATATGTT GTCACTCCTA ATTCTCAAGT ACTATTCAAA TTGTAGTAAA 3 3 00 
TCTTAAAGTT TTTCAAAACC CTAAAATCAT ATTCGC 

Seq ID NO: 52 3 Protein sequence 
Protein Accession #: NP_001935.1 

1 11 21 31 41 51 

I i I I I I 

MMGLFPRTTG ALAIFWVIL VHGELRIETK GQYDEEEMTM QQAKRRQKRE WVKFAKPCRE 60 

GEDNSKRNPI AKITSDYQAT QKITYRISGV GIDQPPFGIF WDKNTGD IN ITAIVDREET 120 

PSFLITCRAL NAQGLDVEKP LILTVKILDI NDNPPVFSQQ IFMGEIEENS ASNSLVMILN 180 

ATDADEPNHL NSKIAFKIVS QEPAGTPMFL LSRNTGEVRT LTNSLDREQA SSYRLWSGA 240 

DKDGEGLSTQ CECNIKVKDV NDNFPMFRDS QYSARIEENI LSSELLRFQV TDLDEEYTDN 300 

WLAVYFFTSG NEGNWFEIQT DPRTNEGILK WKALDYEQL QSVKLSIAVK NKAEFHQSVI 360 

SRYRVQSTPV TIQVINVREG IAFRPASKTF TVQKGISSKK LVDYILGTYQ AIDEDTNKAA 420 

SNVKYVMGRN DGGYLMIDSK TAEIKFVKNM NRDSTFIVNK TITAEVLAID EYTGKTSTGT 480 

VYVRVPDFND NCPTAVLEKD AVCSSSPSW VSARTLNNRY TGPYTFALED QPVKLPAVWS 540 

ITTLNATSAL LRAQEQIPPG VYHISLVLTD SQNNRCEMPR SLTLEVCQCD NRGICGTSYP 600 

TTSPGTRYGR PHSGRLGPAA IGIiLLLGLLL LLLAPLLLLT CDCGAGSTGG VTGGFIPVPD 660 

GSEGTIHQWG I EGAHPEDKE ITNICVPPVT ANGADFMESS EVCTNTYARG TAVEGTSGME 72 0 

MTTKLGAATE SGGAAGFATG TVSGAASGFG AATGVGICSS GQSGTMRTRH STGGTNKDYA 7 80 

DGAI SMNFLD SYFSQKAFAC AEEDDGQEAN DCLLIYDNEG ADATGSPVGS VGCCSFIADD 840 

LDDSFLDSLG PKFKKLAEIS LGVDGEGKEV QPPSKDSGYG IESCGHPIEV QQTGFVKCQT 90 0 

LSGSQGASAL SASGSVQPAV SIPDPLQHGN YLVTETYSAS GSLVQPSTAG FDPLLTQNVI 960 
VTERVICPIS SVPGNLAGPT QLRGSHTMLC TEDPCSRLI 

Seq ID NO: 524 DNA sequence 

Nucleic Acid Accession #: XM_058069.2 

Coding sequence: 1..1413 

1 11 21 31 41 51 

I I I 1 I I 

ATGAAGTTTC TTCTAATACT GCTCCTGCAG GCCACTGCTT CTGGAGCTCT TCCCCTGAAC 60 

AGCTCTACAA GCCTGGAAAA AAATAATGTG CTATTTGGTG AAAGATACTT AGAAAAATTT 120 

TATGGCCTTG AGATAAACAA ACTTCCAGTG ACAAAAATGA AATATAGTGG AAACTTAATG 180 

AAGGAAAAAA TCCAAGAAAT GCAGCACTTC TTGGGTCTGA AAGTGACCGG GCAACTGGAC 240 

ACATCTACCC TGGAGATGAT GCACGCACCT CGATGTGGAG TCCCCGATGT CCATCATTTC 3 00 

AGGGAAATGC CAGGGGGGCC CGTATGGAGG AAACATTATA TCACCTACAG AATCAATAAT 360 

T AC AC AC C TG ACATGAACCG TGAGGATGTT GACTACGCAA TCCGGAAAGC TTTCCAAGTA 42 0 

TGGAGTAATG TTACCCCCTT GAAATTCAGC AAGATTAACA CAGGCATGGC TGACATTTTG 4 80 

GTGGTTTTTG CCCGTGGAGC TCATGGAGAC TTCCATGCTT TTGATGGCAA AGGTGGAATC 540 

CTAGCCCATG CTTTTGGACC TGGATCTGGC ATTGGAGGGG ATGCACATTT CGATGAGGAC 600 

GAATTCTGGA CTACACATTC AGGAGGCACA AACTTGTTCC TCACTGCTGT TCACGAGATT 660 

GGCCATTCCT TAGGTCTTGG CCATTCTAGT GATCCAAAGG CCGTAATGTT CCCCACCTAC 720 

AAATATGTTG ACATCAACAC ATTTCGCCTC TCTGCTGATG ACATACGTGG CATTCAGTCC 780 

CTGTATGGAG ACCCAAAAGA GAACCAACGC TTGCCAAATC CTGACAATTC AG AACC AG CT 840 

CTCTGTGACC CCAATTTGAG TTTTGATGCT GTCACTACCG TGGGAAATAA GATCTTTTTC 900 

TTCAAAGACA GGTTCTTCTG GCTGAAGGTT TCTGAGAGAC CAAAGACCAG TGTTAATTTA 960 

ATTTCTTCCT TATGGCCAAC CTTGCCATCT GGCATTGAAG CTGCTTATGA AATTGAAGCC 102 0 

AGAAATCAAG TTTTTCTTTT TAAAGATGAC AAATACTGGT TAAT TAGCAA TTTAAGACCA 1080 

GAGCCAAATT ATCCCAAGAG CATACATTCT TTTGGTTTTC CTAACTTTGT GAAAAAAATT 1140 

GATGCAGCTG TTTTTAACCC ACGTTTTTAT AGGACCTACT TCTTTGTAGA TAACCAGTAT 1200 

TGGAGGTATG ATGAAAGGAG ACAGATGATG GACCCTGGTT ATCCCAAACT GATTACCAAG 1260 

AACTTCCAAG GAATCGGGCC TAAAATTGAT GCAGTCTTCT ACTCTAAAAA CAAATACTAC 132 0 

TATTTCTTCC AAGGATCTAA CCAATTTGAA TATGACTTCC TACT CCAACG TATCACCAAA 13 80 
ACACTGAAAA GCAATAG CTG GTTTGGTTGT TGA 

Seq ID NO: 525 Protein sequence 
Protein Accession #: P39900 

1 11 21 31 41 51 

I I I 1-1 1 

MKFIiliHiLLQ ATASGALPLN SSTSLEKNNV LFGERYLEKF YGLEINKLPV TKMKYSGNLM 60 

KEKIQEMQHF LGLKVTGQLD TSTLEMMHAP RCGVPDVHHF REMPGGPVWR KHYITYRINN 120 
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YTPDMNREDV DYAIRKAFQV WSNVTPLKFS KINTGMADIL WPARGAHGD FHAFDGKGGI 180 
LAHAFGPGSG IGGDAHFDED EFWTTHSGGT NLFLTAVHEI GHSLGLGHSS DPKAVMFPTY 240 
KYVDINTFRL SADDIRGIQS LYGDPKENQR LPNPDNSEPA LCDPNLSFDA VTTVGNKIFF 300 
FKDRFFWLKV SERPKTSVNL ISSLWPTLPS GIEAAYEIEA RNQVFLFKDD KYWIiISNLRP 3 60 
5 EPNYPKSIHS FGFPNFVKKI DAAVFNPRFY RTYFFVDNQY WRYDERRQMM DPGYPKLITK 420 

NFQGIGPKID AVFYSKNKYY YFFQGSNQFE YDFLLQRITK TLKSNSWFGC 

Seq ID NO: 52 6 DNA sequence 
Nucleic Acid Accession #: NM_024423.1 
10 Coding sequence: 64.. 2590 

1 11 21 31 41 51 

I I 1 I I I 

GGCAGGTCTC GCTCTCGGCA CCCTCCCGGC GCCCGCGTTC TCCTGGCCCT GCCCGGCATC 60 
15 CCGATGGCCG CCGCTGGGCC CCGGCGCTCC GTGCGCGGAG CCGTCTGCCT GCATCTGCTG 120 

CTGACCCTCG TGATCTTCAG TCGTGATGGT GAAGCCTGCA AAAAGGTGAT ACTTAATGTA 180 

CCTTCTAAAC TAGAGGCAGA CAAAATAATT GGCAGAGTTA ATTTGGAAGA GTGCTTCAGG 240 

TCTGCAGACC TCATCCGGTC AAGTGATCCT GATTTCAGAG TTCTAAATGA TGGGT CAGTG 300 

TACACAGCCA GGGCTGTTGC GCTGT CTGAT AAGAAAAGAT CATTTACCAT ATGGCTTTCT 3 60 
20 GACAAAAGGA AACAGACACA GAAAGAGGTT ACTGTGCTGC T AGAACAT C A GAAGAAGGTA 420 

TCGAAGACAA GACACACTAG AGAAACTGTT CTCAGGCGTG CCAAGAGGAG ATGGGCACCT 480 

ATTCCTTGCT CTATGCAAGA GAATTCCTTG GGCCCTTTCC CATTGTTTCT TCAACAAGTT 540 

GAATCTGATG CAGCACAGAA CTATACTGTC TTCTACTCAA TAAGTGGACG TGGAGTTGAT 600 

AAAGAACCTT TAAATTTGTT TTATATAGAA AGAGACACTG GAAATCTATT TTGCACTCGG 660 
25 CCTGTGGATC GTGAAGAATA TGATGTTTTT GATTTGATTG CTTATGCGTC AACTGCAGAT 720 

GGATATTCAG CAGAT CTGCC CCTCCCACTA CCCATCAGGG TAGAGGATGA AAATGACAAC 780 

CACCCTGTTT TCACAGAAGC AATTTATAAT TTTGAAGTTT TGGAAAGTAG TAGACCTGGT 840 

ACTACAGTGG GGGTGGTTTG TGCCACAGAC AGAGATGAAC CGGACACAAT GCATACGCGC 900 

CTGAAATACA GCATTTTGCA GCAGACACCA AGGTCACCTG GGCTCTTTTC TGTGCATCCC 960 

30 AGCACAGGCG TAAT CACCAC AGTCTCTCAT TATTTGGACA GAGAGGTTGT AGACAAGTAC 1020 

TCATTGATAA TGAAAGTACA AGACATGGAT GGCCAGTTTT TTGGATTGAT AGGCACATCA 10 80 

ACTTGTATCA TAACAGTAAC AGATT CAAAT GATAATGCAC CCACTTTCAG ACAAAATGCT 1140 

TATGAAGCAT TTGTAGAGGA AAATGCATTC AATGTGGAAA TCTTACGAAT ACCTATAGAA 1200 

GATAAGGATT TAATTAACAC TGCCAATTGG AGAGTCAATT TTACCATTTT AAAGGGAAAT 1260 

35 GAAAATGGAC ATTT CAAAAT C AG CACAGAC AAAGAAACTA ATGAAGGTGT TCTTTCTGTT 1320 

GTAAAGCCAC TGAATTATGA AGAAAACCGT CAAGTGAACC TGGAAATTGG AGTAAACAAT 1380 

GAAGCG CCAT TTGCTAGAGA TAT TCCCAGA GTGACAGCCT TGAACAGAGC CTTGGTTACA 1440 

GTTCATGTGA GGGATCTGGA TGAGGGGCCT GAATGCACTC CTGCAGCCCA ATATGTGCGG 1500 

ATTAAAGAAA ACTTAGCAGT GGGGTCAAAG ATCAACGGCT ATAAGGCATA TGACCCCGAA 1560 

40 AATAGAAATG GCAATGGTTT AAGGTACAAA AAATTGCATG AT CCTAAAGG TTGGATCACC 1620 

ATTGATGAAA TTTCAGGGTC AATCATAACT TCCAAAATCC TGGATAGGGA GGTTGAAACT 1680 

CCCAAAAATG AGTTGTATAA TATTACAGTC CTGGCAATAG ACAAAG AT GA TAGATCATGT 1740 

ACTGGAACAC TTGCTGTGAA CATTGAAGAT GTAAATGATA ATCCACCAGA AATACTTCAA 1800 

GAATATGTAG TCATTTGCAA ACCAAAAATG GGGTATACCG ACATTTTAGC TGTTGATCCT 1860 

45 GATGAACCTG TCCATGGAGC TCCATTTTAT TTCAGTTTGC CCAATACTTC TCCAGAAATC 1920 

AGTAGACTGT GGAGCCTCAC CAAAGTTAAT GATACAGCTG CCCGTCTTTC ATATCAGAAA 1980 

AATGCTGGAT TTCAAGAATA TACCATTCCT ATTACTGTAA AAGACAGGGC CGGCCAAGCT 2 040 

GCAACAAAAT TATTGAGAGT TAATCTGTGT GAATGTACTC ATCCAACTCA GTGTCGTGCG 2100 

ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGGCAA TCCTTGCAAT ATTACTGGGT 2160 

50 ATAGCACTGC TCTTTTCTGT ATTGCTAACT TTAGTATGTG GAGTTTTTGG TGCAACTAAA 2220 

GGGAAACGTT TTCCTGAAGA TTTAGCACAG CAAAACTTAA TTATATCAAA CACAGAAGCA 2280 

CCTGGAGACG ATAGAGTGTG CTCTGCCAAT GGATTTATGA CCCAAACTAC CAACAACTCT 2340 

AGCCAAGGTT TTTGTGGTAC TATGGGATCA GGAATGAAAA ATGGAGGGCA GGAAACCATT 2400 

GAAATGATGA AAGGAGGAAA CCAGACCTTG GAAT CCTGCC GGGGGGCTGG GCATCATCAT 2460 

55 ACCCTGGACT CCTGCAGGGG AGGACACACG GAGGTGGACA ACTGCAGATA CACTTACTCG 2520 

GAGTGGCACA GTTTTACTCA ACCCCGTCTC GGTGAAGAAT CCATTAGAGG ACACACTGGT 2580 

TAAAAATTAA ACATAAAAGA AATTGCATCG ATGTAAT CAG AATGAAGACC GCATGCCATC 2640 

CCAAGATTAT GTCCTCACTT ATAACTATGA GGGAAGAGGA TCTCCAGCTG GTTCTGTGGG 2700 

CTGCTGCAGT GAAAAGCAGG AAGAAGATGG CCTTGACTTT TTAAATAATT TGGAACCCAA 2760 

60 ATTTATTACA TTAGCAGAAG CATG C AC AAA GAGATAATGT CACAGTG CTA CAATTAGGTC 2 82 0 

TTTGTCAGAC ATTCTGGAGG TTTCCAAAAA TAATATTGTA AAGTTCAATT TCAACATGTA 2880 

TGTATATGAT GATTTTTTTC TCAATTTTGA ATTATGCTAC TCACCAATTT ATATTTTTAA 2 940 

AGCCAGTTGT TGCTTATCTT TTCCAAAAAG TGAAAAATGT TAAAACAGAC AACTGGTAAA 3 0 00 

TCTCAAACTC CAGCACTGGA ATTAAGGTCT CTAAAGCATC TGCTCTTTTT TTTTTTTACG 3 0 60 

65 GATATTTTAG TAATAAATAT GCTGGATAAA TATTAGTCCA ACAATAGCTA AGTTATGCTA 3120 

ATATCACATT ATTATGTATT CACTTTAAGT GATAGTTTAA AAAATAAACA AGAAATATTG 3180 

AGT AT CACT A TGTGAAGAAA GTTTTGGAAA AGAAACAATG AAGACTGAAT TAAATTAAAA 3240 

ATGTTGCAGC TCATAAAGAA TTGGGACTCA CCCCTACTGC ACTACCAAAT TCATTTGACT 3300 

TTGGAGGCAA AATGTGTTGA AGTGCCCTAT GAAGTAGCAA TTTTCTATAG GAATATAGTT 3360 

70 GGAAATAAAT GTGTGTGTGT ATATTATTAT TAAT CAATGC AATATTTAAA ATGAAATGAG 3420 

AACAAAGAGG AAAATGGTAA AAACTTGAAA TGAGGCTGGG GTATAGTTTG TCCTACAATA 3 480 

GAAAAAAGAG AGAGCTTCCT AGGCCTGGGC TCTTAAATGC TGCATTATAA CTGAGTCTAT 3 540 

GAGGAAATAG TTCCTGTCCA ATTTGTGTAA TTTGTTTAAA ATTGTAAATA AATTAAACTT 3600 

TTCTGGTTTC TGTGGGAAGG AAATAGGGAA TCCAATGGAA CAGTAGCTTT GCTTTGCAGT 3660 

75 CTGTTTCAAG ATTTCTGCAT CCACAAGTTA GTAGCAAACT GGGGAATACT CGCTGCAGCT 3720 

GGGGTTCCCT GCTTTTTGGT AGCAAGGGTC CAGAGATGAG GTGTTTTTTT CGGGGAGCTA 3 780 

ATAACAAAAA CATTTTAAAA CTTACCTTTA CTGAAGTTAA ATCCTCTATT GCTGTTTCTA 3 840 

TTCTCTCTTA TAGTGACCAA CATCTTTTTA ATTTAGATCC AAATAACCAT GTCCTCCTAG 3900 

AGTTTAGAGG CTAGAGGGAG CTGAGGGGAG GAT CTTACTG AAAGCACCCT GGGGAGATTG 3960 

80 ATTGTCCTTA AACCTAAGCC CCACAAACTT GACACCTGAT CAGGTCTGGG AGCTACAAAA 4020 

TTTCATTTTT CTCCTCACTG CCCTTCTTCT GAGTGGCATT GGCCTGAATC AAGGAAAGCC 4080 

AGGCCTTGTG GGCCCCCTTC TTTCGGCTTT CTGCTAAAGC AACACCTCCA GCAGAGATTC 4140 

CCTTAAGTGA CTCCAGGTTT TCCACCATCC TTCAGCGTGA ATTAATTTTT AATCAGTTTG 4200 

CTTTCTCCAG AGAAATTTTA AAATAATAGA AGAAATAGAA ATTTTGAATG TATAAAAGAA 4260 

85 AAAGATCAAG TTGTCATTTT AGAACAGAGG GAACTTTGGG AGAAAGCAGC CCAAGTAGGT 4320 

TATTTGTACA GTCAGAGGGC AACAGGAAGA TGCAGGCCTT CAAGGGCAAG GAGAGGCCAC 4380 

AAGGAATATG GGTGGGAGTA AAAGCAACAT CGTCTGCTTC ATACTTTTTC CTAGGCTTGG 4440 
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CACTGCCTTT TCCTTTCTCA GGCCAATGGC AACTGCCATT TGAGTCCGGT GAGGGATCAG 4500 

CCAACCTCTT CTCTATGGCT CACCTTATTT GGAGTGAGAA AT CAAGGAGA CAGAGCTGAC 4560 

TGCATGATGA GTCTGAAGGC ATTTGCAGGA TGAGCCTGAA CTGGTTGTGC AGAACAAACA 462 0 

AGGCATTCAT GGGAATTGTT GTATTCCTTC TGCAGCCCTC CTTCTGGGCA CTAAGAAGGT 4680 

CTATGAATTA AATGCCTATC TAAAATTCTG ATTTATTCCT ACATTTTCTG TTTTCTAATT 4740 

TGACCCTAAA ATCTATGTGT TTTAGACTTA GACTTTTTAT TGCCCCCCCC CCCTTTTTTT 480 0 

TTGAGACGGA GTCTCGCTCT GACGCACAGG CTGGAGTGCA GTGGCTCCGA TCTCTGCTCA 4860 

CTGAAAGCTC CGCCTCCCGG GTTCATGCCA TTCTCCTGCC TCAGCCTCCT GAGTAGCTGG 492 0 

GACTAGAGGC GCCCACCACC ACGCCCGGCT AATTTTTTGT ATTTTTAATA GAGACGGGGT 4980 

TTCACTGTGT TAGCCAGGAT GGTCTCGATC TCCTGACCTC GTGATCCGCC TGCCTCGGCC 5040 

TCCCAAAGTG CTGGGATTAC AGGCATGACC CACCGCTCCC GGCCTTGTTT TCCGTTTAAA 5100 

GTCGTCTTCT TTTAATGTAA TCATTTTGAA CATGTGTGAA AGTTGAT CAT ACGAATTGGA 5160 

TCAATCTTGA AATACTCAAC CAAAAGACAG TCGAGAAGCC AGGGGGAGAA AGAACTCAGG 522 0 

GCACAAAATA TTGGTCTGAG AATGGAATTC TCTGTAAGCC TAGTTGCTGA AATTTCCTGC 5280 

TGTAACCAGA AGCCAGTTTT ATCTAACGGC TACTGAAACA CCCACTGTGT TTTGCTCACT 5340 

CCCACTCACC GAT CAAAACC TGCTACCTCC CCAAGACTTT ACTAGTGCCG ATAAACTTTC 5400 

TCAAAGAGCA ACCAGTATCA CTTCCCTGTT TATAAAACCT CTAACCATCT CTTTGTTCTT 5460 

TGAACATGCT GAAAACCACC TGGTCTGCAT GTATGCCCGA ATTTGTAATT CTTTTCTCTC 552 0 

AAATGAAAAT TTAATTTTAG GGATTCATTT CTATATTTTC ACATATGTAG TATTATTATT 5580 

TCCTTATATG TGTAAGGTGA AATTTATGGT ATTTGAGTGT GCAAGAAAAT ATATTTTTAA 5640 

AGCTTTCATT TTTCCCCCAG TGAATGATTT AGAATTTTTT ATGTAAATAT ACAGAATGTT 5700 

TTTTCTTACT TTTATAAGGA AGCAGCTGTC TAAAATGCAG TGGGGTTTGT TTTGCAATGT 5 760 

TTTAAACAGA GTTTTAGTAT TGCTATTAAA AGAAGTTACT TTGCTTTTAA AGAAACTTGG 582 0 

CTGCTTAAAA TAAGCAAAAA TTGGATGCAT AAAGTAATAT TTACAGATGT GGGGAGATGT 5880 

AATAAAACAA TATTAACTTG GCTGCTTAAA ATAAGCAAAA ATTGGATGCA TAAAGTAATA 5 940 

TTTACAGATG TGGGGAGATG TAATAAAACA ATATTAACTT GGTTTCTTGT TTTTGCTGTA 600 0 

TTTAGAGATT . AAATAATTCT AAGATGATCA CTTTGCAAAA TTATGCTTAT GGCTGGCATG 6060 

GAAATAGAAA TACTCAATTA TGTCTTTGTT GTATTAATGG GGAATATTTT GGACAATGTT 612 0 

TCATTAT CAA ATTGTCGACA TCATTAATAT ATATTGTAAT GTTGGGAAGA GATCACTATT 6180 

TTGAAGCACA GCTTTACAGA TGAGTATCTA TGATACATAT GTATAATAAA TTTTGATCGG 6240 

GTATTAAAAG TATTAGAAGG TGGTTATAAT TGCAGAGTAT TCCATGAATA GTACACTGAC 63 00 

ACAGGGGTTT TACTTTGAGG ACCAGTGTAG TCAAGGGAAA ACATGAGTTA AAAAGAAAAG 63 60 

CAGGCAATAT TGCAGTCTTG ATTCTGCCAC TTACAGGATA GATAATGCCT GAACTTTAAT 642 0 

GACAAGATGA TCCAACCATA AAGGTGCTCT GTGCTTCACA GTGAATCTTT TCCCCATGCA 6480 

GGAGTGTGCT CCCCTACAAA CGTTAAGACT GATCATTTCA AAAATCTATT AGCTATATCA 6540 

AAAGCCTTAC ATTTTAATAT AGGTTGAACC AAAATTT CAA TTCCAGTAAC TTCTATTGTA 6600 

ACCATTATTT TTGTGTATGT CTTCAAGAAT GTTCATTGGA TTTTTGTTTG TAATAGTAAA 6660 

ATACCGGATA CATTT CACGT GTCCTTCAGT ATTGATTTGG TTGAATATTG GGTCATAATG 672 0 

GTTGAGAAGC ATGGACACTA GAGCCAGAAT GCTTGGATAT GAATCCTGGA TCTGTCACTT 6780 

ACTT CTGTGT GACCTTTGAA AGGCTACTTA TTTCCTCTCT TAGCTTTCTC ATTAAAATCA 6840 

ATGAACAATG CCAGCCTCAT GGGGTTGTTG AATGATTAAA TTAGTTAATA TACCTAAAGT 6900 

ACATAGAACA CTGCCTGCAC ATAGTAAAAG AATTATAAGT GTGAGGTAGT TGGTAAAATT 6 960 

ATGTAGTTGG ATATACTACC GAACAATATC TAATCTCTTT TTAGGGAAAT AAAGTTTGTG 7020 
CATATATATA ATCCCGAAAC ATG 

Seq ID NO: 527 Protein sequence 
Protein Accession #; NP_077741.1 

1 11 21 31 41 51 

I I I 1 i 1 

MAAAGPRRSV RGAVCLHLLL TLVIFSRDGE ACKKVILNVP SKLEADKIIG RVNLEECFRS 60 

ADLIRSSDPD FRVLNDGSVY TARAVALSDK KRSFTIWLSD KRKQTQKEVT VLLEHQKKVS 120 

KTRHTRETVL RRAKRRWAP I PCSMQENSLG PFPLFLQQVE SDAAQNYTVF YSISGRGVDK 180 

EPLNLFYIER DTGNLFCTRP VDREEYDVFD LIAYASTADG YSADLPLPLP IRVEDENDNH 240 

PVFTEAIYNF EVLESSRPGT TVGWCATDR DEPDTMHTRL KYSILQQTPR SPGLFSVHPS 3 00 

TGVITTVSHY LDREWDKYS LIMKVQDMDG QFFGLIGTST CIITVTDSND NAPT FRQNAY 3 60 

EAFVEENAFN VEILRIPIED KDLINTANWR VNFTILKGNE NGHFKISTDK ETNEGVLSW 420 

KPLNYEENRQ VNLEIGVNNE APFARDIPRV TALNRALVTV HVRDLDEGPE CTPAAQYVRI 480 

KENLAVGSKI NGYKAYDPEN RNGNGLRYKK LHDPKGWITI DEISGSIITS KILDREVETP 540 

KNELYN I T VL» AIDKDDRSCT GTLAVNIEDV NDNPPEILQE YWICKPKMG YTD I LAVDPD 600 

EPVHGAPFYF SLPNTSPEIS RLWSLTKVND TAARLSYQKN AGFQEYTIPI TVKDRAGQAA 660 

TKLLRVNLCE CTHPTQCRAT SRSTGVILGK WAILAILLGI ALLFSVLLTL VCGVFGATKG 720 

KRFPEDLAQQ NLIISNTEAP GDDRVCSANG FMTQTTNNSS QGFCGTMGSG MKNGGQETIE 780 
MMKGGNQTLE SCRGAGHHHT LDSCRGGHTE VDNCRYTYSE WHSFTQPRLG EESIRGHTG 

Seq ID NO: 528 DNA sequence 

Nucleic Acid Accession #: NM_001941.2 

Coding sequence: 64.-2754 

1 11 21 31 41 51 

1 I I I I I 

GGCAGGTCTC GCTCTCGGCA CCCTCCCGGC GCCCGCGTTC TCCTGGCCCT GCCCGGCATC 60 

CCGATGGCCG CCGCTGGGCC CCGGCGCTCC GTGCGCGGAG CCGTCTGCCT GCATCTGCTG 120 

CTGACCCTCG TGATCTTCAG TCGTGATGGT GAAGCCTGCA AAAAGGTGAT ACTTAATGTA 180 

CCTTCTAAAC TAGAGGCAGA CAAAATAATT GGCAGAGTTA ATTTGGAAGA GTGCTTCAGG 240 

TCTGCAGACC TCATCCGGTC AAGTGATCCT GATTTCAGAG TTCTAAATGA TGGGTCAGTG 3 00 

TACACAGCCA GGGCTGTTGC GCTGTCTGAT AAGAAAAGAT CATTTACCAT ATGGCTTTCT 3 60 

GACAAAAGGA AACAGACACA GAAAGAGGTT ACTGTGCTGC TAGAACATCA GAAGAAGGTA 420 

TCGAAGACAA GACACACTAG AGAAACTGTT CTCAGGCGTG CCAAGAGGAG ATGGGCACCT 480 

ATTCCTTGCT CTATGCAAGA GAATT CCTTG GGCCCTTTCC CATTGTTTCT TCAACAAGT T 540 

GAATCTGATG CAGCACAGAA CTATACTGTC TTCTACTCAA TAAGTGGACG TGGAGTTGAT 600 

AAAGAACCTT TAAATTTGTT TTATATAGAA AGAGACACTG GAAATCTATT TTGCACTCGG 660 

CCTGTGGATC GTGAAGAATA TGATGTTTTT GATTTGATTG CTTATGCGTC AACTGCAGAT 720 

GGATATTCAG CAGATCTGCC CCTCCCACTA CCCATCAGGG TAGAGGATGA AAATGACAAC 780 

CACCCTGTTT TCACAGAAGC AATTTATAAT TTTGAAGTTT TGGAAAGTAG TAGACCTGGT 840 

ACTACAGTGG GGGTGGTTTG TGCCACAGAC AGAGATGAAC CGGACACAAT GCATACGCGC 900 

CTGAAATACA GCATTTTGCA GCAGACACCA AGGTCACCTG GGCTCTTTTC TGTGCATCCC 960 
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AGCACAGGCG TAATCACCAC AGTCTCTCAT TATTTGGACA GAGAGGTTGT AGACAAGTAC 1020 

TCATTGATAA TGAAAGTACA AGACATGGAT GGCCAGTTTT TTGGATTGAT AGGCACATCA 10 80 

ACTTGTATCA TAACAGTAAC AG AT T C AAAT GATAATGCAC CCACTTTCAG ACAAAATGCT 1140 

TATGAAGCAT TTGTAGAGGA AAATGCATTC AATGTGGAAA TCTTACGAAT ACCTATAGAA 12 00 

GATAAGGATT TAATTAACAC TGCCAATTGG AGAGTCAATT TTACCATTTT AAAGGGAAAT 1260 

GAAAATGGAC ATTT CAAAAT CAGCACAGAC AAAGAAACTA ATGAAGGTGT TCTTTCTGTT 1320 

GTAAAGCCAC TGAATTATGA AGAAAACCGT CAAGTGAACC TGGAAATTGG AGTAAACAAT 13 80 

GAAGCGCCAT TTGCTAGAGA TATTCCCAGA GTGACAGCCT TGAACAG AG C CTTGGTTACA 1440 

GTT CATGTGA GGGATCTGGA TGAGGGGCCT GAATGCACTC CTGCAGCCCA ATATGTGCGG 1500 

ATTAAAGAAA ACTTAGCAGT GGGGT CAAAG ATCAACGGCT ATAAGGCATA TGACCCCGAA 1560 

AATAGAAATG GCAATGGTTT AAGGTACAAA AAATTGCATG ATCCTAAAGG TTGGAT CACC 1620 

ATTGATGAAA TTTCAGGGTC AAT CAT AACT TCCAAAATCC TGGATAGGGA GGTTGAAACT 1680 

CCCAAAAATG AGT TGTATAA TATTACAGTC CTGGCAATAG ACAAAGATGA TAGATCATGT 1740 

ACTGGAACAC TTGCTGTGAA CATTGAAGAT GTAAATGATA ATCCACCAGA AATACTTCAA 1800 

GAATATGTAG TCATTTGCAA ACCAAAAATG GGGTATACCG ACATTTTAGC TGTTGATCCT 1860 

GATGAACCTG TCCATGGAGC TCCATTTTAT TTCAGTTTGC CCAATACTTC TCCAGAAATC 192 0 

AGTAGACTGT GGAGCCTCAC CAAAGTTAAT GATACAGCTG CCCGTCTTTC ATATCAGAAA 1980 

AATGCTGGAT TTCAAGAATA TACCATTCCT ATTACTGTAA AAGACAGGGC CGGCCAAGCT 2 040 

GCAACAAAAT TATTGAGAGT TAATCTGTGT GAATGTACTC ATCCAACTCA GTGTCGTGCG 2100 

ACTTCAAGGA GTACAGGAGT AATACTTGGA AAATGGGCAA TCCTTGCAAT ATTACTGGGT 2160 

ATAGCACTGC TCTTTTCTGT ATTGCTAACT TTAGTATGTG GAGTTTTTGG TGCAACTAAA 222 0 

GGGAAACGTT TTCCTGAAGA TTTAGCACAG CAAAACTTAA TT ATAT C AAA CACAGAAGCA 22 80 

CCTGGAGACG ATAGAGTGTG CTCTGCCAAT GGATTTATGA CCCAAACTAC CAACAACTCT 2 340 

AGCCAAGGTT TTTGTGGTAC TATGGGATCA GGAATGAAAA ATGGAGGGCA GGAAACCATT 240 0 

GAAATGATGA AAGGAGGAAA CCAGACCTTG GAATCCTGCC GGGGGGCTGG GCATCATCAT 2460 

ACCCTGGACT CCTGCAGGGG AGGACACACG GAGGTGGACA ACTGCAGATA CACTTACTCG 2520 

GAGTGGCACA GTTTTACTCA ACCCCGTCTC GGTGAAAAAT TGCATCGATG T AAT C AG AAT 25 80 

GAAGACCGCA TGCCATCCCA AGATTATGTC CTCACTTATA ACTATGAGGG AAGAGGATCT 2 640 

CCAGCTGGTT CTGTGGGCTG CTGCAGTGAA AAGCAGGAAG AAGATGGCCT TGACTTTTTA 2 700 

AATAATTTGG AACCCAAATT TATTACATTA GCAGAAGCAT GCACAAAGAG ATAATGTCAC 2760 

AGTGCTACAA TTAGGTCTTT GTCAGACATT CTGGAGGTTT CCAAAAATAA TATTGTAAAG 2 82 0 

TTCAATTTCA ACATGTATGT ATATGATGAT TTTTTTCTCA ATTTTGAATT ATGCTACTCA 2 880 

CCAATTTATA TTTTTAAAGC CAGTTGTTGC TTATCTTTTC CAAAAAGTGA AAAATGTTAA 2 940 

AACAGACAAC TGGTAAATCT CAAACTCCAG CACTGGAATT AAGGTCTCTA AAGCATCTGC 3 00 0 

TCTTTTTTTT TTTTACGGAT ATTTTAGTAA TAAATATGCT GGATAAATAT TAGTCCAACA 3 0 60 

ATAGCTAAGT TATGCTAATA TCACATTATT ATGTATTCAC TTTAAGTGAT AGTTTAAAAA 3120 

ATAAACAAGA AATATTGAGT ATCACTATGT GAAGAAAGTT TTGGAAAAGA AACAATGAAG 3180 

ACTGAATTAA ATTAAAAATG TTGCAGCTCA TAAAGAATTG GGACTCACCC CTACTGCACT 3240 

ACCAAATTCA TTTGACTTTG GAGGCAAAAT GTGTTGAAGT GCCCTATGAA GTAGCAATTT 3 3 00 

TCTATAGGAA TATAGTTGGA AATAAATGTG TGTGTGTATA TTATTATTAA TCAATGCAAT 33 60 

ATTTAAAATG AAATGAGAAC AAAGAGGAAA ATGGTAAAAA CTTGAAATGA GGCTGGGGTA 3420 

TAGTTTGTCC TACAATAGAA AAAAGAGAGA GCTTCCTAGG CCTGGGCTCT TAAATGCTGC 3480 

ATTATAACTG AGTCTATGAG GAAATAGTTC CTGTCCAATT TGTGTAATTT GTTTAAAATT 3 540 

GTAAATAAAT TAAACTTTTC TGGTTTCTGT GGGAAGGAAA TAGGGAATCC AATGGAACAG 3 60 0 

TAGCTTTGCT TTGCAGTCTG T TT CAAGATT TCTGCATCCA CAAGTTAGTA GCAAACTGGG 3 660 

GAATACTCGC TGCAGCTGGG GTTCCCTGCT TTTTGGTAGC AAGGGTCCAG AGATGAGGTG 3 720 

TTTTTTTCGG GGAGCTAATA ACAAAAACAT TTTAAAACTT ACCTTTACTG AAGTTAAATC 3 780 

CTCTATTGCT GTTTCTATTC TCTCTTATAG TGACCAACAT CTTTTTAATT TAGATCCAAA 3 840 

TAACCATGTC CTCCTAGAGT TTAGAGGCTA GAGGGAGCTG AGGGGAGGAT CTTACTGAAA 3 900 

GCACCCTGGG GAGATTGATT GTCCTTAAAC CTAAGCCCCA CAAACTTGAC ACCTGAT CAG 3 960 

GTCTGGGAGC TACAAAATTT CATTTTTCTC CTCACTGCCC TTCTTCTGAG TGGCATTGGC 4020 

CTGAATCAAG GAAAGCCAGG CCTTGTGGGC CCCCTTCTTT CGGCTTTCTG CTAAAGCAAC 4080 

ACCTCCAGCA GAGATTCCCT TAAGTGACTC CAGGTTTTCC ACCATCCTTC AGCGTGAATT 4140 

AATTTTTAAT CAGTTTGCTT TCTCCAGAGA AATTTTAAAA TAATAGAAGA AATAGAAATT 42 00 

TTGAATGTAT AAAAGAAAAA GAT CAAGTTG TCATTTTAGA ACAGAGGGAA CTTTGGGAGA 42 60 

AAGCAGCCCA AGTAGGTTAT TTGTACAGTC AGAGGGCAAC AGGAAGATGC AGGCCTTCAA 4320 

GGGCAAGGAG AGGCCACAAG GAATATGGGT GGGAGTAAAA GCAACATCGT CTGCTTCATA 43 80 

CTTTTTCCTA GGCTTGGCAC TGCCTTTTCC TTTCTCAGGC CAATGGCAAC TGCCATTTGA 4440 

GTCCGGTGAG GGATCAGCCA ACCTCTTCTC TATGGCTCAC CTTATTTGGA GTGAGAAATC 4500 

AAGGAGACAG AGCTGACTGC ATGATGAGTC TGAAGGCATT TGCAGGATGA GCCTGAACTG 4560 

GTTGTGCAGA ACAAACAAGG CATT CATGGG AATTGTTGTA TTCCTTCTGC AGCCCTCCTT 462 0 

CTGGGCACTA AGAAGGTCTA TGAATTAAAT GCCTATCTAA AATTCTGATT TATTCCTACA 4680 

TTTTCTGTTT T CTAATTTG A CCCTAAAATC TATGTGTTTT AGACTTAGAC TTTTTATTGC 4740 

CCCCCCCCCC TTTTTTTTTG AGACGGAGTC TCGCTCTGAC GCACAGGCTG GAGTGCAGTG 480 0 

GCTCCGATCT CTGCTCACTG AAAGCTCCGC CTCCCGGGTT CATGCCATTC TCCTGCCTCA 4860 

GCCTCCTGAG TAGCTGGGAC TACAGGCGCC CACCACCACG CCCGGCTAAT TTTTTGTATT 4 920 

TTTAATAGAG ACGGGGTTTC ACTGTGTTAG CCAGGATGGT CTCGATCTCC TGACCTCGTG 4980 

ATCCGCCTGC CTCGGCCTCC CAAAGTGCTG GGATTACAGG CATGACCCAC CGCTCCCGGC 5040 

CTTGTTTTCC GTTTAAAGTC GTCTTCTTTT AATGTAATCA TTTTGAACAT GTGTGAAAGT 5100 

TGATCATACG AATTGGATCA AT CTTG AAAT ACTCAACCAA AAGACAGTCG AGAAGCCAGG 5160 

GGGAGAAAGA ACTCAGGGCA CAAAATATTG GTCTGAGAAT GGAATTCTCT GTAAGCCTAG 522 0 

TTGCTGAAAT TTCCTGCTGT AACCAGAAGC CAGTTTTATC TAACGGCTAC TGAAACACCC 52 80 

ACTGTGTTTT GCTCACTCCC TCACTCACCG ATCAAAACCT GCTACCTCCC CAAGACTTTA 5340 

CTAGTGCCGA TAAACTTTCT CAAAGAGCAA CCAGTATCAC TTCCCTGTTT ATAAAACCTC 5400 

TAACCATCTC TTTGTTCTTT GAACATGCTG AAAACCACCT GGTCTGCATG TATGCCCGAA 5460 

TTTGTAATTC TTTTCTCTCA AATGAAAATT TAATTTTAGG GATTCATTTC TATATTTTCA 5520 

CATATGTAGT ATTATTATTT CCTTATATGT GTAAGGTGAA ATTTATGGTA TTTGAGTGTG 5580 

CAAGAAAATA TATTTTTAAA GCTTTCATTT TTCCCCCAGT GAATGATTTA GAATTTTTTA 5640 

TGTAAATATA CAGAATGTTT TTTCTTACTT TTATAAGGAA GCAGCTGTCT AAAATGCAGT 5700 

GGGGTTTGTT TTGCAATGTT TTAAACAGAG TTTTAGTATT GCTATTAAAA GAAGTTACTT 5760 

TGCTTTTAAA GAAACTTGGC TGCTTAAAAT AAGCAAAAAT TGGATGCATA AAGTAATATT 582 0 

TACAGATGTG GGGAGATGTA ATAAAACAAT ATTAACTTGG TTTCTTGTTT TTGCTGTATT 5880 

TAGAGATTAA ATAATTCTAA GATGATCACT TTGCAAAATT ATGCTTATGG CTGGCATGGA 5940 

AATAGAAATA CTCAATTATG TCTTTGTTGT ATTAATGGGG AATATTTTGG ACAATGTTTC 6000 

ATTATCAAAT TGTCGACATC ATTAATATAT ATTGTAATGT TGGGAAGAGA TCACTATTTT 60 60 

GAAGCACAGC TTTACAGATG AGTATCTATG ATACATATGT ATAATAAATT TTGATCGGGT 6120 

ATTAAAAGTA TTAGAAGGTG GTTATAATTG CAGAGTATTC CATGAATAGT ACACTGACAC 6180 
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AGGGGTTTTA CTTTGAGGAC CAGTGTAGTC AAGGGAAAAC ATGAGTTAAA AAGAAAAGCA 
GGCAATATTG CAGTCTTGAT TCTGCCACTT ACAGGATAGA TAATGCCTGA ACTTTAATGA 
CAAGATGATC CAACCATAAA GGTGCTCTGT GCTTCACAGT GAATCTTTTC CCCATGCAGG 
AGTGTGCTCC CCTACAAACG TTAAGACTGA TCATTTCAAA AATCTATTAG CTATATCAAA 
AGCCTTACAT TTTAATATAG GTTGAACCAA AATTTCAATT CCAGTAACTT CTATTGTAAC 
CATTATTTTT GTGTATGTCT TCAAGAATGT TCATTGGATT TTTGTTTGTA ATAGTAAAAT 
ACCGGATACA TTT CACGTGT CCTT CAGT AT TGATTTGGTT GAATATTGGG TCATAATGGT 
TGAGAAGCAT GGACACTAGA GCCAGAATGC TTGGATATGA ATCCTGGATC TGT CACTTAC 
TTCTGTGTGA CCTTTGAAAG GCTACTTATT TCCTCTCTTA GCTTTCTCAT TAAAATCAAT 
GAACAATGCC AGCCTCATGG GGTTGTTGAA TGATTAAATT AGTTAATATA CCTAAAGTAC 
ATAGAACACT GCCTGCACAT AGTAAAAGAA TTATAAGTGT GAGGTAGTTG GTAAAATTAT 
GTAGTTGGAT ATACTACCGA ACAATATCTA ATCTCTTTTT AGGGAAATAA AGTTTGTGCA 
TATATATAAT CCCGAAACAT G 

Seq ID NO: 52 9 Protein sequence 
Protein Accession #: NP_J301932.1 

1 11 21 31 41 51 

I I I I I I 

MAAAGPRRSV RGAVCLHLLL TLVIFSRDGE ACKKVILNVP SKLEADKIIG RVNLEECFRS 
ADLIRSSDPD FRVLNDGSVY TARAVALSDK KRSFTIWLSD KRKQTQKEVT VLLEHQKKVS 
KTRHTRETVL RRAKRRW AP I PCSMQENSLG PFPLFLQQVE SDAAQNYTVF YSISGRGVDK 
EPLNLFYIER DTGNLFCTRP VDREEYDVFD LIAYASTADG YSADLPLPLP IRVEDENDNH 
PVFTEAIYNF EVLESSRPGT TVGWCATDR DEPDTMHTRL KYSILQQTPR SPGLFSVHPS 
TGVITTVSHY LDREWDKYS LIMKVQDMDG QFFGLIGTST CIITVTDSND NAPTFRQNAY 
EAFVEENAFN VEILRIPIED KDLINTANWR VNFTILKGNE NGHFKISTDK ETNEGVLSW 
KPLNYEENRQ VNLEIGVNNE APFARD I PRV TALNRALVTV HVRDLDEGPE CTPAAQYVRI 
KENLAVGSK I NGYKA YD PEN RNGNGLRYKK LHDPKGWITI DEISGSIITS KILDREVETP 
KNELYNITVL AIDKDDRSCT GTLAVNIEDV NDNPPEILQE YWICKPKMG YTDILAVDPD 
EPVHGAPFYF SLPNTSPEIS RLWSLTKVND TAARLSYQKN AGFQEYTIPI TVKDRAGQAA 
TKLLRVNLCE CTHPTQCRAT SRSTGVILGK WAILAILLGI ALLFSVLLTL VCGVFGATKG 
KRFPEDLAQQ NLIISNTEAP GDDRVCSANG FMTQTTNNSS QGFCGTMGSG MKNGGQETIE 
MMKGGNQTLE SCRGAGHHHT LDSCRGGHTE VDNCRYTYSE WHSFTQPRLG EKLHRCNQNE 
DRMPSQDYVL TYNYEGRGSP AGSVGCCSEK QEEDGLDFLN NL1EPKFITI1A EACTKR 



Seq ID NO: 53 0 DNA sequence 

Nucleic Acid Accession #: NM_016583.2 

Coding sequence : 72 . . 842 

1 11 21 31 41 51 

I I I 1 I 1 

GGAGTGGGGG AGAGAGAGGA GACCAGGACA GCTGCTGAGA CCTCTAAGAA GTCCAGATAC 
TAAGAGCAAA GATGTTTCAA ACTGGGGGCC TCATTGTCTT CTACGGGCTG TTAGCCCAGA 
CCATGGCCCA GTTTGGAGGC CTGCCCGTGC CCCTGGACCA GACCCTGCCC TTGAATGTGA 
ATCCAGCCCT GCCCTTGAGT CCCACAGGTC TTGCAGGAAG CTTGACAAAT GCCCTCAGCA 
ATGGCCTGCT GTCTGGGGGC CTGTTGGGCA TTCTGGAAAA CCTTCCGCTC CTGGACATCC 
TGAAGCCTGG AGGAGGTACT TCTGGTGGCC TCCTTGGGGG ACTGCTTGGA AAAGTGACGT 
CAGTGATTCC TGGCCTGAAC AACATCATTG ACATAAAGGT CACTGACCCC CAGCTGCTGG 
AACTTGGCCT TGTGCAGAGC CCTGATGGCC ACCGTCTCTA TGTCACCATC CCTCTCGGCA 
TAAAGCTCCA AGTGAATACG CCCCTGGTCG GTGCAAGTCT GTTGAGGCTG GCTGTGAAGG 
TGGACATCAC TGCAGAAATC TTAGCTGTGA GAGATAAGCA GGAGAGGATC CACCTGGTCC 
TTGGTGACTG CACCCATTCC CCTGGAAGCC TGCAAATTTC TCTGCTTGAT GGACTTGGCC 
CCCTCCCCAT TCAAGGTCTT CTGGACAGCC TCACAGGGAT CTTGAATAAA GTCCTGCCTG 
AGTTGGTTCA GGGCAACGTG TGCCCTCTGG TCAATGAGGT TCTCAGAGGC TTGGACATCA 
CCCTGGTGCA TGACATTGTT AACATGCTGA T CCACGG ACT ACAGTTTGTC ATCAAGGTCT 
AAGCCTTCCA GGAAGGGGCT GGCCTCTGCT GAGCTGCTTC CCAGTGCTCA CAGATGGCTG 
GCCCATGTGC TGGAAGATGA CACAGTTGCC TTCTCTCCGA GGAACCTGCC CCCTCTCCTT 
TCCCACCAGG CGTGTGTAAC ATCCCATGTG CCTCACCTAA TAAAATGGCT CTTCTTCTGC 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 

Seq ID NO: 531 Protein sequence 
Protein Accession #: NP_057667.1 

1 11 21 31 41 51 

I I I I I I 

MFQTGGLIVF YGLLAQTMAQ FGGLPVPIiDQ TLPLNVNPALj PLSPTGLAGS LTNAL SNGLL 
SGGLLGILEN LPL>LDIL>KPG GGTSGGLLGG LLGKVTSVIP GLNNIIDIKV TDPQLLELGL 
VQSPDGHRLY VTIPLGIKLQ VNTPLVGASL LRLAVKLDIT AEILAVRDKQ ERIHLVLGDC 
THSPGSLQIS LLDGLGPLPI QGIiLDSLTGI LNKVLPELVQ GNVCPLVNEV LRGLDITLVH 
DIVNMbIHGL QFVIKV 



Seq ID NO: 532 DNA sequence 

Nucleic Acid Accession #: NM_0043 63.1 

Coding sequence: 115.. 2223 

1 11 21 31 41 51 

! I I 1 1 1 

CTCAGGGCAG AGGGAGGAAG GACAG CAGAC CAGACAGTCA CAGCAGCCTT GACAAAACGT 
TCCTGGAACT CAAGCTCTTC TCCACAGAGG AGGACAGAGC AGACAGCAGA GACCATGGAG 
TCTCCCTCGG CCCCTCCCCA CAGATGGTGC ATCCCCTGGC AGAGGCTCCT GCTCACAGCC 
TCACTTCTAA CCTTCTGGAA CCCGCCCACC ACTGCCAAGC TCACTATTGA ATCCACGCCG 
TTCAATGTCG CAGAGGGGAA GGAGGTGCTT CTACTTGTCC ACAATCTGCC CCAGCATCTT 
TTTGGCTACA GCTGGTACAA AGGTGAAAGA GTGGATGGCA ACCGTCAAAT TATAGGATAT 
GTAATAGGAA CTCAACAAGC TACCCCAGGG CCCGCATACA GTGGTCGAGA GATAATATAC 
CCCAATGCAT CCCTGCTGAT CCAGAACATC ATCCAGAATG ACACAGGATT CTACACCCTA 
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CACGTCATAA AGTCAGATCT TGTGAATGAA GAAGCAACTG GCCAGTTCCG GGTATACCCG 540 

GAGCTGCCCA AGCCCTCCAT CTCCAGCAAC AACTCCAAAC CCGTGGAGGA CAAGGATGCT 600 

GTGGCCTTCA CCTGTGAACC TGAGACTCAG GACGCAACCT ACCTGTGGTG GGTAAACAAT 660 

CAGAGCCTCC CGGTCAGTCC CAGGCTGCAG CTGTCCAATG GCAACAGGAC CCTCACTCTA 720 

5 TTCAATGTCA CAAGAAATGA CACAGCAAGC TACAAATGTG AAACCCAGAA CCCAGTGAGT 780 

GCCAGGCGCA GTGATTCAGT CATCCTGAAT GTCCTCTATG GCCCGGATGC CCCCACCATT 840 

TCCCCTCTAA ACACATCTTA CAGATCAGGG GAAAATCTGA ACCTCTCCTG CCACGCAGCC 90 0 

TCTAACCCAC CTGCACAGTA CTCTTGGTTT GTCAATGGGA CTTTCCAGCA ATCCACCCAA 960 

GAGCTCTTTA TCCCCAACAT CACTGTGAAT AATAGTGGAT CCTATACGTG CCAAGCCCAT 1020 

10 AACTCAGACA CTGGCCTCAA TAGGACCACA GTCACGACGA TCACAGTCTA TGCAGAGCCA 1080 

CCCAAACCCT TCATCACCAG CAACAACTCC AACCCCGTGG AGGATGAGGA TGCTGTAGCC 1140 

TTAACCTGTG AACCTGAGAT TCAGAACACA ACCTACCTGT GGTGGGTAAA TAATCAGAGC 12 00 

CTCCCGGTCA GTCCCAGGCT GCAGCTGTCC AATGACAACA GGACCCTCAC TCTACTCAGT 12 60 

GTCACAAGGA ATGATGTAGG ACCCTATGAG TGTGGAATCC AGAACGAATT AAGTGTTGAC 1320 

15 CACAGCGACC CAGTCATCCT GAATGTCCTC TATGGCCCAG ACGACCCCAC CATTTCCCCC 1380 

TCATACACCT ATTACCGTCC AGGGGTGAAC CTCAGCCTCT CCTGCCATGC AGCCTCTAAC 1440 

CCACCTGCAC AGTATTCTTG GCTGATTGAT GGGAACATCC AGCAACACAC ACAAGAGCTC 150 0 

TTTATCTCCA ACATCACTGA GAAGAACAGC GGACTCTATA CCTGCCAGGC CAATAACTCA 1560 

GCCAGTGGCC ACAGCAGGAC TACAGTCAAG ACAATCACAG TCTCTGCGGA GCTGCCCAAG 1620 

20 CCCTCCATCT CCAGCAACAA CTCCAAACCC GTGGAGGACA AGGATGCTGT GGCCTTCACC 1680 

TGTGAACCTG AGGCTCAGAA CACAACCTAC CTGTGGTGGG TAAATGGTCA GAGCCTCCCA 1740 

GTCAGTCCCA GGCTGCAGCT GTCCAATGGC AACAGGACCC TCACTCTATT CAATGTCACA 1800 

AGAAATGACG CAAGAGCCTA TGTATGTGGA ATCCAGAACT CAGTGAGTGC AAACCGCAGT 1860 

GACCCAGTCA CCCTGGATGT CCTCTATGGG CCGGACACCC CCATCATTTC CCCCCCAGAC 192 0 

25 TCGTCTTACC TTTCGGGAGC GAACCTCAAC CTCTCCTGCC ACTCGGCCTC TAACCCATCC 1980 

CCGCAGTATT CTTGGCGTAT CAATGGGATA CCGCAGCAAC ACACACAAGT TCTCTTTATC 2040 

GCCAAAATCA CGCCAAATAA TAACGGGACC TATGCCTGTT TTGTCTCTAA CTTGGCTACT 2100 

GGCCGCAATA ATTCCATAGT CAAGAGCATC ACAGTCTCTG CATCTGGAAC TTCTCCTGGT 2160 

CTCTCAGCTG GGGCCACTGT CGGCAT CATG ATTGGAGTGC TGGTTGGGGT TGCTCTGATA 2220 

30 TAGCAGCCCT GGTGTAGTTT CTTCATTTCA GGAAGACTGA CAGTTGTTTT GCTTCTTCCT 2280 

TAAAGCATTT GCAACAGCTA CAGT CTAAAA TTGCTTCTTT ACCAAGGATA TTTACAGAAA 2340 

AGACTCTGAC CAGAG AT CGA GACCATCCTA GCCAACATCG TGAAACCCCA TCTCTACTAA 240 0 

AAATACAAAA ATGAGCTGGG CTTGGTGGCG CGCACCTGTA GTCCCAGTTA CTCGGGAGGC 2460 

TGAGGCAGGA GAATCGCTTG AACCCGGGAG GTGGAGATTG CAGTGAGCCC AGATCGCACC 252 0 

35 ACTGCACTCC AGTCTGGCAA CAGAGCAAGA CTCCATCTCA AAAAGAAAAG AAAAGAAGAC 2 580 

TCTGACCTGT ACTCTTGAAT ACAAGTTTCT GATACCACTG CACTGTCTGA GAATTTCCAA 2640 

AACTTTAATG AACTAACTGA CAGCTTCATG AAACTGT CCA CCAAGATCAA GCAGAGAAAA 2700 

TAATTAATTT CATGGGACTA AATGAACTAA TGAGGATTGC TGATTCTTTA AATGTCTTGT 2760 

TTCCCAGATT TCAGGAAACT TTTTTTCTTT TAAGCTATCC ACTCTTACAG CAATTTGATA 2820 

40 AAATATACTT TTGTGAACAA AAATTGAGAC ATTTACATTT TCTCCCTATG TGGTCGCTCC 2 880 

AGACTTGGGA AACTATTCAT GAATATTTAT ATTGTATGGT AATATAGTTA TTGCACAAGT 2940 
TCAATAAAAA TCTGCTCTTT GTATAACAGA AAAA 



PCT/US02/12476 



45 



Seq ID NO: 533 Protein sequence 
Protein Accession #: NP_004354.1 



50 



55 



60 



1 

I 

MESPSAPPHR 
HLFGYSWYKG 
TLHVIKSDLV 
NNQSLPVSPR 
TISPLNTSYR 
AHNSDTGLNR 
QSLPVSPRLQ 
SPSYTYYRPG 
NSASGHSRTT 
LPVSPRLQLS 
PDSSYLSGAN 
ATGRNNSIVK 



11 
I 

WCIPWQRLLL 
ERVDGNRQII 
NEEATGQFRV 
LQLSNGNRTL 
SGENLNLSCH 
TTVTTITVYA 
LSNDNRTLTL 
VNLSLSCHAA 
VKTITVSAEL 
NGNRTLtTL FN 
LNLSCHSASN 
SITVSASGTS 



21 
I 

TASLLTFWNP 
GYVIGTQQAT 
YPELPKPSIS 
TLFNVTRNDT 
AASNPPAQYS 
EPPKPFITSN 
LSVTRNDVGP 
SNPPAQYSWL 
PKPSISSNNS 
VTRNDARAYV 
PSPQYSWRIN 
FGI» S AGATVG 



31 

I 

PTTAKLTIES 
PGPAYSGREI 
SNNSKPVEDK 
ASYKCETQNP 
WFVNGTFQQS 
NSNPVEDEDA 
YECGIQNELS 
IDGNIQQHTQ 
KPVEDKDAVA 
CGIQNSVSAN 
GIPQQHTQVL 
IMIGVLVGVA 



41 

1 

TPFNVAEGKE 
IYPNASLLIQ 
DAVAFTCEPE 
VSARRSDSVI 
TQELFIPNIT 
VALTCEPEIQ 
VDHSDPVILN 
ELFISNITEK 
FTCEPEAQNT 
RSDPVTLDVL 
FIAKITPNNN 
LI 



51 

VLLLVHNLPQ 
NIIQNDTGFY 
TQDATYLWWV 
LNVLYGPDAP 
VNNSGSYTCQ 
NTTYLWWVNN 
VLYGPDDPTI 
NSGLYTCQAN 
TYLWWVNGQS 
YGPDTPIISP 
GTYACFVSNL 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



65 



70 



75 



80 



Seq ID NO: 534 DNA sequence 

Nucleic Acid Accession #: NM_006952.1 

Coding sequence: 11.. 7 93 



AAT CCCGACA 
TGGAAATGTG 
ATCTGACCAA 
GGCTGCCTGG 
TGTAGGCATC 
AGTATATGCC 
ACCCAACCTC 
TGATGACCAG 
CAATTGCTGT 
TGAGAATAAT 
AGAACCTCTC 
CTGCTATGAA 
ATTTGCCATT 
AATTGAATAT 



11 

! 

ATGGCGAAAG 
ATTATTGGTT 
CACAGCCTCT 
ATCGGCATAT 
ATGAAGTCCA 
TTTGAAGTGG 
TTCCTGAAGC 
TGGAAAAACA 
GGCGTAAATG 
GATGCTGACT 
AACCTGGAGG 
CTGATCTCTG 
CTCTGCTGGA 
TAAGAA 



21 
I 

ACAACTCAAC 
GTTGCGGCAT 
ACCCACTGCT 
TTGTGGGCAT 
GCAGGAAAAT 
CATCTTGTAT 
AGATGCTAGA 
ATGGAGTCAC 
GTCCATCAGA 
ATCCCTGGCC 
CTTGTAAACT 
GTCCAATGAA 
CTTTTTGGGT 



31 
I 

TGTTCGTTGC 
TGCCCTGACT 
TGAAGCCACC 
CTGCCTCTTC 
TCTTCTGGCG 
CACAGCAGCA 
GAGGTACCAA 
CAAAACCTGG 
CTGGCAAAAA 
TCGTCAATGC 
AGGCGTGCCT 
CCGACACGCC 
TCTCCTGGGT 



41 
I 

TTCCAGGGCC 
GCGGAGTGCA 
GACAACGATG 
TGCCTGTCTG 
TATTTCATTC 
ACACAACGAG 
AACAACAGCC 
GACAGGCTCA 
TACACATCTG 
TGTGTTATGA 
GGTTTTTATC 
TGGGGGGTTG 
ACCATGTTCT 



51 
I 

TGCTGATTTT 
TCTTCTTTGT 
ACATCTATGG 
TTCTAGGCAT 
TGATGTTTAT 
ACTTTTTCAC 
CTCCAAACAA 
TGCTCCAGGA 
CCTTCCGGAC 
ACAATCTTAA 
ACAAT CAGGG 
CCTGGTTTGG 
ACTGGAGCAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



85 



Seq ID NO: 535 Protein sequence 
Protein Accession #: NP_008883.1 

1 11 21 31 



390 



WO 02/086443 

MAKDNSTVRC FQGMilFGNV IIGCCGIALT AECIFFVSDQ HSLYPLLEAT DNDD I YGAAW 60 

IGIFVGICLF CLSVLGIVGI MKSSRKILLA YFILMFIVYA FEVASCITAA TQRDFFTPNL 120 

FLKQMLERYQ NNSPPNNDDQ WKNNGVTKTW DRLMLQDNCC GVNGPSDWQK YTSAFRTENN 180 

DADYPWPRQC CVMNNLKEPL NLEACKLGVP GFYHNQGCYE LISGPMNRHA WGVAWFGFAI 240 
LCWTFWVLLG TMFYWSRIEY 



PCT/US02/12476 
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Seq ID NO: 53 6 DNA sequence 

Nucleic Acid Accession #: NMJD02638.1 

Coding sequence: 120.. 473 



11 



21 



31 



41 



51 



CAATACAGCT 
GCTGGACTGC 
TGAGGGCCAG 
AGGCAGCTGT 
T CAATGGAC A 
CGCAAGAGCC 
TCCGGTGCGC 
TCAAGAAGTG 
CGGTCCTTGC 
TGCTGCCCTT 
GAGCTGCCTC 



AAGGAATTAT 
ATAAAGATTG 
CAGCTTCTTG 
CACGGGAGTT 
AGATCCCGTT 
AGT CAAAGGT 
CATGTTGAAT 
CTGTGAAGGC 
TGCACCTGTG 
CCCCTTCCCA 
TCTCATCCAC 



CCCTTGTAAA 
GTATGGCCTT 
ATCGTGGTGG 
CCTGTTAAAG 
AAAGGACAAG 
CCAGTCTCCA 
CCCCCTAACC 
TCTTGCGGGA 
CCGTCCCCAG 
CACTGTCCAT 
TTTCCAATAA 



TACCACAGAC 
AGCTCTTAGC 
TGTTCCTCAT 
GTCAAGACAC 
TTTCAGTTAA 
CTAAGCCTGG 
GCTGCTTGAA 
TGGCCTGTTT 
AGCTACAGGC 
TCTTCCTCCC 
A 



CCGCCCTGGA 
CAAACACCTT 
CGCTGGGACG 
TGTCAAAGGC 
AGGTCAAGAT 
CTCCTGCCCC 
AGATACTGAC 
CGTTCCCCAG 
CCCATCTGGT 
ATTCAGGATG 



GCCAGGCCAA 
CCTGACACCA 
CTGGTTCTAG 
CGTGTTCCAT 
AAAGTCAAAG 
ATTATCTTGA 
TGCCCAGGAA 
TGAAGGGAGC 
CCTAAGTCCC 
CCCACGGCTG 



Seq ID NO: 537 Protein sequence 
Protein Accession #: NP_002 629.1 

1 11 21 31 41 51 

I I 1 I -I I 

MRASSFLIW VFLIAGTLVL EAAVTGVPVK GQDTVKGRVP FNGQDPVKGQ VSVKGQDKVK 
AQEPVKGPVS TKPGSCPIIL IRCAMLNPPN RCLKDTDCPG IKKCCEGSCG MACFVPQ 

Seq ID NO: 53 8 DNA sequence 

Nucleic Acid Accession ft: NM_001793.2 

Coding sequence: 71.. 2560 



AAAGGGGCAA 
CTCTGCAGCC 
CTGGCTGCAG 
CTTGGAGGCG 
CTGCCCTGGG 
TGGCGAGACA 
ATCCAAACGT 
TGAAAATGGC 
AGACACCAAG 
CTTCGCTGTA 
GATTGCCAAG 
CCCCATGAAC 
GGACACCTTC 
GACAGCCACG 
CCATAGCCAA 
CACCATCAGC 
CATCCAGGCC 
GATCCTTGAT 
GCCTGAGAAT 
CAACTCACCA 
TACCATCACC 
TTTTGAGGCC 
GCTGAAGCTC 
ACCTGTGTTT 
GCCTGTGTGT 
CATCCTGAGA 
TGTGGGCACC 
GGTCTTGGCC 
ACTGATTGAT 
CCAAAGCCCT 
CCCTTTCCAG 
GGAAGGTGAC 
GCACCTTTCT 
GTGCGACTGC 
CCCTGTGCTG 
GAGAAAGAAG 
CGTCTTCTAC 
GCTCCACCGA 
CATCATCCCG 
TATAATTGAG 
CTTGGTGTTC 
CTCCGCCTCC 
GAAGCTGGCA 
GGGACCAAAC 
GACTTCGGAG 
ACGTTAGAGT 
AGCACTGAAA 
TCTTACCTGC 
TACAGTGGAC 



11 
I 

GAGCTGAGCG 
ATGGGGCTCC 
TGCGCGGCCT 
GGAGGCGCGG 
CAAGAGCCAG 
GTCCAGGAAA 
ATCTTACGAA 
AAGGGTCCCT 
ATTTTCTACA 
GAGAAGGAGA 
TATGAGCTCT 
ATCTCCATCA 
CGAGGGAGTG 
GATGAGGATG 
GAACCAAAGG 
GTCATCTCCA 
ACAGACATGG 
GCCAATGACA 
GCAGTGGGCC 
GCGTGGCGTG 
ACCCACCCTG 
AAAAACC AG C 
CCAACCTCCA 
GTCCCACCCT 
GTCTACACTG 
GACCCAGCAG 
CTCGACCGTG 
ATGGACAATG 
GTCAATGACC 
GTGCGCCAGG 
GCCCAGCTCA 
ACAGTGGTCT 
CTGTCTGACC 
CATGGCCATG 
GGGGCTGTCC 
CGGAAGATCA 
TATGGCGAAG 
GGTCTGGAGG 
ACACCCATGT 
AACCTGAAGG 
GACTATGAGG 
GACCAAGACC 
GACATGTACG 
GTCAGGCCAC 
CTTGTCAGGA 
GGTTGCTTCC 
ACCTCTCCAC 
CGTAAAATGC 
TTTCTCTCTG 



21 
I 

GAACACCGGC 
CTCGTGGACC 
CCGAGCCGTG 
AGCAGGAGCC 
CTCTGTTTAG 
GAAGGT CACT 
GACACAAGAG 
TCCCCCAGAG 
GCAT CACGGG 
CAGGCTGGTT 
TTGGCCACGC 
TCGTGACCGA 
TCTTAGAGGG 
ATGCCATCTA 
ACCCACACGA 
GTGGCCTGGA 
ATGGGGACGG 
ATGCTCCCAT 
ATGAGGTGCA 
CCACCTACCT 
AGAGCAACCA 
ACACCCTGTA 
CAGCCACCAT 
CCAAAGTCGT 
CAGAAGACCC 
GGTGGCTAGC 
AGGATGAGCA 
GAAGCCCTCC 
ATGGCCCAGT 
TGCTGAACAT 
CAGATGACTC 
TGT CCCTGAA 
ATGGCAACAA 
TCGAAACCTG 
TGGCTCTGCT 
AGGAGCCCCT 
AGGGGGGTGG 
CCAGGCCGGA 
ACCGTCCTCG 
CGGCTAACAC 
GCAGCGGCTC 
AAGATTACGA 
GTGGCGGGGA 
AGAGCATCTC 
AGTGGCCGTA 
TTAGCCTTTC 
CTGGGCCAGG 
TCAACCCTGT 
GAATGGAACC 



31 
I 

CCGCCGTCGC 
TCTCGCGTCT 
CCGGGCGGTC 
CGGCCAGGCG 
CACTGATAAT 
GAAGGAAAGG 
AGATTGGGTG 
ACTGAATCAG 
GCCGGGGGCA 
GTTGTTGAAT 
TGTGTCAGAG 
CCAGAATGAC 
AGTCCTACCA 
CACCTACAAT 
CCTCATGTTC 
CCGGGAAAAA 
CTCCACCACC 
GTTTGACCCC 
GAGGCTGACG 
TATCATGGGC 
GGGCATCCTG 
CGTTGAAGTG 
AGTGGTCCAC 
TGAGGTCCAG 
TGACAAGGAG 
CATGGACCCA 
GTTTGTGAGG 
CACCACTGGC 
CCCTGAGCCC 
CACGGACAAG 
AGACAT CTAC 
GAAGTTCCTG 
AGAGCAGCTG 
CCCTGGACCC 
GTTCCTCCTG 
CCTACTCCCA 
CGAAGAGGAC 
GGTGGTTCTC 
GCCAGCCAAC 
AGACCCCACA 
CGACGCCGCG 
TTATCTGAAC 
GGACGACTAG 
CAAGGGGTCT 
GCAACTTGGC 
AGGATGGAGG 
GTTGCCTCAG 
GTCCTGGGCC 
TTCTTAGGCC 



41 

I 

GGCAGCTGCT 
CTCCTCCTTC 
TTCAGGGAGG 
CTGGGGAAAG 
GATGACTTCA 
AATCCATTGA 
GTTGCTCCAA 
CTCAAGTCTA 
GACAGCCCCC 
AAGCCACTGG 
AATGGTGCCT 
CACAAGCCCA 
GGTACTTCTG 
GGGGTGGTTG 
ACCATTCACC 
GTCCCTGAGT 
ACGGCAGTGG 
CAGAAGTACG 
GTCACTGATC 
GGTGACGACG 
ACAACCAGGA 
ACCAACGAGG 
GTGGAGGATG 
GAGGGCATCC 
AATCAAAAGA 
GACAGTGGGC 
AACAACATCT 
ACGGGAACCC 
CGTCAGATCA 
GACCTGTCTC 
TGGACGGCAG 
AAGCAGGATA 
ACGGTGATCA 
TGGAAGGGAG 
CTGGTGCTGC 
GAAGATGACA 
CAGGACTATG 
CGCAATGACG 
CCAGATGAAA 
GCCCCGCCCT 
TCCCTGAGCT 
GAGTGGGGCA 
GCGGCCTGCC 
CAGTTCCCCC 
GGAGACAGGC 
AATGTGGGCA 
AGGCCAAGTT 
TGGGCCTGCT 
TCCTGGTGCA 



51 
I 

TCACCCCTCT 
TCCAGGTTTG 
CTGAAGTGAC 
TATTCATGGG 
CTGTGCGGAA 
AGATCTTCCC 
TATCTGTCCC 
ATAAAGATAG 
CTGAGGGTGT 
ACCGGGAGGA 
CAGTGGAGGA 
AGTTTACCCA 
TGATGCAGGT 
CTTACTCCAT 
GGAGCACAGG 
ACACACTGAC 
CAGTAGTGGA 
AGGCCCATGT 
TGGACGCCCC 
GGGACCATTT 
AGGGTTTGGA 
CCCCTTTTGT 
TGAATGAGGC 
CCACTGGGGA 
TCAG CTACCG 
AGGTCACAGC 
ATGAAGTCAT 
TTCTGCTAAC 
CCATCTGCAA 
CCCACACCTC 
AGGT CAACG A 
CATATGACGT 
GGGCCACTGT 
GTTTCATCCT 
TTTTGTTGGT 
CCCGTGACAA 
ACATCACCCA 
TGGCACCAAC 
TCGGCAACTT 
ACGACACCCT 
CCCTCACCTC 
GCCGCTTCAA 
TGCAGGGCTG 
TTCAGCTGAG 
TATGAGTCTG 
GTTTGACTTC 
TCCAGAAGCC 
GTGACTGACC 
ACTTAATTTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 



391 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
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70 
75 
80 
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WO 02/086443 

TTTTTTTAAT GCTATCTTCA AAACGTTAGA GAAAGTT CTT CAAAAGTGCA GCCCAGAGCT 3 000 

GCTGGGCCCA CTGGCCGTCC TGCATTTCTG GTTT CCAGAC CCCAATGCCT CCCATTCGGA 3060 

TGGATCTCTG CGTTTTTATA CTGAGTGTGC CTAGGTTGCC CCTTATTTTT TATTTTCCCT 3120 

GTTGCGTTGC TATAGATGAA GGGTGAGGAC AATCGTGTAT ATGTACTAGA ACTTTTTTAT 3180 
TAAAGAAACT TTTCCCAGAA AAAAA 

Seq ID NO: S3 9 Protein sequence 
Protein Accession #: NP_001784.2 



PCT/US02/12476 



1 

MGLPRGPLAS 
QEPALFSTDN 
KGPFPQRLNQ 
YELFGHAVSE 
DEDDAIYTYN 
TDMDGDGSTT 
AWRATYLIMG 
PTSTATIWH 
DPAGWLAMDP 
VNDHGPVPEP 
TWLSLKKFL 
GAVLALIjFLL 
GLEARPEWL 
DYEGSGSDAA 



11 

I 

LLLLQVCWLQ 
DDFTVRNGET 
LKSNKDRDTK 
NGASVEDPMN 
GWAYSIHSQ 
TAVAWEILD 
GDDGDHFTIT 
VEDVNEAPVF 
DSGQVTAVGT 
RQITICNQSP 
KQDTYDVHLS 
LVLLLLVRKK 
RNDVAPTIIP 
SLSSLTSSAS 



21 
I 

CAASEPCRAV 
VQERRSLKER 
IFYSITGPGA 
ISIIVTDQND 
EPKDPHDLMF 
ANDNAPMFDP 
THPESNQGIL 
VPPSKWEVQ 
LDREDEQFVR 
VRQVUJITDK 
LSDHGNKEQL 
RKIKEPLLLP 
TPMYRPRPAN 
DQDQDYDYLN 



31 
I 

FREAEVTLEA 
NPLKIFPSKR 
DSPPEGVFAV 
HKPKFTQDTF 
TIHRSTGTIS 
QKYEAHVPEN 
TTRKGLDFEA 
EGIPTGEPVC 
NNIYEVMVLA 
DLSPHTSPFQ 
TVIRATVCDC 
EDDTRDNVFY 
PDEIGNFIIE 
EWGSRFKKLA 



41 

1 

GGAEQEPGQA 
ILRRHKRDWV 
EKETGWLLLN 
RGSVLEGVLP 
VISSGLDREK 
AVGHEVQRLT 
KNQHTLYVEV 
VYTAEDPDKE 
MDNGSPPTTG 
AQLTDDSDIY 
HGHVETCPGP 
YGEEGGGEED 
NLKAANTDPT 
DMYGGGEDD 



Seq ID NO: 540 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence : 1 . . 672 



ATGAGGCTCC 
CGGGGCTCCC 
AAGGGCGGGG 
CTGCTCGCCT 
GCGAGACAAC 
TGTCATGTTT 
ACAGAGCCAT 
AAGCAGTGCT 
CTCCTGGAAG 
TTAGAGGGGC 
AGCTGTGGTG 
AGCCTGTCTT 



11 
I 

AAAGACCCCG 
CCTACCGGCC 
AGGGGGCGCC 
TGCTGCTGGT 
GAGATCCAGA 
GTGAGAGAGA 
ACTGCGTTAT 
CCGCTGGTTG 
AGCCCATGCC 
CACCTATCAA 
GGCTGTGGCT 
GA 



21 

I 

ACAGGCCCCG 
AGACCCGGGG 
GCGCGCTGAC 
CGTGGCCCTA 
GGACTCCCAG 
AAACACTTTC 
AGCGGCCGTG 
TGCAGCGATG 
CTTCTTTTAC 
CTCATCAGTG 
GGCCATCCTC 



31 

1 

GCGGGTGGGA 
AGAGGCGCGC 
CCTCCCTGGG 
CCGCGGGTGT 
CGAACGGACG 
GAGTGCCAGA 
AAAATATTTC 
GAGAGACCCA 
CTCAAGTGTT 
TTCAAAGAAT 
CTGCTGCTGG 



41 
I 

GGCGCGCGCC 
GGAGGCTGCG 
CACCGCTGGG 
GGACAGACGC 
AGGGTGACAA 
ACCCAAGGAG 
CACGTTTTTT 
AGCCAGAGGA 
GTAAAATTCG 
ATGCTGGGAG 
CCTCCATTGC 



51 

i 

LGKVFMGCPG 
VAPISVPENG 
KPLDREEIAK 
GTSVMQVTAT 
VPEYTLTIQA 
VTDLDAPNSP 
TNEAPFVLKL 
NQKISYRILR 
TGTLLLTIjID 
WTAEVNEEGD 
WKGGFILPVL 
QDYDITQLHR 
APPYDTLLVF 



51 
I 

CCGGGGCGGG 
AAGGTTCCAG 
GACGATGGCG 
CAACCTGACT 
TAGAGTGTGG 
GTGCAAATGG 
CATGGTTGCG 
GAAGCGGTTT 
CTACTGCAAT 
CATGGGTGAG 
AGCCGGCCTC 



Seq ID NO: 541 Protein sequence 
Protein Accession #: Eos sequence 

1 11 



41 



51 



21 31 

! 1 I I 1 I 

MRLQRPRQAP AGGRRAPRGG RGSPYRPDPG RGARRLRRFQ KGGEGAPRAD PPWAPLGTMA 
LLALLLWAL PRVWTDANLT ARQRDPEDSQ RTDEGDNRVW CHV CERENTF ECQNPRRCKW 
TEPYCVIAAV KIFPRFFMVA KQCSAGCAAM ERPKPEEKRF LLEEPMPFFY LKCCKIRYCN 
LEGPPINSSV FKEYAGSMGE SCGGLWLAIL LLLASIAAGL SLS 

Seq ID NO: 542 DNA sequence 

Nucleic Acid Accession #: XM_035292.2 

Coding sequence: 53 .. 1576 



1 

l 

GCTCGCTGGG 
TGCGGGCCCG 
GGAGAAGATG 
CGTGACCCTG 
TATCGGCTCG 
GCTGGCGCTG 
CGCGGAGCTC 
CTACGGCTCG 
AT CGCAGTAC 
CTGCCCGGTG 
GGCCGTGAAC 
CAAGCTCCTG 
TGTGTCCAAT 
TGTGCTGGCA 
CACAGAGGAA 
CATCGTGACG 
GCAGATGCTG 
GTCCTGGATC 
GTTCACATCC 
CTCCATGATC 
GACGCTGCTC 
CAACTGGCTC 
TGAGCTTGAG 
CCTCTTCCTG 
CATCATCCTC 
GTGGCTCCTC 



11 

I 

CCGCGGCTCC 
AAGCGGCGCG 
CTGGCCGCCA 
CAGCGGAACA 
GGCATCTTCG 
GTGGTGTGGG 
GGCACCACCA 
CTGCCCGCCT 
ATCGTGGCCC 
CCCGAGGAGG 
TGCTACAGCG 
GCCCTGGCCC 
CTAGATCCCA 
TTATACAGCG 
ATGATCAACC 
CTGGTGTACG 
TCGTCCGAGG 
ATCCCCGTCT 
TCCAGGCTCT 
CACCCACAGC 
TACGCCTTCT 
TGCGTGGCCC 
CGGCCCATCA 
ATCGCCGTCT 
AGCGGGCTGC 
CAGGGCATCT 



21 

I 

CGGGTGTCCC 
CGCTAGCGGC 
AGAGCGCGGA 
TCACGCTGCT 
TGACGCCCAC 
CCGCGTGCGG 
TCTCCAAATC 
TCCTCAAGCT 
TGGTCTTCGC 
CAGCCAAGCT 
TGAAGGCCGC 
TGATCATCCT 
ACTTCTCATT 
GCCTCTTTGC 
CCTACAGAAA 
TGCTGACCAA 
CCGTGGCCGT 
TCGTGGGCCT 
TCTTCGTGGG 
TCCTCACCCC 
CCAAGGACAT 
TGGCCATCAT 
AGGTGAACCT 
CCTTCTGGAA 
CCGTCTACTT 
TCTCCACGAC 



31 

I 

AGGCCCGGCC 
GCCGGCGGCC 
CGGCTCGGCG 
CAACGGCGTG 
GGGCGTGCTC 
CGTCTTCTCC 
GGGCGGCGAC 
CTGGATCGAG 
CACCTACCTG 
CGTGGCCTGC 
CACCCGGGTC 
GCTGGGCTTC 
TGAAGGCACC 
CTATGGAGGA 
CCTGCCCCTG 
CCTGGCCTAC 
GGACTTCGGG 
GTCCTGCTTC 
GTCCCGGGAA 
CGTGCCGTCC 
CTTCTCCGTC 
CGGCATGATC 
GGCCCTGCCT 
GACACCCGTG 
CTTCGGGGTC 
CGTCCTGTGT 



41 

t 

GGTGCGCAGA 
GAGGAGAAGG 
CCGGCAGGCG 
GCCATCATCG 
AAGGAGGCAG 
ATCGTGGGCG 
TACGCCTACA 
CTGCTCATCA 
CTCAAGCCGC 
CTCTGCGTGC 
CAGGATGCCT 
GTCCAGATCG 
AAACTGGATG 
TGGAATTACT 
GCCATCATCA 
TTCACCACCC 
AACTATCACC 
GGCTCCGTCA 
GGCCACCTGC 
CTCGTGTTCA 
ATCAACTTCT 
TGGCTGCGCC 
GTGTTCTTCA 
GAGTGTGGCA 
TGGTGGAAAA 
CAGAAGCTCA 



51 

I 

GCATGGCGGG 
AAGAGGCGCG 
AGGGCGAGGG 
TGGGGACCAT 
GCTCGCCGGG 
CGCTCTGCTA 
TGCTGGAGGT 
TCCGGCCTTC 
TCTTCCCCAC 
TGCTGCTCAC 
TTGCCGCCGC 
GAAAGGGTGA 
TGGGGAACAT 
TGAATTTCGT 
TCTCCCTGCC 
TGTCCACCGA 
TGGGCGTCAT 
ATGGGTCCCT 
CCTCCATCCT 
CGTGTGTGAT 
TCAGCTTCTT 
ACAGAAAGCC 
TCCTGGCCTG 
TCGGCTTCAC 
ACAAGCCCAA 
TGCAGGTGGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



60 
120 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



392 



WO 02/086443 

CCCCCAGGAG ACATAGCCAG GAGGCCGAGT GGCTGCCGGA GGAGCATGC 



PCT/US02/12476 
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Seq ID NO: 543 Protein sequence 
Protein Accession #: XP 035292.2 



MAG AG P KR RA 
GTIIGSGIFV 
LEVYGSLPAF 
LLTAVNCYSV 
GNIVLALYSG 
STEQMLSSEA 
SILSMIHPQL 
RKPELERPIK 
KPKWLLQGIF 



11 

I 

LAAPAAEEKE 
T PTGVLKE AG 
LKLWIELLII 
KAATRVQDAF 
LFAYGGWNYL 
VAVDFGNYHL 
LTPVPSDVFT 
VNLALPVFFI 
STTVLCQKLM 



21 
I 

EAREKMLAAK 
SPGLALWWA 
RPSSQYIVAL 
AAAKLLALAL 
NFVTEEMINP 
GVMSWIIPVF 
CVMTLI/YAFS 
LACLFLIAVS 
QWPQET 



31 

i 

SAD G SAP AGE 
ACGVFSIVGA 
VFATYLLKPL 
IILLGFVQIG 
YRNLPLAI I I 
VGLSCFGSVN 
KDIFSVINFF 
FWKTPVECGI 



41 
I 

GEGVTLQRNI 
LCYAELGTTI 
FPTCPVPEEA 
KGDVSNLDPN 
SLPIVTLVYV 
GSLFTSSRLF 
SFFNWLCVAL 
GFTIILSGLP 



51 
I 

TLIiNGVAIIV 
SKSGGDYAYM 
AKLVACLCVL 
FSFEGTKLDV 
LTNLAYFTTL 
FVGSREGHLP 
AIIGMIWLRH 
VYFFGVWWKN 



Seq ID NO: 544 DNA sequence 

Nucleic Acid Accession #: NM_005268.1 

Coding sequence: 168.. 989 



TAAAAAGCAA 
TCTGGATATG 
AGCCCTGAGG 
TCTTTGAGGG 
TGTCTCTGGT 
GTGATGACCA 
TTGATGAGTT 
CATGCCCCTC 
ACCGAGAAGC 
GTGGGCTCTG 
TTCTCTATGT 
ACGCAGATCC 
TTTTCACCCT 
TCATCTACCT 
TGTGCACAGG 
CGGGTGACCT 
GAGACCATGT 
CCTGGATGGG 
CATGAGGTAG 
TCAACTCCAG 
GCTCGGTTTC 



11 
I 

AAGAATTCGC 
AAATT CAAGC 
AGTAGTCACT 
ACTCCTGAGT 
CTTCATCTTC 
CAAGGACTTC 
CTTCCCTGTG 
ACTGCTCGTG 
CCATGGGGAG 
GTGGACATAT 
GTTCCACTCA 
ATGTCCCAAT 
CTTCATGGTG 
GGTGAGCAAG 
TCATCACCCC 
CATCTTTCTG 
GAAGAAAACC 
GAGGCTCTAG 
GGGCAGGCAA 
CCACCTGCCC 
CTTTTCTAGA 



21 
I 

GGCCGCGTCG 
TGCTTGCTGA 
CAGTAGCAGC 
GGGGTCAACA 
CGCGTGCTGG 
GACTGCAATA 
TCCCATGTGC 
GTCATGCACG 
AACAGTGGGC 
GTCTGCAGCC 
TTCTACCCCA 
ATAGTGGACT 
GCCACAGCTG 
AGATGCCACG 
CACGGTACCA 
GGCTCAGACA 
ATCTTGTGAG 
CATCTCTCAT 
GAGAGAGGAT 
CAGCTCGACG 
ATGGAAATAG 



31 

I 

ACACGGGCTT 
GTCCTATTGC 
TGACGCGTGG 
AGTACTCCAC 
TGTACCTGGT 
CTCGCCAGCC 
GCCTCTGGGC 
TGGCCTACCG 
GCCTCTACCT 
TAGTGTTCAA 
AATATATCCT 
GCTTCATCTC 
CCATCTGCAT 
AGTGCCTGGC 
CCTCTTCCTG 
GTCATCCTCC 
GGGCTGCCTG 
AGGTGCAACC 
TCAGACGCTC 
GCACTGGGCC 
TGAGGGCCAA 



41 

I 

CCCCGAAAAC 
CGGCTGCTGG 
GTCCACCATG 
AGCCTTTGGG 
GACGGCCGAG 
CGGCTGCTCC 
CCTGCAGCTT 
GGAGGTTCAG 
GAACCCCGGC 
GGCGAGCGTG 
CCCTCCTGTG 
CAAGCCCTCA 
CCTGCTCAAC 
AGCAAGGAAA 
CAAACAAGAC 
TCTCTTACCA 
GACTGGTCTG 
TGAGAGTGGG 
TGGGAGCCAG 
AGTTCCCCCT 
TGC 



51 
I 

CTTCCCCGCT 
GAGCCAGGAG 
AACTGGAGTA 
CGCATCTGGC 
CGTGTGTGGA 
AACGTCTGCT 
ATCCTGGTGA 
GAGAAGAGGC 
AAGAAGCGGG 
GACATCGCCT 
GTCAAGTGCC 
GAGAAGAACA 
CTCGTGGAGC 
GCTCAAGCCA 
GACCTCCTTT 
GACCGCCCCC 
GCAGGTTGGG 
GGAGCTAAGC 
TTCCTAGTCC 
CTGCTCTGCA 



Seq ID NO : 545 Protein sequence 
Protein Accession #: NP_005259.1 



MNWSIFEGLL 
SNVCFDEFFP 
GKKRGGLWWT 
SEKNIFTLFM 
DDLLSGDLIF 



11 
I 

SGVNKYSTAF 
VSHVRLWALQ 
YVCSLVFKAS 
VATAAICILL 
LGSDSHPPLL 



21 
I 

GRIWLSLVFI 
LILVTCPSLL 
VDIAFLYVFH 
NLVELIYLVS 
PDRPRDHVKK 



31 

I 

FRVLVYLVTA 
WMHVAYREV 
SFYPKYILPP 
KRCHECLAAR 
TIL 



41 

1 

ERVWSDDHKD 
QEKRHREAHG 
WKCHADPCP 
KAQAMCTGHH 



51 
I 

FDCNTRQPGC 
ENSGRLYLNP 
NIVDCFISKP 
PHGTTSSCKQ 



60 
120 
180 
240 
300 
360 
420 
480 



60 
120 
180 
240 
300" 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



60 
120 
180 
240 



60 



65 



70 



75 



80 



85 



Seq ID NO: 546 DNA sequence 

Nucleic Acid Accession #: NM_002391.1 

Coding sequence: 2 6.. 457 



CGGGCGAAGC 
CGCCCTGCTG 
CCCGGGGAGC 
CGGCGTGGGT 
GCCCTGCAAC 
TGCGTGTGAT 
CAATGCTCAG 
AAAGGCCAAA 
GCCCCTGGTG 
CACCAGTGCC 
ACTCCCCAGC 
TGAGCCTCCC 
ATTACTAAGA 
TAATAT 



11 
I 

AGCGCGGGCA 
GCGCTCACCT 
GAGTGCGCTG 
TTCCGCGAGG 
TGGAAGAAGG 
GGGGGCACAG 
TGCCAGGAGA 
GCCAAGAAAG 
TCACATGGGG 
TTCTGTCTGC 
CCCACCCCTA 
CCAAAGCAAT 
AACACATCAA 



21 

! 

GCGAGATGCA 
CCGCGGTCGC 
AGTGGGCCTG 
GCACCTGCGG 
AGTTTGGAGC 
GCACCAAAGT 
CCATCCGCGT 
GGAAGGGAAA 
CCTGGCCACG 
TCGTTAGCTT 
AGTGCCCAAA 
GTGAGTCCCA 
ATAAACTGAC 



31 
I 

GCACCGAGGC 
CAAAAAGAAA 
GGGGCCCTGC 
GGCCCAGACC 
CGACTGCAAG 
CCGCCAAGGC 
CACCAAGCCC 
GGACTAGACG 
CCCTCCCTCT 
TAATCAATCA 
GTGGGGAGGG 
GAGCCCGCTT 
TTTTTCCCCC 



41 
I 

TTCCTCCTCC 
GATAAGGTGA 
ACCCCCAGCA 
CAGCGCATCC 
TACAAGTTTG 
ACCCTGAAGA 
TGCACCCCCA 
CCAAGCCTGG 
CCCAGGCCCG 
TGCCCTGCCT 
ACAAGGGATT 
TTGTTCTTCC 
CAATAAAAGC 



Seq ID NO: 547 Protein sequence 
Protein Accession #: NP_0023 82.1 



11 



21 

I 



31 



41 



51 
I 

TCACCCTCCT 
AGAAGGGCGG 
GCAAGGATTG 
GGTGCAGGGT 
AGAACTGGGG 
AGGCGCGCTA 
AGACCAAAGC 
ATGCCAAGGA 
AGATGTGACC 
TGTCCCTCTC 
CTGGGAAGCT 
CCACAATTCC 
TCTTCTTTTT 



51 



MQHRGFLLLT DLALLALTSA V AKKKD KVKK GGPGSECAEW AWGPCTPSSK DCGVGFREGT 
CGAQTQRIRC RVPCNWKKEF GADCKYKFEN WGACDGGTGT KVRQGTLKKA RYNAQCQETI 
RVTKPCTPKT KAKAKAKKGK GKD 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 



Seq ID NO: 54 8 DNA sequence 



393 



WO 02/086443 

Nucleic Acid Accession #: 
Coding sequence: 1..786 



NM 006783 .1 



PCT/US02/12476 



li 



21 



31 



41 



51 



10 



15 



20 



ATGGATTGGG 
GGGAAGGTGT 
CAGGAAGTGT 
AAAAATGTGT 
CTGATCTTCG 
GAAACCACTC 
ATTAAAAAGC 
TTTTTCCGAA 
TACCACCTGC 
TTTATTTCTA 
ATTTGCATGC 
AGATCAAAGA 
CAGAATGAAA 
AGCTAA 



GGACGCTGCA 
GGATCACAGT 
GGGGTGACGA 
GCTATGACCA 
TCTCCACCCC 
GCAAGTTCAG 
ACAAGGTTCG 
TCATCTTTGA 
CCTGGGTGTT 
GGCCAACAGA 
TGCTTAACGT 
GAGCACAGAC 
TGAATGAGCT 



CACTTTCATC 
CATCTTTATT 
GCAAGAGGAC 
CTTTTTCCCG 
AGCGCTGCTG 
GCGAGGAGAG 
GATAGAGGGG 
AGCAGCCTTT 
GAAATGTGGG 
GAAGACCGTG 
GGCAGAGTTG 
GCAAAAAAAT 
GATTTCAGAT 



GGGGGTGTCA 
TTCCGAGTCA 
TTCGTCTGCA 
GTGTCCCACA 
GTGGCCATGC 
AAGAGGAATG 
TCGCTGTGGT 
ATGTATGTGT 
ATTGACCCCT 
TTTACCATTT 
TGCTACCTGC 
CACCCCAATC 
AGTGGT CAAA 



ACAAACACTC 
TGATCCTAGT 
ACACACTGCA 
TCCGGCTGTG 
ATGTGGCCTA 
ATTTCAAAGA 
GGACGTACAC 
TTTACTTCCT 
GCCCCAACCT 
TTATGATTTC 
TGCTGAAAGT 
ATGCCCTAAA 
ATGCAAT CAC 



CACCAGCATC 
GGTGGCTGCC 
ACCGGGATGC 
GGCCCTCCAG 
CTACAGGCAC 
CATAGAGGAC 
CAGCAGCATC 
TTACAATGGG 
TGTTGACTGC 
TGCGTCTGTG 
GTGTTTTAGG 
GGAGAGTAAG 
AGGTTTCCCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



Seq ID NO: 549 Protein sequence 
Protein Accession #: NP_006774.1 



1 11 21 31 41 51 

25 | | | I I I 

MDWGTLHTFI GGVNKHSTSI GKVWITVIFI FRVMILWAA QEVWGDEQED FVCNTLQPGC 60 

KNVCYDHFFP VSHIRLWALQ. LIFVSTPALL VAMHVAYYRH ETTRKFRRGE KRNDFKDIED 120 

IKKHKVRIEG SLWWTYTSSI FFRIIFEAAF MYVFYFLYNG YHLPWVLKCG IDPCPNLVDC 180 

FISRPTEKTV FTIFMISASV ICMLLNVAEL CYLLLKVCFR RSKRAQTQKN HPNHALKESK 240 
30 QNEMNELISD SGQNAITGFP S 



35 



Seq ID NO: 550 DNA sequence 

Nucleic Acid Accession #: NM_002571.1 

Coding sequence: 99.. 587 



11 



21 



31 



41 



51 



40 



45 



50 



CATCCCTCTG 
TCACCCTGGG 
AGGACCTGGA 
ACATCTCCCT 
CCACCCCCGA 
AGAAGAAGGT 
TGGCGAACGA 
AGGACACCAC 
AGGACGATGA 
GGTACTTGCT 
CCAGGAAGAC 
TTTCAAAGAA 
TCCTGCTGCA 
GCAGAGGTTA 



GCTCCAGAGC 
CGTGGCCCTG 
GCTCCCAAAG 
CATGGCGACA 
GGACAACCTG 
CCTTGGAGAG 
GGCCACGCTG 
CACCCCCATC 
GAT CATGCAG 
GGACTTGAAA 
CAGACTCCCA 
TAACCACAGC 
CACCTGCACC 
TTAATAAACC 



TCAGAGCCAC 
GTCTGTGGTG 
TTGGCAGGGA 
CTGAAGGCCC 
GAG AT CGTTC 
AAGACTGGGA 
CTCGATACTG 
CAGAGCATGA 
GGATT CAT CA 
CAGATGGAAG 
CCCTTCCACA 
TCAGAAGACG 
ATTGCCATGG 
CTTGGAGCAT 



CCACAGCCGC 
TCCCGGCCAT 
CCTGGCACTC 
CTCTGAGGGT 
TGCACAGATG 
AT CC AAAGAA 
ACTACGACAA 
TGTGCCAGTA 
GGGCTTTCAG 
AGCCGTGCCG 
CCTCCAGAGC 
ATGACGTGGT 
GGAGGCTGCT 
G 



AGCCATGCTG 
GGACATCCCC 
CATGGCCATG 
CCACATCACC 
GGAGAACAAC 
GTTCAAGATC 
TTTCCTGTTT 
CCTGGCCAGA 
GCCCCTGCCC 
TTTCTAGCTC 
AGTGGGACTT 
CATCTGTGTC 
CCCTGGGGGC 



TGCCTCCTGC 
CAGACCAAGC 
GCGACCAACA 
TCACTGTTGC 
AGCTGTGTTG 
AACTATACGG 
CTCTGCCTAC 
GTCCTGGTGG 
AGGCACCTAT 
ACCTCCGCCT 
CCTCCTGCCC 
GCCATCCCCT 
AGAGTCTCTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



55 



60 



Seq ID NO: 551 Protein sequence 
Protein Accession #: NP_002562.1 

1 11 21 31 41 51 

I I I I I I 

MDIPQTKQDL ELPKLAGTWH SMAMATNNIS LMATLKAPLR VHITSLLPTP EDNLEIVLHR 

WENNSCVEKK VLGEKTGNPK KFKINYTVAN EATLLDTDYD NFLFLCLQDT TTPIQSMMCQ 

YDARVLVEDD EIMQGFIRAF RPLPRHLWYL LDIiKQMEEPC RF 



60 
120 



65 



Seq ID NO: 552 DNA sequence 

Nucleic Acid Accession #: NM_006500.1 

Coding sequence: 27.. 1967 



11 



21 



31 



41 



51 



70 
75 
80 
85 



ACTTGCGTCT 
TCGCCGCCTG 
CGCCTGAGCT 
AGTCCCAAGG 
TCATCTTCCG 
TCAGCCTCCA 
GCATCTTCTT 
TCTACAAAGC 
GTAAGGAGCC 
T CAT CTGGTA 
CGTCCCAGAC 
TGGTTAAAGA 
GGAACCACAT 
TGTGGCTGGA 
GTTTGGCTGA 
GGGAGGCAGA 
AGGAACACAG 
TGAGTGAACC 
CCCCTGAGAG 
ACCTCGAGTT 



CGCCCTCCGG 
CTGCTGCTGT 
GGTGGAGGTG 
CAACCTCAGC 
TGTGCGCCAG 
GGACAGAGGG 
GTGCCAGGGC 
TCCGGAGGAG 
TGAGGAGGTC 
CAAGAATGGC 
TGTGGAGTCG 
AGACAAAGAT 
GAAGGAGTCC 
AGTGGAGCCC 
TGGCAACCCT 
GGAAGAGACA 
TGGGCGCTAT 
ACAGGAACTA 
ACAGGAAGGC 
CCAGTGGCTG 



CCAAGCATGG 
CCTCGCGTCG 
GAAGTGGGCA 
CATGTCGACT 
GGCCAGGGCC 
GCTACTCTGG 
AAGCGCCCTC 
CCAAACATCC 
GCTACCTGTG 
CGGCCTCTGA 
AGTGGTTTGT 
GCCCAGTTTT 
AGGGAAGTCA 
GTGGGAATGC 
CCACCACACT 
ACCAACGACA 
GAATGTCAGG 
CTGGTGAACT 
AGCAGCCTCA 
AGAGAAGAGA 



GGCTTCCCAG 
CGGGTGTGCC 
GCACAGCCCT 
GGTTTTCTGT 
AGAGCGAACC 
CCCTGACTCA 
GGT CCCAGGA 
AGGTCAACCC 
TAGGGAGGAA 
AGGAGGAGAA 
ACACCTTGCA 
ACTGTGAGCT 
CCGTCCCTGT 
TGAAGGAAGG 
TCAGCATCAG 
ACGGGGTCCT 
CCTGGAACTT 
ATGTGTCTGA 
CCCTGACCTG 
CAGACCAGGT 



GCTGGTCTGC 
CGGAGAGGCT 
TCTGAAGTGC 
CCACAAGGAG 
TGGGGAGTAC 
AGTCACCCCC 
GTACCGCATC 
CCTGGGCATC 
CGGGTACCCC 
GAACCGGGTC 
GAGTATTCTG 
CAACTACCGG 
TTTCTACCCG 
GGACCGCGTG 
CAAGCAGAAC 
GGTGCTGGAG 
GGACACCATG 
CGTCCGAGTG 
TGAGGCAGAG 
GCTGGAAAGG 



GCCTTCTTGC 
GAGCAGCCTG 
GGCCTCTCCC 
AAGCGGACGC 
GAGCAGCGGC 
CAAGACGAGC 
CAGCTCCGCG 
CCTGTGAACA 
ATTCCTCAAG 
CACATTCAGT 
AAGGCACAGC 
CTGCCCAGTG 
ACAGAAAAAG 
GAAATCAGGT 
CCCAGCACCA 
CCTGCCCGGA 
ATATCGCTGC 
AGTCCCGCAG 
AGTAGCCAGG 
GGGCCTGTGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



394 



WO 02/086443 

TTCAGTTGCA TGACCTGAAA CGGGAGGCAG GAGGCGGCTA TCGCTGCGTG GCGTCTGTGC 12 60 

CCAGCATACC CGGCCTGAAC CGCACACAGC TGGTCAAGCT GGCCATTTTT GGCCCCCCTT 1320 

GGATGGCATT CAAGGAGAGG AAGGTGTGGG TGAAAGAGAA TATGGTGTTG AATCTGTCTT 13 80 

GTGAAGCGTC AGGGCACCCC CGGCCCACCA TCTCCTGGAA CGTCAACGGC ACGG CAAGTG 1440 

AACAAGACCA AGATCCACAG CGAGTCCTGA GCACCCTGAA TGTCCTCGTG ACCCCGGAGC 150 0 

TGTTGGAGAC AGGTGTTGAA TGCACGGCCT CCAACGACCT GGGCAAAAAC ACCAGCATCC 1560 

TCTTCCTGGA GCTGGTCAAT TTAACCACCC TCACACCAGA CTCCAACACA ACCACTGGCC 162 0 

TCAGCACTTC CACTGCCAGT CCTCATACCA GAGCCAACAG CACCTCCACA GAGAGAAAGC 1680 

TGCCGGAGCC GGAGAGCCGG GGCGTGGTCA TCGTGGCTGT GATTGTGTGC ATCCTGGTCC 1740 

TGGCGGTGCT GGGCGCTGTC CTCTATTTCC TCTATAAGAA GGGCAAGCTG CCGTGCAGGC 1800 

GCTCAGGGAA GCAGGAGATC ACGCTGCCCC CGTCTCGTAA GACCGAACTT GTAGTTGAAG 1860 

TTAAGTCAGA TAAGCTCCCA GAAGAGATGG GCCTCCTGCA GGGCAGCAGC GGTGACAAGA 1920 

GGGCTCCGGG AGACCAGGGA GAGAAATACA TCGATCTGAG GCATTAGCCG CGAATCACTT 1980 

CAGCTCCCTT CCCTGCCTGG ACCATTCCCA GCTCCCTGCT CACTCTTCTC TCAGCCAAAG 2 040 

CCTCCAAAGG GACTAGAGAG AAGCCTCCTG CTCCCCTCAC CTGCACACCC CCTTTCAGAG 2100 

GGCCACTGGG TTAGGACCTG AGGACCTCAC TTGGCCCTGC AAGCCGCTTT TCAGGGACCA 2160 

GTCCACCACC ATCTCCTCCA CGTTGAGTGA AGCTCATCCC AAGCAAGGAG CCCCAGTCTC 222 0 

CCGAGCGGGT AGGAGAGTTT CTTGCAGAAC GTGTTTTTTC TTTACACACA TTATGGCTGT 22 8 0 

AAATACCTGG CTCCTGCCAG CAG CTGAGCT GGGTAGCCTC TCTGAGCTGG TTTCCTGCCC 2 34 0 

CAAAGGCTGG CTTCCACCAT CCAGGTGCAC CACTGAAGTG AGGACACACC GGAGCCAGGC 2400 

GCCTGCTCAT GTTGAAGTGC GCTGTTCACA CCCGCTCCGG AGAGCACCCC AGCGGCATCC 24 60 

AGAAGCAGCT GCAGTGTTGC TGCCACCACC CTCCTGCTCG CCTCTTCAAA GTCTCCTGTG 252 0 

ACATTTTTTC TTTGGTCAGA AGCCAGGAAC TGGTGTCATT CCTTAAAAGA TACGTGCCGG 2580 

GGCCAGGTGT GGTGGCTCAC GCCTGTAATC CCAGCACTTT GGGAGGCCGA GGCGGGCGGA 2640 

TCACAAAGTC AGGACGAGAC CAT CCTGGCT AACACGGTGA AACCCTGTCT CTACTAAAAA 2 70 0 

TACAAAAAAA AATTAG CTAG GCGTAGTGGT TGGCACCTAT AGTCCCAGCT ACT CGGAAGG 2 760 

CTGAAGCAGG AGAATGGTAT GAATCCAGGA GGTGGAGCTT GCAGTGAGCC GAGACCGTGC 2 820 

CACTGCACTC CAGCCTGGGC AACACAGCGA GACTCCGTCT CGAGGAAAAA AAAAGAAAAG 2 880 

ACGCGTACCT GCGGTGAGGA AGCTGGGCGC TGTTTTCGAG TTCAGGTGAA TTAGCCTCAA 2940 

TCCCCGTGTT CACTTGCTCC CATAGCCCTC TTGATGGATC ACGTAAAACT GAAAGGCAGC 3 000 

GGGGAGCAGA CAAAGATGAG GTCTACACTG TCCTTCATGG GGATTAAAGC TATGGTTATA 3 0 60 

TTAGCACCAA ACTTCTACAA ACCAAGCTCA GGGCCCCAAC CCTAGAAGGG CCCAAATGAG 312 0 

AGAATGGTAC TTAGGGATGG AAAACGGGGC CTGGCTAGAG CTTCGGGTGT GTGTGTCTGT 3180 

CTGTGTGTAT GCATACATAT GTGTGTATAT ATGGTTTTGT CAGGTGTGTA AATTTGCAAA 32 40 

TTGTTTCCTT TATATATGTA TGTATATATA TATATGAAAA TATATATATA TATGAAAAAT 3 300 

AAAGCTTAAT TGTCCCAGAA AATCATACAT TGCTTTTTTA TTCTACATGG GTACCACAGG 3 3 60 

AACCTGGGGG CCTGTGAAAC TACAACCAAA AGGCACACAA AACCGTTTCC AGTTGGCAGC 3420 

AGAGATCAGG GGTTACCTCT GCTTCTGAGC AAATGGCTCA AGCTCTACCA GAGCAGACAG 3 480 

CTACCCTACT TTTCAGCAGC AAAACGTCCC GTATGACGCA GCACGAAGGG CCTGGCAGGC 3 540 
TGTTAGCAGG AGCTATGTCC CTTCCTATCG TTTCCGTCCA CTT 

Seq ID NO: 553 Protein sequence 
Protein Accession #: NP_006491.1 

1 11 21 31 41 51 

i I I I I I 

GLPRLVCAFL LAACCCCPRV AGVPGEAEQP APELVEVEVG STALLKCGLS QSQGNLSHVD 60 

WFSVHKEKRT LIFRVRQGQG QSEPGEYEQR LSLQDRGATL ALTQVTPQDE RIFLCQGKRP 120 

RSQEYRIQLR VYKAPEEPNI QVNPLGIPVN SKEPEEVATC VGRNGYPIPQ VIWYKNGRPL 180 

KEEKNRVHIQ SSQTVESSGk YTLQSILKAQ LVKEDKDAQF YCELNYRLPS GNHMKESREV 240 

TVPVFYPTEK VWLEVEPVGM LKEGDRVEIR CLADGNPPPH FSISKQNPST REAEEETTND 300 

NGVLVLE PAR KEHSGRYECQ AWNLDTMISL LSEPQELLVN YVSDVRVSPA APERQEGSSL 360 

TLTCEAESSQ DLEFQWLREE TDQVLERGPV LQLHDIjKREA GGGYRCVASV PSIPGLNRTQ 42 0 

LVKLAIFGPP WMAFKERKVW VKENMVLNLS CEASGHPRPT ISWNVNGTAS EQDQDPQRVL 480 

STLNVLVTPE LLETGVECTA SNDLGKNTSI LFLELVNLTT LTPDSNTTTG LSTSTASPHT 540 

RANSTSTERK LPEPESRGW IVAVIVCILV LAVLGAVLYF LYKKGKLPCR RSGKQEITLP 600 
PSRKTELWE VKSDKLPEEM GLLQGSSGDK RAPGDQGEKY IDLRH 

Seq ID NO: 554 DNA sequence 

Nucleic Acid Accession #: NM_003183.3 

Coding sequence: 165.. 2639 

1 11 21 31 41 51 

I i 1 I 1 I 

TCGAGCCTGG CGGTAGAAT C TTCCCAGTAG GCGGCGCGGG AGGGAAAAGA GGATTGAGGG 60 

GCTAGGCCGG GCGGATCCCG TCCTCCCCCG ATGTGAGCAG TTTTCCGAAA CCCCGTCAGG 12 0 

CGAAGGCTGC CCAGAGAGGT GGAGTCGGTA GCGGGGCCGG GAACATGAGG CAGTCTCTCC 180 

TATTCCTGAC CAGCGTGGTT CCTTTCGTGC TGGCGCCGCG ACCTCCGGAT GACCCGGGCT 240 

TCGGCCCCCA CCAGAGACT C GAGAAGCTTG ATTCTTTGCT CTCAGACTAC GATATTCTCT 300 

CTTTATCTAA TATCCAGCAG CATTCGGTAA GAAAAAGAGA TCTACAGACT TCAACACATG 360 

TAGAAACACT ACTAACTTTT TCAGCTTTGA AAAGGCATTT TAAATTATAC CTGACATCAA 420 

GTACTGAACG ' TTTTTCACAA AATTTCAAGG TCGTGGTGGT GGATGGTAAA AACGAAAGCG 480 

AGTACACTGC AAAATGGCAG GACTTCTTCA CTGGACACGT GGTTGGTGAG CCTGACTCTA 540 

GGGTTCTAGC CCACATAAGA GATGATGATG TTATAATCAG AATCAACACA GATGGGGCCG 600 

AATATAACAT AGAGCCACTT TGGAGATTTG TTAATGATAC CAAAGACAAA AGAATGTTAG 660 

TTTATAAATC TGAAGATATC AAGAATGTTT CACGTTTGCA GTCTCCAAAA GTGTGTGGTT 72 0 

ATTTAAAAGT GGATAATGAA GAGTTGCTCC CAAAAGGGTT AGTAGACAGA GAACCACCTG 780 

AAGAGCTTGT TCATCGAGTG AAAAGAAGAG CTGACCCAGA TCCCATGAAG AACACGTGTA 840 

AATTATTGGT GGTAGCAGAT CATCGCTTCT ACAGATACAT GGGCAGAGGG GAAGAGAGTA 900 

CAACTACAAA TTACTTAATA GAGCTAATTG ACAGAGTTGA TGACATCTAT CGGAACACTT 960 

CATGGGATAA TGCAGGTTTT AAAGGCTATG GAATACAGAT AGAGCAGATT CGCATTCTCA 1020 

AGTCTCCACA AGAGGTAAAA CCTGGTGAAA AGCACTACAA CATGGCAAAA AGTTACCCAA 1080 

ATGAAGAAAA GGATGCTTGG GATGTGAAGA TGTTGCTAGA GCAATTTAGC TTTGATATAG 1140 

CTGAGGAAGC ATCTAAAGTT TGCTTGGCAC ACCTTTTCAC ATACCAAGAT TTTGATATGG 1200 

GAACTCTTGG ATTAGCTTAT GTTGGCTCTC CCAGAGCAAA CAGCCATGGA GGTGTTTGTC 1260 

CAAAGGCTTA TTATAGCCCA GTTGGGAAGA AAAATATCTA TTTGAATAGT GGTTTGACGA 132 0 

GCACAAAGAA TTATGGTAAA ACCATCCTTA CAAAGGAAGC TGACCTGGTT ACAACTCATG 13 80 



395 



WO 02/086443 

AATTGGGACA TAATTTTGGA GCAGAACATG ATCCGGATGG TCTAGCAGAA TGTGCCCCGA 1440 

ATGAGGACCA GGGAGGGAAA TATGTCATGT ATCCCATAGC TGTGAGTGGC GATCACGAGA 1500 

ACAATAAGAT GTTTTCAAAC TGCAGTAAAC AATCAAT CTA TAAGACCATT GAAAGTAAGG 1560 

CCCAGGAGTG TTTTCAAGAA CGCAGCAATA AAGTTTGTGG GAACT CGAGG GTGGATGAAG 162 0 

GAGAAGAGTG TGATCCTGGC ATCATGTATC TGAACAACGA CACCTGCTGC AACAGCGACT 1680 

GCACGTTGAA GGAAGGTGTC CAGTGCAGTG ACAGGAACAG TCCTTGCTGT AAAAACTGTC 1740 

AGTTTGAGAC TGCCCAGAAG AAGTGCCAGG AGGCGATTAA TGCTACTTGC AAAGGCGTGT 1800 

CCTACTGCAC AGGTAATAGC AGTGAGTGCC CGCCTCCAGG AAATGCTGAA AATGACACTG 1860 

TTTGCTTGGA TCTTGGCAAG TGTAAGGATG GGAAATGCAT CCCTTTCTGC GAGAGGGAAC 192 0 

AGCAG CTGGA GTCCTGTGCA TGTAATGAAA CTGACAACTC CTGCAAGGTG TGCTGCAGGG 1980 

ACCTTTCTGG CCGCTGTGTG CCCTATGTCG ATGCTGAACA AAAGAACTTA TTTTTGAGGA 2 040 

AAGG AAAG CC CTGTACAGTA GGATTTTGTG ACATGAATGG CAAATGTGAG AAACGAGTAC 2100 

AGGATGTAAT TGAACGATTT TGGGATTTCA TTGACCAGCT GAGCATCAAT ACTTTTGGAA 2160 

AGTTTTTAGC AGACAACATC GTTGGGTCTG TCCTGGTTTT CTCCTTGATA TTTTGGATTC 2220 

CTTTCAGCAT TCTTGTCCAT TGTGTGGATA AGAAATTGGA TAAACAGTAT GAATCTCTGT 22 80 

CTCTGTTTCA CCCCAGTAAC GTCGAAATGC TGAGCAGCAT GGATTCTGCA TCGGTTCGCA 2 340 

TTATCAAACC CTTTCCTGCG CCCCAGACTC CAGGCCGCCT GCAGCCTGCC CCTGTGATCC 240 0 

CTTCGGCGCC AGCAGCTCCA AAACTGGACC ACCAGAGAAT GGACACCATC CAGGAAGACC 2 460 

CCAGCACAGA CTCCCATATG GACGAGGATG GGTTTGAGAA GGACCCCTTC CCAAATAGCA 252 0 

GCACAGCTGC CAAGTCATTT GAGGATCTCA CGGACCATCC GGTCGCCAGA AGTGAAAAGG 2580 

CTGCCTCCTT TAAACTG CAG CGT CAGAATC GTGTTAACAG CAAAGAAACA GAGTGCTAAT 2 640 

TTAGTTCTCA GCTCTTCTGA CTTAAGTGTG CAAAATATTT TTATAGATTT GACCTACAAA 2700 

TCAATCACAG CTTGTATTTT GTGAAGACTG GGAAGTGACT TAGCAGATGC TGGTCATGTG 2 760 

TTTGAACTTC CTGCAGGTAA ACAGTTCTTG TGTGGTTTGG CCCTTCTCCT TTTGAAAAGG 2 820 

TAAGGTGAAA GTGAATCTAC TTATTTTGAG GCTTTCAGGT TTTAGTTTTT AAAATATCTT 2 8 80 

TTGACCTGTG GTGCAAAAGC AGAAAATACA GCTGGATTGG GTTATGAATA TTTACGTTTT 2 94 0 

TGTAAATTAA TCTTTTATAT TGATAACAGC ACTGACTAGG GAAATGATCA GTTTTTTTTT 30 0 0 

ATACACTGTA ATGAACCGCT GAATATGAAG CATTTGGCAT TTATTTGTGA GAAAAGTGGA 3 060 

ATAGTTTTTT TTTTTTTTTT TTTTTTTTGC CTTCAACTAA AAACAAAGGA GATAAATTTA 312 0 

GTATACATTG TATCTAAATT GTGGGTCTAT TTCTAGTTAT TACCCAGAGT TTTTATGTAG 3180 

CAGGGAAAAT ATATATCTAA ATTTAGAAAT CATTTGGGTT AATATGGCTC TTCATAATTC 3240 

TAAGACTAAT GCTCAGAACC TAACCACTAC CTTACAGTGA GGGCTATACA TGGTAGCCAG 3300 

TTGAATTTAT GGAATCTACC AACTGTTTAG GGCCCTGATT TGCTGGGCAG TTTTTCTGTA 3 3 60 

TTTTATAAGT ATCTTCATGT ATCCCTGTTA CTGATAGGGA TACATGTCTT AGAAAATTCA 3 420 

CTATTGGCTG GGAGTGGTGG CTCATGCCTG TAATCCCAGC ACTTGGAGAG GCTGAGGTTG 3 480 
CGCCACTACA CTCCAGCCTG GGTGACAGAG TGAGATCTGC CTC 

Seq ID NO: 555 Protein sequence 
Protein Accession #: NP_003174.2 

1 11 21 31 41 51 

! I i 1 I I 

MRQSLLFLTS WPFVLAPRP PDDPGFGPHQ RLEKLDSLLS DYDILSLSNI QQHSVRKRDL 60 

QTSTHVETLL TFSALKRHFK LYLTSSTERF SQNFKWWD GKNESEYTAK WQDFFTGHW 120 

GEPDSRVLAH IRDDDVIIRI NTDGAEYNIE PLWRFVNDTK DKRMLVYKSE DIKNVSRLQS 180 

PKVCGYLKVD NEELLPKGLV DREPPEELVH RVKRRADPDP MKNTCKLLW ADHRFYRYMG 240 

RGEESTTTNY IiIELIDRVDD IYRNTSWDNA GFKGYGIQIE QIRILKSPQE VKPGEKHYNM 3 00 

AKSYPNEEKD AWDVKMLLEQ FSFDIAEEAS KVCLAHLFTY QDFDMGTLGL AYVGSPRANS 3 60 

HGGVCPKAYY SPVGKKNIYL NSGLTSTKNY GKTILTKEAD LVTTHELGHN FGAEHDPDGL 42 0 

AECAPNEDQG GKYVMYP I AV SGDHENNKMF SNCSKQSIYK TIESKAQECF QERSNKVCGN 480 

SRVDEGEECD PGIMYLNNDT CCNSDCTLKE GVQCSDRNSP CCKNCQFETA QKKCQEAINA 540 

TCKGVSYCTG NSSECPPPGN AENDTVCLDL GKCKDGKCIP FCEREQQLES CACNETDN S C 600 

KVCCRDLSGR CVPYVDAEQK NLFLRKGKPC T VGF CDMNGK CEKRVQDVIE RFWDFIDQLS 660 

INTFGKFLAD NIVGSVLVFS LIFWIPFSIL VHCVDKKLDK QYESLSLFHP SNVEMLSSMD 72 0 

SASVRIIKPF PAPQTPGRLQ PAPVIPSAPA APKLDHQRMD TIQEDPSTDS HMDEDGFEKD 780 
PFPNSSTAAK SFEDLTDHPV ARSEKAASFK LQRQNRVNSK ETEC 

Seq ID NO: 556 DNA sequence 

Nucleic Acid Accession #: NM_021832.1 

Coding sequence: 164.. 2248 

1 11 21 31 41 51 

I ! I i I I 

TCGAGCCTGG CGGTAGAATC TTCCCAGTAG GCGGCGCGGG AGGAAAAGAG GATTGAGGGG 60 

CTAGGCCGGG CGGATCCCGT CCTCCCCCGA TGTGAGCAGT TTTCCGAAAC CCCGTCAGGC 120 

GAAGGCTGCC CAGAGAGGTG GAGTCGGTAG CGGGGCCGGG AACATGAGGC AGTCTCTCCT 180 

ATTCCTGACC AGCGTGGTTC CTTTCGTGCT GGCGCCGCGA CCTCCGGATG ACCCGGGCTT 240 

CGGCCCCCAC CAGAGACTCG AGAAGCTTGA TTCTTTGCTC TCAGACTACG ATATTCTCTC 300 

TTTATCTAAT ATCCAGCAGC ATTCGGTAAG AAAAAGAGAT CTACAGACTT CAACACATGT 3 60 

AGAAACACTA CTAACTTTTT CAGCTTTGAA AAGGCATTTT AAATTATACC TGACATCAAG 420 

TACTGAACGT TTTTCACAAA ATTTCAAGGT CGTGGTGGTG GATGGTAAAA ACGAAAGCGA 480 

GTACACTGTA AAATGGCAGG ACTTCTTCAC TGGACACGTG GTTGGTGAGC CTGACTCTAG 540 

GGTTCTAGCC CACATAAGAG ATGATGATGT TATAATCAGA ATCAACACAG ATGGGGCCGA 600 

ATATAACATA GAGCCACTTT GGAGATTTGT TAATGATACC AAAGACAAAA GAATGTTAGT 660 

TTATAAATCT GAAGATATCA AG AATGTTT C ACGTTTGCAG TCTCCAAAAG TGTGTGGTTA 720 

TTTAAAAGTG GATAATGAAG AGTTGCTCCC AAAAGGGTTA GTAGACAGAG AACCACCTGA 780 

AGAGCTTGTT CAT CGAGTGA AAAGAAGAGC TGACCCAGAT CCCATGAAGA ACACGTGTAA 840 

ATTATTGGTG GTAGCAGATC ATCGCTTCTA CAGATACATG GGCAGAGGGG AAGAGAGTAC 900 

AACTACAAAT TACTTAATAG AG CTAATTG A CAGAGTTGAT GACATCTATC GGAACACTTC 960 

ATGGGATAAT GCAGGTTTTA AAGGCTATGG AATACAGATA GAGCAGATTC GCATTCTCAA 1020 

GTCTCCACAA GAGGTAAAAC CTGGTGAAAA GCACTACAAC ATGGCAAAAA GTTACCCAAA 1080 

TGAAGAAAAG GATGCTTGGG ATGTGAAGAT GTTGCTAGAG CAATTTAGCT TTGATATAGC 1140 

TGAGGAAGCA TCTAAAGTTT GCTTGGCACA CCTTTTCACA TACCAAGATT T TGATATGGG 1200 

AACTCTTGGA TTAGCTTATG TTGGCTCTCC CAG AG CAAAC AGCCATGGAG GTGTTTGTCC 1260 

AAAGGCTTAT TATAGCCCAG TTGGGAAGAA AAATATCTAT TTGAATAGTG GTTTGACGAG 1320 

CACAAAGAAT TATGGTAAAA CCATCCTTAC AAAGGAAGCT GACCTGGTTA CAACTCATGA 13 80 

ATTGGGACAT AATTTTGGAG CAGAACATGA TCCGGATGGT CTAGCAGAAT GTGCCCCGAA 1440 
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TGAGGACCAG GGAGGGAAAT ATGTCATGTA TCCCATAGCT GTGAGTGGCG ATCACGAGAA 1500 

CAATAAGATG TTTTCAAACT GCAGTAAACA ATCAATCTAT AAGACCATTG AAAGTAAGGC 1560 

CCAGGAGTGT TTTCAAGAAC GCAG CAATAA AGTTTGTGGG AACTCGAGGG TGGATGAAGG 1620 

AGAAGAGTGT GATCCTGGCA TCATGTATCT GAACAACGAC ACCTGCTGCA ACAGCGACTG 1680 

CACGTTGAAG GAAGGTGTCC AGTGCAGTGA CAGGAACAGT CCTTGCTGTA AAAACTGTCA 1740 

GTTTGAGACT GCCCAGAAGA AGTGCCAGGA GGCGATTAAT GCTACTTGCA AAGGCGTGTC 180 0 

CTACTGCACA GGTAATAGCA GTGAGTGCCC GCCTCCAGGA AATGCTGAAG ATGACACTGT 18 60 

TTGCTTGGAT CTTGGCAAGT GTAAGGATGG GAAATGCATC CCTTTCTGCG AGAGGGAACA 192 0 

GCAGCTGGAG TCCTGTGCAT GTAATGAAAC TGACAACTCC TGCAAGGTGT GCTGCAGGGA 1980 

CCTTTCCGGC CGCTGTGTGC CCTATGTCGA TGCTGAACAA AAGAACTTAT TTTTGAGGAA 2 040 

AGGAAAGCCC TGTACAGTAG GATTTTGTGA CATGAATGGC AAATGTGAGA AACGAGTACA 2100 

GGATGTAATT GAACGATTTT GGGATTTCAT TGACCAGCTG AGCATCAATA CTTTTGGAAA 2160 

GTTTTTAGCA GACAACATCG TTGGGTCTGT CCTGGTTTTC TCCTTGATAT TTTGGATTCC 2220 

TTTCAGCATT CTTGTCCATT GTGTGTAACG TCGAAATGCT GAGCAGCATG GATTCTGCAT 22 80 

CGGTTCGCAT TATCAAACCC TTTCCTGCGC CCCAGACTCC AGGCCGCCTG CAGCCTGCCC 2340 

CTGTGATCCC TTCGGCGCCA GCAGCTCCAA AACTGGACCA CCAGAGAATG GACACCATCC 2400 

AGGAAGACCC CAGCACAGAC TCACATATGG ACGAGGATGG GTTTGAGAAG GACCCCTTCC 2460 

CAAATAGCAG CACAGCTGCC AAGT CATTTG AGGATCTCAC GGACCATCCG GTCACCAGAA 2520 

GTGAAAAGGC TGCCTCCTTT AAACTGCAGC GTCAGAATCG TGTTGACAGC AAAGAAACAG 25 80 

AGTGCTAATT TAGTTCTCAG CTCTTCTGAC TTAAGTGTGC AAAATATTTT TATAGATTTG 2 640 

ACCTACAATC AATCACAGCT TATATTTTGT GAAGACTGGG AAGTG ACT T A GCAGATGCTG 2700 

GTCATGTGTT TGAACTTCCT GCAGGTAAAC AGTTCTTGTG TGGTTTGGCC CTTCTCCTTT 2 760 

TGAAAAGGTA AGGTGAAGGT GAATCTAGCT TATTTTGAGG CTTTCAGGTT TTAGTTTTTA 2 820 

AAATATCTTT TGACCTGTGG TGCAAAAGCA GAAAATACAG CTGGATTGGG TTATGAGTAT 2 880 

TTACGTTTTT GTAAATTAAT CTTTTATATT GATAACAGGC ACTGACTAGG GAAATGATCA 2 940 

GTTTTTTTTT ATACACTGTA ATGAACCGCT GAATATGAAG CATTTGGCAT TTATTTGTGA 3 000 

GAAAAGTGGA ATAGTTTTTT TTTTTTTTTT TTTTTTTTGC CTTCAACTAA AAACAAAGGA 3060 

GATAAATTTA GTATACATTG TATCTAAATT GTGGGTCTAT TTCTAGTTAT TACCCAGAGT 312 0 

TTTTATGTAG CAGGGAAAAT ATATATCTAA ATTTAGAAAT CATTTGGGTT AATATGGCTC 3180 

TTCATAATTC TAAGACTAAT GCT CAGAACC TAACCACTAC CTTACAGTGA GGGCTATACA 3 2 40 

TGGTAG CCAG TTGAATTTAT GGAAT CTACC AACTGTTTAG GGCCCTGATT TGCTGGGCAG 3300 

TTTTTCTGTA TTTTATAAGT ATCTTCATGT ATCCCTGTTA CTGATAGGGA TACATGTCTT 33 60 

AGAAAATTCA CTATTGGCTG GGAGTGGTGG CTCATGCCTG TAATCCCAGC ACTTGGAGAG 3420 
3421 GCTGAGGTTG CGCCACTACA CTCCAGCCTG GGTGACAGAG TGAGATCTGC CTC 

Seq ID NO: 557 Protein sequence 
Protein Accession #: NP_068604.1 

1 11 21 31 41 51 

I I I I I I 

MRQSLLFLTS WPFVLAPRP PDDPGFGPHQ RLEKLDSLLS DYDILSLSNI QQHSVRKRDL 60 

QTSTHVETLL TFSALKRHFK LYLTSSTERF SQNFKVWVD GKNESEYTVK WQDFFTGHW 120 

GEPDSRVLAH IRDDDVIIRI NTDGAEYNIE PLWRFVNDTK DKRMLVYKSE DIKNVSRLQS 180 

PKVCGYLKVD NEELLPKGLV DREPPEELVH RVKRRADPDP MKNTCKLLW ADHRFYRYMG 240 

RGEESTTTNY LIELIDRVDD I YRNTSWDNA GFKGYGIQIE QIRILKSPQE VKPGEKHYNM 3 00 

AKSYPNEEKD AWDVKMLLEQ FSFDIAEEAS KVCLAHLFTY QDFDMGTLGL AYVGSPRANS 3 60 

HGGVCPKAYY SPVGKKNIYL NSGLTSTKNY GKTILTKEAD LVTTHELGHN FGAEHDPDGL 420 

AECAPNEDQG GKYVMYP I AV SGDHENNKMF SNCSKQSIYK TIESKAQECF QERSNKVCGN 480 

SRVDEGEECD PGIMYLNNDT CCNSDCTLKE GVQCSDRNSP CCKNCQFETA QKKCQEAINA 540 

TCKGVSYCTG NSSECPPPGN AEDDTVCLDL GKCKDGKCIP FCEREQQLES CACNETDNSC * 60 0 

KVCCRDLSGR CVPYVDAEQK NIiFLRKGKPC TVGFCDMNGK CEKRVQDVIE RFWDFIDQLS 660 
INTFGKFLAD NIVGSVLVFS LIFWIPFSIL VHCV 

Seq ID NO: 558 DNA sequence 

Nucleic Acid Accession #: NM_004994.1 

Coding sequence: 20.. 2143 

1 11 21 31 41 51 

I I i I I i 

AGACACCTCT GCCCTCACCA TGAGCCTCTG GCAGCCCCTG GTCCTGGTGC TCCTGGTGCT 60 

GGGCTGCTGC TTTGCTGCCC CCAGACAGCG CCAGTCCACC CTTGTGCTCT TCCCTGGAGA 120 

CCTGAGAACC AATCTCACCG ACAGGCAGCT GGCAGAGGAA TACCTGTACC GCTATGGTTA 180 

CACTCGGGTG GCAGAGATGC GTGGAGAGTC GAAATCTCTG GGGCCTGCGC TGCTGCTTCT 240 

CCAGAAGCAA CTGTCCCTGC CCGAGACCGG TGAGCTGGAT AGCGCCACGC TGAAGGCCAT 3 00 

GCGAACCCCA CGGTGCGGGG TCCCAGACCT GGGCAGATTC CAAACCTTTG AGGGCGACCT 360 

CAAGTGGCAC CACCACAACA TCACCTATTG GATCCAAAAC TACT CGGAAG ACTTGCCGCG 420 

GGCGGTGATT GACGACGCCT TTGCCCGCGC CTTCGCACTG TGGAGCGCGG TGACGCCGCT 480 

CACCTTCACT CGCGTGTACA GCCGGGACGC AGACATCGTC ATCCAGTTTG GTGTCGCGGA 540 

GCACGGAGAC GGGTATCCCT TCGACGGGAA GGACGGGCTC CTGGCACACG CCTTTCCTCC 600 

TGGCCCCGGC ATTCAGGGAG ACGCCCATTT CGACGATGAC GAGTTGTGGT CCCTGGGCAA 660 

GGGCGT CGTG GTTCCAACTC GGTTTGGAAA CGCAGATGGC GCGGCCTGCC ACTTCCCCTT 720 

CATCTTCGAG GGCCGCTCCT ACTCTGCCTG CACCACCGAC GGTCGCTCCG ACGGCTTGCC 780 

CTGGTGCAGT ACCACGGCCA ACTACGACAC CGACGACCGG TTTGGCTTCT GCCCCAGCGA 840 

GAGACTCTAC ACCCGGGACG GCAATGCTGA TGGGAAACCC TGCCAGTTTC CATTCATCTT 900 

CCAAGGCCAA TCCTACTCCG CCTGCACCAC GGACGGT CGC TCCGACGGCT ACCGCTGGTG 960 

CGCCACCACC GCCAACTACG ACCGGGACAA GCTCTTCGGC TTCTGCCCGA CCCGAGCTGA 1020 

CTCGACGGTG ATGGGGGGCA ACTCGGCGGG GGAGCTGTGC GTCTTCCCCT TCACTTTCCT 1080 

GGGTAAGGAG TACTCGACCT GTACCAGCGA GGGCCGCGGA GATGGGCGCC TCTGGTGCGC 1140 

TACCACCTCG AACTTTGACA GCGACAAGAA GTGGGGCTTC TGCCCGGACC AAGGATACAG 1200 

TTTGTTCCTC GTGGCGGCGC ATGAGTTCGG CCACGCGCTG GGCTTAGATC ATTCCTCAGT 1260 

GCCGGAGGCG CTCATGTACC CTATGTACCG CTTCACTGAG GGGCCCCCCT TGCATAAGGA 1320 

CGACGTGAAT GGCATCCGGC ACCTCTATGG TCCTCGCCCT GAACCTGAGC CACGGCCTCC 1380 

AACCACCACC ACACCGCAGC CCACGGCTCC CCCGACGGTC TGCCCCACCG GACCCCCCAC 1440 

TGTCCACCCC TCAGAGCGCC CCACAGCTGG CCCCACAGGT CCCCCCTCAG CTGGCCCCAC 1500 

AGGTCCCCCC ACTGCTGGCC CTTCTACGGC CACTACTGTG CCTTTGAGTC CGGTGGACGA 1560 

TGCCTGCAAC GTGAACATCT TCGACGCCAT CGCGGAGATT GGGAACCAGC TGTATTTGTT 1620 

CAAGGATGGG AAGTACTGGC GATTCTCTGA GGGCAGGGGG AGCCGGCCGC AGGGCCCCTT 1680 
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CCTTATCGCC GACAAGTGGC CCGCGCTGCC CCGCAAGCTG GACTCGGTCT TTGAGGAGCC 1740 

GCTCTCCAAG AAGCTTTTCT TCTTCTCTGG GCGCCAGGTG TGGGTGTACA CAGGCGCGTC 180 0 

GGTGCTGGGC CCGAGGCGTC TGGACAAGCT GGGCCTGGGA GCCGACGTGG CCCAGGTGAC 1860 

CGGGGCCCTC CGGAGTGGCA GGGGGAAGAT GCTGCTGTTC AGCGGGCGGC GCCTCTGGAG 192 0 

GTTCGACGTG AAGGCGCAGA TGGTGGATCC CCGGAGCGCC AGCGAGGTGG ACCGGATGTT 1980 

CCCCGGGGTG CCTTTGGACA CGCACGACGT CTTCCAGTAC CGAGAGAAAG CCTATTTCTG 2040 

CCAGGACCGC TTCTACTGGC GCGTGAGTTC CCGGAGTGAG TTGAACCAGG TGGACCAAGT 2100 

GGGCTACGTG ACCTATGACA TCCTGCAGTG CCCTGAGGAC TAGGGCTCCC GTCCTGCTTT 2160 

GCAGTGCCAT GTAAATCCCC ACTGGGACCA ACCCTGGGGA AGGAGCCAGT TTGCCGGATA 2220 

CAAACTGGTA TTCTGTTCTG GAGGAAAGGG AGGAGTGGAG GTGGGCTGGG CCCTCTCTTC 2280 
TCACCTTTGT TTTTTGTTGG AGTGTTTCTA ATAAACTTGG ATTCTCTAAC CTTT 

Seq ID NO: 559 Protein sequence 
Protein Accession #: NP_004985.1 

1 11 21 31 41 51 

I I I I I I 

MSLWQPLVLV LLVLGCCFAA PRQRQSTLVL FPGDLRTNLT DRQLAEEYLY RYGYTRVAEM 60 

RGESKSLGPA LLLLQKQLSL PETGELDSAT LKAMRTPRCG VPDLGRFQTF EGDLKWHHHN 120 

ITYWIQNYSE DLPRAVIDDA FARAFALWSA VTPLTFTRVY SRDADIVIQF GVAEHGDGYP 180 

FDGKDGLLAH AFPPGPGIQG D AH FDDDELW SLGKGVWPT RFGNADGAAC HFPFIFEGRS 2 40 

YSACTTDGRS DGLPWCSTTA NYDTDDRFGF CPSERLYTRD GNADGKPCQF PFIFQGQSYS 3 00 

ACTTDGRSDG YRWCATTANY DRDKDFGFCP TRAD S TVMGG NSAGELCVFP FTFLGKEYST 3 60 

CTSEGRGDGR LWCATTSNFD SDKKWGFCPD QGYSLFLVAA HEFGHALGLD HSSVPEALMY 42 0 

PMYRFTEGPP LHKDDVNG1R HLYGPRPEPE PRPPTTTTPQ PTAPPTVCPT GPPTVHPSER 480 

PTAGPTGPPS AGPTGPPTAG PSTATTVPLS PVDDACNVNI FDAIAEIGNQ LYLFKDGKYW 540 

RFSEGRGSRP QGPFLIADKW PALPRKLDSV FEEPLSKKLF FFSGRQVWVY TGASVLGPRR 600 

LDKLGLGADV AQVTGALRSG RGKMLLFSGR RLWRFDVKAQ MVDPRSASEV DRMFPGVPLD 660 
THDVFQYREK AYFCQDRFYW RVSSRSELNQ VDQVGYVTYD ILQCPED 

Seq ID NO: 560 DNA sequence 

Nucleic Acid Accession #: NM_000213.1 

Coding sequence: 127.. 5385 

1 11 21 31 41 51 

111)11 

CGCCCGCGCG CTGCAGCCCC ATCTCCTAGC GGCAGCCCAG GCGCGGAGGG AGCGAGTCCG 60 

CCCCGAGGTA GGTCCAGGAC GGGCGCACAG CAGCAGCCGA GGCTGGCCGG GAGAGGGAGG 12 0 

AAGAGGATGG CAGGGCCACG CCCCAGCCCA TGGGCCAGGC TGCTCCTGGC AGCCTTGATC 180 

AGCGTCAGCC TCTCTGGGAC CTTGGCAAAC CGCTGCAAGA AGGCCCCAGT GAAGAGCTGC 240 

ACGGAGTGTG TCCGTGTGGA TAAGGACTGC GCCTACTGCA CAGACGAGAT GTT CAGGGAC 300 

CGGCGCTGCA ACACCCAGGC GGAGCTGCTG GCCGCGGGCT GCCAGCGGGA GAGCATCGTG 360 

GT CATGG AG A GCAGCTTCCA AATCACAGAG GAGACCCAGA TTGACACCAC CCTGCGGCGC 420 

AGCCAGATGT CCCCCCAAGG CCTGCGGGTC CGTCTGCGGC CCGGTGAGGA GCGGCATTTT 480 

GAGCTGGAGG TGTTTGAGCC ACTGGAGAGC CCCGTGGACC TGTACATCCT CATGGACTTC 540 

TCCAACTCCA TGTCCGATGA TCTGGACAAC CT CAAGAAG A TGGGGCAGAA CCTGGCTCGG 600 

GTCCTGAGCC AGCTCACCAG CGACTACACT ATTGGATTTG GCAAGTTTGT GGACAAAGTC 660 

AGCGTCCCGC AGACGGACAT GAGGCCTGAG AAGCTGAAGG AGCCCTGGCC CAACAGTGAC 720 

CCCCCCTTCT CCTTCAAGAA CGTCATCAGC CTGACAGAAG ATGTGGATGA GTTCCGGAAT 780 

AAACTGCAGG GAGAGCGGAT CTCAGGCAAC CTGGATGCTC CTGAGGGCGG CTTCGATGCC 840 

ATCCTGCAGA CAGCTGTGTG CACGAGGGAC ATTGGCTGGC GCCCGGACAG CACCCACCTG 900 

CTGGTCTTCT CCACCGAGTC AGCCTTCCAC TATGAGGCTG ATGGCGCCAA CGTGCTGGCT 960 

GGCAT CATGA GCCGCAACGA TGAACGGTGC CACCTGGACA CCACGGGCAC CTACACCCAG 102 0 

TACAGGACAC AGGACTACCC GTCGGTGCCC ACCCTGGTGC GCCTGCTCGC CAAGCACAAC 1080 

ATCATCCCCA TCTTTGCTGT CACCAACTAC TCCTATAGCT ACTACGAGAA GCTTCACACC 1140 

T AT TTCCCTG TCTCCTCACT GGGGGTGCTG CAGGAGGACT CGTCCAACAT CGTGGAGCTG 12 00 

CTGGAGGAGG CCTTCAATCG GATCCGCTCC AACCTGGACA TCCGGGCCCT AGACAGCCCC 12 60 

CGAGGCCTTC GGACAGAGGT CACCTCCAAG ATGTTCCAGA AGACGAGGAC TGGGTCCTTT 132 0 

CACATCCGGC GGGGGGAAGT GGGTATATAC CAGGTGCAGC TGCGGGCCCT TGAGCACGTG 13 80 

GATGGGACGC ACGTGTGCCA GCTGCCGGAG G AC CAGAAGG GCAACATCCA TCTGAAACCT 1440 

TCCTTCTCCG ACGGCCTCAA GATGGACGCG GGCATCATCT GTGATGTGTG CACCTGCGAG 150 0 

CTGCAAAAAG AGGTGCGGTC AGCTCGCTGC AGCTTCAACG GAGACTTCGT GTGCGGACAG 1560 

TGTGTGTGCA GCGAGGGCTG GAGTGGCCAG ACCTGCAACT GCTCCACCGG CTCTCTGAGT 162 0 

GACATTCAGC CCTGCCTGCG GGAGGGCGAG GACAAGCCGT GCTCCGGCCG TGGGGAGTGC 1680 

CAGTGCGGGC ACTGTGTGTG CTACGGCGAA GGCCGCTACG AGGGTCAGTT CTGCGAGTAT 1740 

GACAACTTCC AGTGTCCCCG CACTTCCGGG TTCCTCTGCA ATGACCGAGG ACGCTGCTCC 180 0 

ATGGGCCAGT GTGTGTGTGA GCCTGGTTGG ACAGGCCCAA GCTGTGACTG TCCCCTCAGC 1860 

AATGCCACCT GCATCGACAG CAATGGGGGC ATCTGTAATG GACGTGGCCA CTGTGAGTGT 192 0 

GGCCGCTGCC ACTGCCACCA GCAGTCGCTC TACACGGACA CCATCTGCGA GAT CAACTAC 1980 

TCGGCGATCC ACCCGGGCCT CTG CGAGGAC CTACGCTCCT GCGTGCAGTG CCAGGCGTGG 2 040 

GGCACCGGCG AGAAGAAGGG • GCGCACGTGT GAGGAATGCA ACTTCAAGGT CAAGATGGTG 2100 

GACGAGCTTA AGAGAGCCGA GGAGGTGGTG GTGCGCTGCT CCTTCCGGGA CGAGGATGAC 2160 

GACTGCACCT ACAGCTACAC CATGGAAGGT GACGGCGCCC CTGGGCCCAA CAGCACTGTC 222 0 

CTGGTGCACA AGAAGAAGGA CTGCCCTCCG GGCTCCTTCT GGTGGCTCAT CCCCCTGCTC 2280 

CTCCTCCTCC TGCCGCTCCT GGCCCTGCTA CTGCTGCTAT GCTGGAAGTA CTGTGCCTGC 2340 

TGCAAGGCCT GCCTGGCACT TCTCCCGTGC TGCAACCGAG GTCACATGGT GGGCTTTAAG 2400 

GAAGACCACT ACATGCTGCG GGAGAACCTG ATGGCCTCTG ACCACTTGGA CACGCCCATG 2460 

CTGCGCAGCG GGAACCTCAA GGGCCGTGAC GTGGTCCGCT GGAAGGTCAC CAACAACATG 252 0 

CAGCGGCCTG GCTTTGCCAC TCATGCCGCC AGCATCAACC CCACAGAGCT GGTGCCCTAC 2580 

GGGCTGTCCT TGCGCCTGGC CCGCCTTTGC ACCGAGAACC TGCTGAAGCC TGACACTCGG 2 640 

GAGTGCGCCC AGCTGCGCCA GGAGGTGGAG GAGAACCTGA ACGAGGTCTA CAGGCAGATC 2700 

TCCGGTGTAC ACAAGCTCCA GCAGACCAAG TTCCGGCAGC AGCCCAATGC CGGGAAAAAG 2760 

CAAGACCACA CCATTGTGGA CACAGTGCTG ATGGCGCCCC GCTCGGCCAA GCCGGCCCTG 282 0 

CTGAAGCTTA CAGAGAAGCA GGTGGAACAG AGGGCCTTCC ACGACCTCAA GGTGGCCCCC 2880 

GGCTACTACA CCCTCACTGC AGACGAGGAC GCCCGGGGCA TGGTGGAGTT CCAGGAGGGC 2 940 

GTGGAGCTGG TGGACGTACG GGTGCCCCTC TTTATCCGGC CTGAGGATGA CGACGAGAAG 3000 

CAGCTGCTGG TGGAGGCCAT CGACGTGCCC GCAGGCACTG CCACCCTCGG CCGCCGCCTG 3 0 60 
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GTAAACATCA CCATCATCAA GGAGCAAGCC AGAGACGTGG TGTCCTTTGA GCAGCCTGAG 3120 

TTCTCGGTCA GCCGCGGGGA CCAGGTGGCC CGCATCCCTG TCATCCGGCG TGTCCTGGAC 3180 

GGCGGGAAGT CCCAGGTCTC CTACCGCACA CAGGATGGCA CCGCGCAGGG CAACCGGGAC 3240 

TACATCCCCG TGGAGGGTGA GCTGCTGTTC CAGCCTGGGG AGGCCTGGAA AGAGCTGCAG 3300 

GTGAAGCTCC TGGAGCTGCA AGAAGTTGAC TCCCTCCTGC GGGGCCGCCA GGTCCGCCGT 3360 

TTCCACGTCC AGCTCAGCAA CCCTAAGTTT GGGGCCCACC TGGGCCAGCC CCACTCCACC 3420 

ACCATCATCA TCAGGGACCC AGATGAACTG GACCGGAGCT TCACGAGTCA GATGTTGTCA 3480 

TCACAGCCAC CCCCTCACGG CGACCTGGGC GCCCCGCAGA ACCCCAATGC TAAGGCCGCT 3540 

GGGTCCAGGA AGATCCATTT CAACTGGCTG CCCCCTTCTG GCAAGCCAAT GGGGTACAGG 3600 

GTAAAGTACT GGATTCAGGG TGACTCCGAA TCCGAAGCCC ACCTGCTCGA CAGCAAGGTG 3 660 

CCCTCAGTGG AGCT CACCAA CCTGTACCCG TATTGCGACT ATGAGATGAA GGTGTGCGCC 3720 

TACGGGGCTC AGGGCGAGGG ACCCTACAGC TCCCTGGTGT CCTGCCGCAC CCACCAGGAA 3780 

GTGCCCAGCG AGCCAGGGCG TCTGGCCTTC AATGT CGTCT CCTCCACGGT GACCCAGCTG 3840 

AGCTGGGCTG AGCCGGCTGA GACCAACGGT GAGATCACAG CCTACGAGGT CTGCTATGGC 3900 

CTGGTCAACG ATGACAACCG ACCTATTGGG CCCATGAAGA AAGTGCTGGT TGACAACCCT 3960 

AAGAACCGGA TGCTGCTTAT TGAGAACCTT CGGGAGTCCC AGCCCTACCG CTACACGGTG 402 0 

AAGGCGCGCA ACGGGG CCGG CTGGGGGCCT GAGCGGGAGG CCATCATCAA CCTGGCCACC 4080 

CAGCCCAAGA GGCCCATGTC CATCCCCATC ATCCCTGACA TCCCTATCGT GGACGCCCAG 4140 

AGCGGGGAGG ACTACGACAG CTTCCTTATG TACAGCGATG ACGTTCTACG CTCTCCATCG 420 0 

GGCAGCCAGA GGCCCAGCGT CTCCGATGAC ACTGAGCACC TGGTGAATGG CCGGATGGAC 4260 

TTTGCCTTCC CGGGCAGCAC CAACTCCCTG CACAGGATGA CCACGACCAG TGCTGCTGCC 4320 

TATGGCACCC ACCTGAGCCC ACACGTGCCC CACCGCGTGC TAAG CACATC CTCCACCCTC 4380 

ACACGGGACT ACAACTCACT GACCCGCTCA GAACACTCAC ACTCGACCAC ACTGCCGAGG 4440 

GACTACTCCA CCCTCACCTC CGTCTCCTCC CACGACTCTC GCCTGACTGC TGGTGTGCCC 45 00 

GACACGCCCA CCCGCCTGGT GTTCTCTGCC CTGGGGCCCA CATCTCTCAG AGTGAGCTGG 4560 

CAGGAGCCGC GGTGCGAGCG GCCGCTGCAG GGCTACAGTG TGGAGTACCA GCTGCTGAAC 462 0 

GGCGGTGAGC TGCATCGGCT CAACATCCCC AACCCTGCCC AGACCTCGGT GGTGGTGGAA 4680 

GACCTCCTGC CCAACCACTC CTACGTGTTC CGCGTGCGGG CCCAGAGCCA GGAAGGCTGG 4740 

GGCCGAGAGC GTGAGGGTGT CATCACCATT GAATCCCAGG TGCACCCGCA GAGCCCACTG 4800 

TGTCCCCTGC CAGGCTCCGC CTTCACTTTG AGCACTCCCA GTGCCCCAGG CCCGCTGGTG 4860 

TTCACTGCCC TGAGCCCAGA CTCGCTGCAG CTGAGCTGGG AGCGGCCACG GAGGCCCAAT 492 0 

GGGGATATCG TCGGCTACCT GGTGACCTGT GAGATGGCCC AAGGAGGAGG GCCAGCCACC 4980 

GCATTCCGGG TGGATGGAGA CAGCCCCGAG AGCCGGCTGA CCGTGCCGGG CCTCAGCGAG 5040 

AACGTGCCCT ACAAGTT CAA GGTGCAGGCC AGGACCACTG AGGGCTTCGG GCCAGAGCGC 5100 

GAGGGCATCA TCACCATAGA GTCCCAGGAT GGAGGACCCT TCCCGCAGCT GGGCAGCCGT 5160 

GCCGGGCTCT TCCAGCACCC GCTGCAAAGC GAGTACAGCA GCATCACCAC CACCCACACC 5220 

AGCGCCACCG AGCCCTTCCT AGTGGATGGG CCGACCCTGG GGGCCCAGCA CCTGGAGGCA 5280 

GGCGGCTCCC TCACCCGGCA TGTGACCCAG GAGTTTGTGA GCCGGACACT GACCACCAGC 5340 

GGAACCCTTA GCACCCACAT GGACCAACAG TTCTTCCAAA CTTGACCGCA CCCTGCCCCA 5400 

CCCCCGCCAT GTCCCACTAG GCGTCCTCCC GACTCCTCTC CCGGAGCCTC CTCAGCTACT 5460 

CCATCCTTGC ACCCCTGGGG GCCCAGCCCA CCCGCATGCA CAGAGCAGGG GCTAGGTGTC 552 0 

TCCTGGGAGG CATGAAGGGG GCAAGGTCCG TCCTCTGTGG GCCCAAACCT ATTTGTAACC 5580 

AAAGAGCTGG GAGCAGCACA AGGACCCAGC CTTTGTTCTG CACTTAATAA ATGGTTTTGC 5640 
TACTG 

Seq ID NO: 561 Protein sequence 
Protein Accession #: NP_000204.1 

1 11 21 31 41 51 

i I I I' I I 

MAGPRPSPWA RLLLAALISV SLSGTLANRC KKAPVKSCTE CVRVDKDCAY CTDEMFRDRR 60 
CNTQAELLAA GCQRESIWM ESSFQITEET QIDTTLRRSQ MSPQGLRVRL RPGEERHFEIi 120 
EVFEPLESPV DLYILMDFSN SMSDDLDNLK KMGQNLARVL SQLTSDYTIG FGKFVDKVSV 180' 
PQTDMRPEKL KEPWPNSDPP FSFKNVISLT EDVDEFRNKL QGERISGNLD APEGGFDAIL 240 
QTAVCTRDIG WRPDSTHLLV FSTESAFHYE ADGANVLAGI MSRNDERCHL DTTGTYTQYR 3 00 
TQDYPSVPTL VRLLAKHNI I PIFAVTNYSY SYYEKLHTYF PVSSLGVLQE DSSNIVELLE 3 60 
EAFNRIRSNL DIRALDSPRG LRTEVTSKMF QKTRTGSFHI RRGEVGIYQV QLRALEHVDG 420 
THVCQLPEDQ KGNIHLKPSF SDGLKMDAGI ICDVCTCELQ KE VRS ARCS F NGDFVCGQCV 48 0 
CSEGWSGQTC NCSTGSLSDI QPCLREGEDK PCSGRGECQC GHCVCYGEGR YEGQFCEYDN 540 
FQCPRTSGFL CNDRGRCSMG QCVCEPGWTG PSCDCPLSNA TCIDSNGGIC NGRGHCECGR 60 0 
CHCHQQSLYT DTICEINYSA IHPGLCEDLR SCVQCQAWGT GEKKGRTCEE CNFKVKMVDE 660 
LKRAEEVWR CSFRDEDDDC TYSYTMEGDG APGPNSTVLV HKKKDCPPGS FWWLIPLLLL 72 0 
LLPLLALLLL LCWKYCACCK ACLALLPCCN RGHMVGFKED HYMLRENLMA SDHLDTPMLR 780 
SGNLKGRDW RWKVTNNMQR PGFATHAASI NPTELVPYGL SLRLARLCTE NLLKPDTREC 840 
AQLRQEVEEN LNEVYRQISG VHKLQQTKFR QQPNAGKKQD HTIVDTVLMA PRSAKPALLK 90 0 
LTEKQVEQRA FHDLKVAPGY YTLTADQDAR GMVEFQEGVE LVDVRVPLFI RPEDDDEKQL 960 

LVEAIDVPAG TATLGRRLVN ITIIKEQARD WSFEQPEFS VSRGDQVARI PVIRRVLDGG 102 0 

KSQVSYRTQD GTAQGNRDYI PVEGELLFQP GEAWKELQVK LLELQEVDSL LRGRQVRRFH 10 80 

VQLSNPKFGA HLGQPHSTTI IIRDPDELDR SFTSQMLSSQ PPPHGDLGAP QNPNAKAAGS 1140 

RKIHFNWLPP SGKPMGYRVK YWIQGDSESE AHLLDSKVPS VELTNLYPYC DYEMKVCAYG 12 00 

AQGEGPYSSL VSCRTHQEVP SEPGRLAFNV VSSTVTQLSW AEPAETNGEI TAYEVCYGLV 12 60 

NDDNRPIGPM KKVLVDNPKN RMLLIENLRE SQPYRYTVKA RNGAGWGPER EAIINLATQP 1320 

KRPMSIPIIP DIPIVDAQSG EDYDSFLMYS DDVLRSPSGS QRPSVSDDTE HLVNGRMDFA 13 80 

FPGSTNSLHR MTTTSAAAYG THLSPHVPHR VLSTSSTLTR DYNSLTRSEH SHSTTLPRDY 1440 

STLTSVSSHD SRLTAGVPDT PTRLVFSALG PTSLRVSWQE PRCERPLQGY SVEYQIiLNGG 1500 

ELHRLNIPNP AQTSVWEDL LPNHSYVFRV RAQSQEGWGR EREGVITIES QVHPQSPLCP 15 60 

LPGSAFTLST PSAPGPLVFT ALSPDSLQLS WERPRRPNGD IVGYLVTCEM AQGGGPATAF 1620 

RVDGDSPESR LTVPGLSENV PYKFKVQART TEGFGPEREG IITIESQDGG PFPQLGSRAG 1680 

LFQHPLQSEY SSITTTHTSA TEPFLVDGPT LGAQHLEAGG SLTRHVTQEF VSRTLTTSGT 1740 
LSTHMDQQFF QT 

Seq ID NO: 562 DNA sequence 

Nucleic Acid Accession #: NM_013332.1 

Coding sequence : 1 . . 63 

1 11 21 31 41 51 

1 I I I I i 



399 



WO 02/086443 

GCACGAGGGC GCTTTTGTCT CCGGTGAGTT TTGTGGCGGG AAGCTTCTGC GCTGGTGCTT 
AGTAACCGAC TTTCCTCCGG ACTCCTGCAC GACCTGCTCC TACAGCCGGC GATCCACTCC 
CGGCTGTTCC CCCGGAGGGT CCAGAGGCCT TTCAGAAGGA GAAGGCAGCT CTGTTTCTCT 
GCAGAGGAGT AGGGTCCTTT CAGCCATGAA GCATGTGTTG AACCTCTACC TGTTAGGTGT 
GGTACTGACC CTACTCTCCA TCTTCGTTAG AGTGATGGAG TCCCTAGAAG GCTTACTAGA 
GAGCCCATCG CCTGGGACCT CCTGGACCAC CAGAAGCCAA CTAGCCAACA CAGAGCCCAC 
CAAGGGCCTT CCAGACCATC CATCCAGAAG CATGTGATAA GACCTCCTTC CATACTGGCC 
ATATTTTGGA ACACTGACCT AGACATGTCC AGATGGGAGT CCCATTCCTA GCAGACAAGC 
TGAGCACCGT TGTAACCAGA GAACTATTAC TAGGCCTTGA AGAACCTGTC TAACTGGATG 
CTCATTGCCT GGGCAAGGCC TGTTTAGGCC GGTTGCGGTG GCTCATGCCT GTAATCCTAG 
CACTTTGGGA GGCTGAGGTG GGTGGAT C AC CTGAGGTCAG GAGTTCGAGA CCAGCCTCGC 
CAACATGGCG AAACCCCATC TCTACTAAAA ATACAAAAGT TAGCTGGGTG TGGTGGCAGA 
GGCCTGTAAT CCCAGTTCCT TGGGAGGCTG AGGCGGGAGA ATTGCTTGAA CCCGGGGACG 
GAGGTTGCAG TGAACCGAGA TCGCACTGCT GTACCCAGCC TGGGCCACAG TGCAAGACTC 
CATCTCAAAA AAAAAAAGAA AAGAAAAAGC CTGTTTAATG CACAGGTGTG AGTGGATTGC 
TTATGGCTAT GAGATAGGTT GATCTCGCCC TTACCCCGGG GTCTGGTGTA TGCTGTGCTT 
TCCTCAGCAG TATGGCTCTG ACATCTCTTA GATGTCCCAA CTTCAGCTGT TGGGAGATGG 
TGATATTTTC AACCCTACTT CCTAAACATC TGTCTGGGGT TCCTTTAGTC TTGAATGTCT 
TATGCTCAAT TATTTGGTGT TGAGCCTCTC TTCCACAAGA GCTCCTCCAT GTTTGGATAG 
CAGTTGAAGA GGTTGTGTGG GTGGGCTGTT GGGAGTGAGG ATGGAGTGTT CAGTGCCCAT 
TTCTCATTTT ACATTTTAAA GTCGTTCCTC CAACATAGTG TGTATTGGTC TGAAGGGGGT 
GGTGGGATGC CAAAGCCTGC TCAAGTTATG GACATTGTGG CCACCATGTG GCTTAAATGA 
TTTTTTCTAA CTAATAAAGT GGAATATATA TTTCAAAAAA AAAAAAAAAA AA 

Seq ID NO: 563 Protein sequence 
Protein Accession #: NP_037464.1 

1 11 21 31 41 51 

I I I I I l 

MKHVLNLYLIj GWLTLLSIF VRVMESLEGL LESPSPGTSW TTR SQL ANTE PTKGLPDHPS 
RSM 



Seq ID NO: 564 DNA sequence 

Nucleic Acid Accession #: NM_023 915.1 

Coding sequence: 250.. 1326 



1 11 21 31 41 51 

I I I I I 1 

GGCACGAGGG TTTCGTTTTC ATGCTTTACC AGAAAATCCA CTTCCCTGCC GACCT TAGTT 
TCAAAGCTTA TTCTTAATTA GAGACAAGAA ACCTGTTTCA ACTTGAAGAC ACCGTATGAG 
GTGAATGGAC AGCCAGCCAC CACAATGAAA GAAATCAAAC CAGGAATAAC CTATGCTGAA 
CCCACGCCTC AATCGTCCCC AAGTGTTTCC TGACACGCAT CTTTGCTTAC AGTGCATCAC 
AACTGAAGAA TGGGGTTCAA CTTGACGCTT GCAAAATTAC CAAATAACGA GCTGCACGGC 
CAAGAGAGTC ACAATTCAGG CAACAGGAGC GACGGGCCAG GAAAGAACAC CACCCTTCAC 
AATGAATTTG ACACAATTGT CTTGCCGGTG CTTTATCTCA TTATATTTGT GGCAAGCATC 
TTGCTGAATG GTTTAG CAGT GTGGATCTTC TTCCACATTA GGAATAAAAC CAGCTTCATA 
TTCTATCTCA AAAACATAGT GGTTGCAGAC CT CAT AATGA CGCTGACATT TCCATTTCGA 
ATAGTCCATG ATGCAGGATT TGGACCTTGG TACTTCAAGT TTATTCTCTG CAGATACACT 
TCAGTTTTGT TTTATGCAAA CATGTATACT TCCATCGTGT TCCTTGGGCT GATAAGCATT 
GAT CGCTAT C TGAAGGTGGT CAAGCCATTT GGGGACTCTC GGATGTACAG CATAACCTTC 
ACGAAGGTTT TATCTGTTTG TGTTTGGGTG ATCATGGCTG TTTTGTCTTT GCCAAACATC 
ATCCTGACAA ATGGT CAGCC AACAGAGGAC AATATCCATG ACTGCTCAAA ACTTAAAAGT 
CCTTTGGGGG TCAAATGGCA TACGGCAGTC ACCTATGTGA ACAGCTGCTT GTTTGTGGCC 
GTGCTGGTGA TTCTGATCGG ATGTTACATA GCCATATCCA GGTACATCCA CAAATCCAGC 
AGGCAATTCA TAAGTCAGTC AAGCCGAAAG CGAAAACATA ACCAGAGCAT CAGGGTTGTT 
GTGGCTGTGT TTTTTACCTG CTTTCTACCA TATCACTTGT GCAGAATTCC TTTTACTTTT 
AGTCACTTAG ACAGGCTTTT AGATGAATCT GCACAAAAAA TCCTATATTA CTGCAAAGAA 
ATTACACTTT TCTTGTCTGC GTGTAATGTT TGCCTGGATC CAATAATTTA CTTTTTCATG 
TGTAGGTCAT TTTCAAGAAG GCTGTTCAAA AAATCAAATA TCAGAACCAG GAGTGAAAGC 
ATCAGATCAC TGCAAAGTGT GAGAAGATCG GAAGTTCGCA TATATTATGA TTACACTGAT 
GTGTAGGCCT TTTATTGTTT GTTGGAATCG ATATGTACAA AGTGTAAATA AATGTTTCTT 
TTCATTATCC TTAAAAAAAA AA 



Seq ID NO: 565 Protein sequence 
Protein Accession #: NP_076404 



1 11 21 31 41 51 

I I I I I I 

MGFNLTLAKL PNNELHGQES HNSGNRSDGP GKNTTLHNEF DTIVLPVLYL IIFVASILLN 
GLAVWIFFHI RNKTSFIFYL KNIWADLIM TLTFPFRIVH DAGFGPWYFK FILCRYTSVL 
FYANMYTSIV FLGLISIDRY LKWKPFGDS RMYSITFTKV LSVCVWVIMA VLSLPNIILT 
NGQPTEDNIH DCSKLiKSPLG VKWHTAVTYV NSCLFVAVLV ILIGCYIAIS RYIHKSSRQF 
ISQSSRKRKH NQSIRVWAV FFTCFLPYHL CRIPFTFSHL DRLLDESAQK ILYYCKEITL 
FLSACNVCLD PIIYFFMCRS FSRRLFKKSN IRTRSESIRS LQSVRRSEVR IYYDYTDV 



Seq ID NO: 566 DNA sequence 

Nucleic Acid Accession #: NM_005365.1 

Coding sequence: 1..94 8 

1 11 21 31 41 51 

I 1 I I I I 

ATGTCTCTCG AGCAGAGGAG TCCGCACTGC AAGCCTGATG AAGACCTTGA AGCCCAAGGA 
GAGGACTTGG GCCTGATGGG TGCACAGGAA CCCACAGGCG AGGAGGAGGA GACTACCTCC 
TCCTCTGACA GCAAGGAGGA GGAGGTGT CT GCTGCTGGGT CATCAAGTCC TCCCCAGAGT 
CCTCAGGGAG GCGCTTCCTC CTCCATTTCC GTCTACTACA CTTTATGGAG CCAATTCGAT 
GAGGGCTCCA GCAGTCAAGA AGAGGAAGAG CCAAGCTCCT CGGTCGACCC AGCTCAGCTG 
GAGTTCATGT TCCAAGAAGC ACTGAAATTG AAGGTGGCTG AGTTGGTTCA TTTCCTGCTC 



400 



WO 02/086443 

CACAAATATC GAGTCAAGGA GCCGGTCACA AAGGCAGAAA TGCTGGAGAG CGTCATCAAA 420 

AATTACAAGC GCTACTTTCC TGTGATCTTC GGCAAAGCCT CCGAGTTCAT GCAGGTGATC 480 

TTTGGCACTG ATGTGAAGGA GGTGGACCCC GCCGGCCACT CCTACATCCT TGTCACTGCT 540 

CTTGGCCTCT CGTGCGATAG CATGCTGGGT GATGGTCATA GCATGCCCAA GGCCGCCCTC 600 

5 CTGATCATTG TCCTGGGTGT GAT CCTAACC AAAGACAACT GCGCCCCTGA AGAGGTTATC 660 

TGGGAAGCGT TGAGTGTGAT GGGGGTGTAT GTTGGGAAGG AGCACATGTT CTACGGGGAG 72 0 

CCCAGGAAGC TGCTCACCCA AGATTGGGTG CAGGAAAACT ACCTGGAGTA CCGGCAGGTG 780 

CCCGGCAGTG ATCCTGCGCA CTACGAGTTC CTGTGGGGTT CCAAGGCCCA CGCTGAAACC 840 

AGCTATGAGA AGGTCATAAA TTATTTGGTC ATGCTCAATG CAAGAGAGCC CATCTGCTAC 900 
10 CCATCCCTTT ATGAAGAGGT TTTGGGAGAG GAGCAAGAGG GAGTCTGA 



PCT/US02/12476 



Seq ID NO: 567 Protein sequence 
Protein Accession #: NP_005356.1 



15 

20 
25 
30 
35 
40 
45 
50 
55 
60 



I 

MSLEQRSPHC 
PQGGASSSIS 
HKYRVKEPVT 
LGLSCDSMLG 
PRKLLTQDWV 
PSLYEEVLGE 



11 

I 

KPDEDLEAQG 
VYYTLWSQFD 
KAEMLESVIK 
DGHSMPKAAL 
QENYLEYRQV 
EQEGV 



21 
I 

EDLGLMGAQE 
EGSSSQEEEE 
NYKRYFPVIF 
LIIVLGVILT 
PGSDPAHYEF 



31 
I 

PTGEEEETTS 
PSSSVDPAQL 
GKASEFMQVI 
KDNCAPEEVI 
LWGSKAHAET 



41 
1 

SSDSKEEEVS 
EFMFQEALKL 
FGTDVKEVDP 
WEAL SVMGV Y 
SYEKVINYLV 



51 
I 

AAGSSSPPQS 
KVAELVHFLL 
AGHSYILVTA 
VGKEHMFYGE 
MLNAREPICY 



Seq ID NO: 568 DNA sequence 
Nucleic Acid Accession #: NM_014400 
Coding sequence: 86.. 1126 



GGTTACTCAT 
GACGCCAAGG 
GATCTGGACT 
GTGCTACAGC 
GAAGTGCGCG 
CGGACAATTC 
CGGCCTGGAT 
CTGCAACGCC 
ATACCCGCCC 
GGGTACATCG 
CTTCGACGGC 
CTGTGTCCAG 
TGGCTCCTGT 
CCCTCGAATC 
CACATCTGTC 
GCCAGCGCCA 
GGAGCCCAGG 
TCCTGCAAAA 
ATTGGCAGCC 
AAATTTCCCT 
CCCACCACTG 
CTTCTGCTGC 
GGGTGTTCTA 
TCCTCTTGTG 
AGGATGCTAA 
GGTGGGACAA 
ATCGGTTCCC 
CTTATGTCTG 
TTGTATAGTG 



11 

i 

CCTGGGCTCA 
GAGCAGGACG 
GCAGGCTGGC 
TGCGTGCAGA 
CCGGGCGTGG 
TCGCTGGCAG 
CTTCACGGGC 
AAGCTCAACC 
AACGGCGTGG 
CCGCCGGTCG 
AACGT CACCT 
GATGAATTCT 
TGCCAGGGGT 
CCACCCCTTG 
ACCACTTCTA 
ACCAGTCAGA 
TTGACTGGAG 
GGGGGGCCCC 
CTTCTGTTGG 
CTCACCTACT 
GACTGGGCTG 
GCTGGTTTGC 
GCTTTTTGAG 
ATGTTAGGAC 
GCTTCCTACT 
TGGCTCCCCA 
CATATGT CTT 
TGTGTGATCA 
AAAAAAAA 



21 

I 

GGTAAGAGGG 
GAGCCATGGA 
TGCTGCTGCT 
AAGCAGATGA 
ACGTCTGCAC 
TGCSGGGTTG 
TTCTGGCGTT 
TCACCTCGCG 
AGTGCTACAG 
TGAGCTGCTA 
TGACGGCAGC 
GCACTCGGGA 
CCCGCTGTAA 
TCCGGCTGCC 
CCTCGGCCCC 
CTCCGAGACA 
GCGCCGCTGG 
AGCAGCCCCA 
CCGTGGCTGC 
TCTCTGGCCC 
GCCCAGCCCC 
GGCTTTGGGA 
GACAGCTCCT 
AGAGTGAGAG 
CACTTTCTCC 
CTCTAAGCAC 
CCTTACTAGA 
GTTTCTGGCA 



31 

I 

CCCGAGCTCG 
CCCCGCCAGG 
GCTGCTTCGC 
CGGATGCTCC 
CGAGGCCGTG 
CGGTTCGGGA 
CATCCAGCTG 
GGCGCTCGAC 
CTGTGTGGGC 
CAACGCCAGC 
TAATGTGACT 
TGGAGTAACA 
CTCTGACCTC 
CCCTCCAGAG 
AGTGAGACCC 
GGGAGTAGAA 
CCACCAGGAC 
TAATAAAGGC 
TGGTGT CCT A 
TGGGTACCCC 
TGTTTTTCCA 
AATAAAATAC 
GTATCCTTCT 
AAGTCAGCTG 
TAGCCAGCCT 
TGCCTCCCCT 
CTGTGAGCTC 
CATAAATGCC 



41 
I 

GAGGCGGCAC 
AAAGCAGGTG 
GGAGGAGCGC 
CCGAACAAGA 
GGGGCGGTGG 
CTCCCCGGCA 
CAGCAATGCG 
CCGGCAGGTA 
CTGAGCCGGG 
GATCATGTCT 
GTGTCCTTGC 
GGCCCAGGGT 
CGCAACAAGA 
CCCACGACTG 
ACATCCACCA 
CACGAGGCCT 
CGCAGCAATT 
TGTGTGGCTC 
CTGTGAGCTT 
TCTTCTCATC 
ACATTCCCCA 
CGTTGTATAT 
CATCCTTGTC 
TCACGGGGAA 
GGACTTTGGA 
ACTCCCCGCA 
CTCGAGGGCA 
TCAATAAAGA 



51 
i 

ACCCAGGGGG 
CCCAGGCCAT 
AGGCCCTGGA 
TGAAGACAGT 
AGACCATCCA 
AGAATGACCG 
CTCAGGATCG 
ATGAGAGTGC 
AGGCGTGCCA 
ACAAGGGCTG 
CTGTCCGGGG 
TCACGCTCAG 
CCTACTTCTC 
TGGCCTCAAC 
CCAAACCCAT 
CCCGGGATGA 
CAGGGCAGTA 
CCACAGCTGG 
CTCCACCTGG 
ACTTCCTGTT 
GTATCCCCAG 
ATTCTGGCAG 
TCTCCGCTTG 
GGTGAGAGAG 
GCGTGGGGTG 
TCTTTGGGGA 
GGGACCGTGC 
TTTAATTACT 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



Seq ID NO: 569 Protein sequence 
Protein Accession #: NP 055215 



65 



70 



75 



80 



85 



MDPARKAGAQ 
CTEAVGAVET 
SRALDPAGNE 
AANVTVSLPV 
LPPPEPTTVA 
AGHQDRSNSG 



11 

I 

AMI WTAGWLL 
IHGQFSLAVX 
SAYPPNGVEC 
RGCVQDEFCT 
STTSVTTSTS 
QYPAKGGPQQ 



21 

I 

LLLLRGGAQA 
GCGSGLPGKN 
YSCVGLSREA 
RDGVTGPGFT 
APVRPTSTTK 
PHNKGCVAPT 



31 

I 

LECYSCVQKA 
DRGLDLHGLL 
CQGTSPPWS 
LSGSCCQGSR 
PMPAPTSQTP 
AGLAALLLAV 



41 
I 

DDGCSPNKMK 
AFIQLQQCAQ 
CYNASDHVYK 
CNSDLRNKTY 
RQGVEHEASR 
AAGVLIj 



51 

I 

TVKCAPGVDV 
DRCNAKLNLT 
GCFDGNVTLT 
FSPRIPPLVR 
DEEPRLTGGA 



Seq ID NO: 570 DNA sequence 

Nucleic Acid Accession #: NM_00532 9.1 

Coding sequence: 1..1662 



ATGCCGGTGC 
GTGCTGGGTG 
CACTACCTGT 
CTTTTTGCCT 
TCCCCGCGGC 
TTGCGCAAGT 
GTGGTGGATG 
GGCGGCACCG 
GGTGAGACGG 
AGCACCTTCT 



11 
I 

AGCTGACGAC 
GCATCCTGGC 
CCTTCGGCCT 
TCCTGGAGCA 
GGGGCTCGGT 
GCCTGCGCTC 
GCAACCGCCA 
AGCAGGCCGG 
AGGCCAGCCT 
CGTG CAT CAT 



21 
I 

AGCCCTGCGT 
AGCCTATGTG 
GTACGGCGCC 
CCGG CGCATG 
GGCACTGTGC 
GGCCCAGCGC 
GGAGGACGCC 
CTTCTTTGTG 
GCAGGAGGGC 
GCAGAAGTGG 



31 
I 

GTGGTGGGCA 
ACGGGCTACC 
ATCCTGGGCC 
CGACGTGCCG 
ATTGCCGCGT 
ATCTCCTTCC 
TACATGCTGG 
TGGCGCAGCA 
ATGGACCGTG 
GGAGGCAAGC 



41 

I 

CCAGCCTGTT 
AGTTCATCCA 
TGCACCTGCT 
GCCAGGCCCT 
ACCAGGAGGA 
CTGACCTCAA 
ACATCTTCCA 
ACTTCCATGA 
TGCGGGATGT 
GCGAGGT CAT 



51 
I 

TGCCCTGGCA 
CACGGAAAAG 
CATTCAGAGC 
GAAGCTGCCC 
CCCTGACTAC 
GGTGGTCATG 
CGAGGTGCTG 
GGCAGGCGAG 
GGTGCGGGCC 
GTACACGGCC 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



401 



WO 02/086443 

TTCAAGGCCC TCGGCGATTC GGTGGACTAC ATCCAGGTGT GCGACTCTGA CACTGTGCTG 660 

GATCCAGCCT GCACCATCGA GATGCTTCGA GTCCTGGAGG AGGATCCCCA AGTAGGGGGA 720 

GTCGGGGGAG ATGTCCAGAT CCTCAACAAG TACGACTCAT GGATTTCCTT CCTGAGCAGC 780 

GTGCGGTACT GGATGGCCTT CAACGTGGAG CGGGCCTGCC AGTCCTACTT TGGCTGTGTG 840 

CAGTGTATTA GTGGGCCCTT GGGCATGTAC CGCAACAGCC TCCTCCAGCA GTTCCTGGAG 900 

GACTGGTACC ATCAGAAGTT CCTAGGCAGC AAGTGCAGCT TCGGGGATGA CCGGCACCTC 960 

ACCAACCGAG TCCTGAGCCT TGGCTACCGA ACTAAGTATA CCGCGCGCTC CAAGTGCCTC 1020 

ACAGAGACCC CCACTAAGTA CCTCCGGTGG CTCAACCAGC AAACCCGCTG GAG CAAGT CT 1080 

TACTTCCGGG AGTGGCTCTA CAACTCTCTG TGGTTCCATA AGCACCACCT CTGGATGACC 1140 

TACGAGTCAG TGGTCACGGG XTTCTTCCCC TTCTTCCTCA TTGCCACGGT TATACAGCTT 1200 

TTCTACCGGG GCCGCATCTG GAACATTCTC CTCTTCCTGC TGACGGTGCA GCTGGTGGGC 1260 

ATTATCAAGG CCACCTACGC CTGCTTCCTT CGGGGCAATG CAGAGATGAT CTTCATGTCC 1320 

CTCTACTCCC TCCTCTATAT GTCCAGCCTT CTGCCGGCCA AGATCTTTGC CATTGCTACC 1380 

ATCAACAAAT CTGGCTGGGG CACCTCTGGC CGAAAAACCA TTGTGGTGAA CTTCATTGGC 1440 

CTCATTCCTG TGTCCATCTG GGTGGCAGTT CTCCTGGGAG GGCTGGCCTA CACAGCTTAT 1500 

TGCCAGGACC TGTT CAGTGA GACAGAGCTA GCCTTCCTTG TCTCTGGGGC TATACTGTAT 1560 

GGCTGCTACT GGGTGGCCCT CCTCATGCTA TATCTGGCCA TCATCGCCCG GCGATGTGGG 1620 
AAGAAGCCGG AGCAGTACAG CTTGGCTTTT GCTGAGGTGT GA 

Seq ID NO: 571 Protein sequence 
Protein Accession #: NP__005320.1 

1 11 21 31 41 51 

I I I I I 1 

MPVQLTTALR WGTSLFALA VLGGILAAYV TGYQFIHTEK HYLSFGLYGA ILGLHLLIQS 60 

LFAFLEHRRM RRAGQALKLP SPRRGSVALC IAAYQEDPDY LRKCLRSAQR ISFPDLKWM 12 0 

WDGNRQEDA YMLDIFHEVL GGTEQAGFFV WRSNFHEAGE GETEASLQEG MDRVRDWRA 180 

STFSCIMQKW GGKREVMYTA FKALGDSVDY IQVCDSDTVL DPACTIEMLR VLEEDPQVGG 240 

VGGDVQILNK YDSWISFLSS VRYWMAFNVE RACQSYFGCV QCISGPLGMY RNSLLQQFLE 300 

DWYHQKFLGS KCSFGDDRHL TNRVLSLGYR TKYTARSKCL TETPTKYLRW LNQQTRWSKS 360 

YFREWLYNSL WFHKHHLWMT YESWTGFFP FFLIATVIQL FYRGRIWNIL LFLLTVQLVG 420 

I I KATYACFIi RGNAEMIFMS LYSLLYMSSL LPAKIFAIAT INKSGWGTSG RKTIWNFIG 480 
LIPVSIWVAV LLGGLAYTAY CQDLFSETEL AFLVSGAILY GCYWVALLML YLAI IARRCG 540 
KKPEQYSLAF AEV 

Seq ID NO: 572 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 148-7095 

1 11 21 31 41 51 

I I I I 1 I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC AT CATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATT TCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGG CT CAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTT CTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATT CATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CT CGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 12 00 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 12 60 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 132 0 

TGCACTAATG GCTTATATGG AAAATACAGC G AC CAACTG A TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 150 0 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTAG*VA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 162 0 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATAT CCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2 040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACT CGCAGAC AACCCAACCG 2400 

GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 2460 

ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTGCTGCTTC AAGTAGTGAT 2520 

TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGT CGATG TGTCATTTGA ATCCATCCTG 2580 

TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 2640 

TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 2700 

GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 2760 
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AGCCTTGCTC AGTATTCTGA TGTGCTGTCC ACTACT CATG CTGCTTCAGA GACGCTGGAA 2820 

TTTGGTAGTG AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 2 880 

AGCAGTGATG CCATGATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2940 

GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 3000 

GATT CTGTGG GTGTAACTTA TCAGGGTTCC TTATTTAGCG GCCCTAGCCA TATACCAATA 3060 

CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCCCTCTCT 3120 

GGTGATGGGG AATGGT CTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 3180 

GGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACAACA 3240 

TCTGTGTTTG GTGATGATAA TAAGGCGCTT TCTAAAAGTG AAATAATATA TGGAAATGAG 3300 

ACTGAACTGC AAATTCCTTC TTT CAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG 33 60 

CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT TACAAGAAAC CTCTGTTTCC 342 0 

ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 

GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 3540 

TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 3600 

TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3 660 

ACCTCAGCTT CTTTTAGTAC TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720 

GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780 

CCCAAAGTTG ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTATCAAA TTCTGCTTCA 3840 

AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3 900 

ATGCACTCTG CTTCACTTCA AGGTTTGACC AT TTCCTATG CAAGTGAGAA ATATGAACCA 3 960 

GTTTTGTTAA AAAGTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATGATGAG 4020 

TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGGCATGTA 4080 

TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 4140 

CATT CCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 4200 

ATTCCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 

GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CTCAACAAAG 4320 

TTGCTGTTTC CTTCTAAGGC AACTTCTGAG CTGAGTCATA GTGCCAAATC TGATGCCGGT 4380 

TTAGTGGGTG GTGGTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGATGAC 4440 

AGAGGTAGTG ATGGCTTATC CATTCATAAG TGTATGTCAT GCTCATCCTA TAGAGAATCA 4500 

CAGGAAAAGG TAATGAATGA TTCAGACACC CACGAAAACA GTCTTATGGA TCAGAATAAT 4560 

CCAATCTCAT ACTCACTATC TGAGAATTCT GAAGAAGATA ATAGAGTCAC AAGTGTATCC 4620 

TCAGACAGTC AAACTGGTAT GGACAGAAGT GCTGGTAAAT CACCATCAGC AAATGGGCTA 4680 

TCCCAAAAGC ACAATGATGG AAAAGAGGAA AATGACATTC AGACTGGTAG TGCTCTGCTT 4740 

CCTCTCAGCC CTGAATCTAA AGCATGGGCA GTTCTGACAA GTGATGAAGA AAGTGG AT CA 4800 

GGGCAAGGTA CCT CAGATAG CCTTAATGAG AATGAGACTT CCACAGATTT CAGTTTTGCA 4860 

GACACTAATG AAAAAGATGC TGATGGGATC CTGGCAGCAG GTGACT CAGA AATAACTCCT 492 0 

GGATTCCCAC AGTCCCCAAC ATCATCTGTT ACTAGCGAGA ACTCAGAAGT GTTCCACGTT 4980 

TCAGAGGCAG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 5040 

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 5100 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 5160 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 522 0 

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 5280 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 53 40 

C AG AG C T GT A CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 5400 

CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 5460 

CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 5520 

AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 5580 

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 5640 

AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 5700 

TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 5760 

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 582 0 

ACACAGTATC ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 5880 

CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 5940 

CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATAT AT TG TGCTAGACAG TATGTTGCAG 60 00 

CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 6060 

AGAAATTATT TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 6120 

GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 6180 

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GCTCCTGAGC 6240 

CAGTCAAATA TACAGCAGAG TGACTATTCT GCAGCCCTAA AGCAATGCAA CAGGGAAAAG 63 00 

AATCGAACTT CTTCTATCAT CCCTGTGGAA AGATCAAGGG TTGGCATTTC ATCCCTGAGT 63 60 

GGAGAAGGCA CAGACTACAT CAATGCCTCC TATATCATGG GCTATTACCA GAGCAATGAA 642 0 

TTCATCATTA CCCAGCACCC TCTCCTTCAT ACCATCAAGG ATTTCTGGAG GATGATATGG 6480 

GACCATAATG CCCAACTGGT GGTTATGATT CCTGATGGCC AAAACATGGC AGAAGATGAA 6540 

TTTGTTTACT GGCCAAATAA AGATGAGCCT ATAAATTGTG AGAGCTTTAA GGTCACTCTT 6600 

ATGGCTGAAG AACACAAATG TCTATCTAAT GAGGAAAAAC TTATAATTCA GGACTTTATC 6660 

TTAGAAGCTA CACAGGATGA TTATGTACTT GAAGTGAGGC ACTTTCAGTG TCCTAAATGG 6720 

CCAAATCCAG ATAGCCCCAT TAGTAAAACT TTTGAACTTA TAAGTGTTAT AAAAGAAGAA 6780 

GCTGCCAATA GGGATGGGCC TATGATTGTT CATGATGAGC ATGGAGGAGT GACGGCAGGA 6840 

ACTTTCTGTG CTCTGACAAC CCTTATGCAC CAACTAGAAA AAGAAAATTC CGTGGATGTT 6900 

TACCAGGTAG CCAAGATGAT CAAT CTGATG AGGCCAGGAG TCTTTGCTGA CATTGAGCAG 6960 

TATCAGTTTC TCTACAAAGT GATCCTCAGC CTTGTGAGCA CAAGGCAGGA AGAGAATCCA 7020 

TCCACCTCTC TGGACAGTAA TGGTGCAGCA TTGCCTGATG GAAATATAGC TGAGAGCTTA 7080 

GAGTCTTTAG TTTAACACAG AAAGGGGTGG GGGGACT CAC ATCTGAGCAT TGTTTTCCTC 7140 

TTCCTAAAAT TAGGCAGGAA AATCAGTCTA GTTCTGTTAT CTGTTGATTT CCCATCACCT 7200 

GACAGTAACT TTCATGACAT AGGATTCTGC CGCCAAATTT ATATCATTAA CAATGTGTGC 72 60 

CTTTTTGCAA GACTTGTAAT TTACTTATTA TGTTTGAACT AAAATGATTG AATTTTACAG 7320 

TATTTCTAAG AATGGAATTG TGGTATTTTT TTCTGTATTG ATTTTAACAG AAAATTTCAA 73 80 

TTTATAGAGG TTAGGAATTC CAAACTACAG AAAATGTTTG TTTTTAGTGT CAAATTTTTA 7440 

GCTGTATTTG TAGCAATTAT CAGGTTTGCT AGAAATATAA CTTTTAATAC AGTAGCCTGT 7500 

AAATAAAACA CTCTTCCATA TGATATTCAA CATTTTACAA CTGCAGTATT CACCTAAAGT 7560 

AGAAATAATC TGTTACTTAT TGTAAATACT GCCCTAGTGT CTCCATGGAC CAAATTTATA 7620 

TTTATAATTG TAGATTTTTA TATTTTACTA CTGAGT CAAG TTTTCTAGTT CTGTGTAATT 7680 

GTTTAGTTTA ATGACGTAGT TCATTAGCTG GTCTTACTCT ACCAGTTTTC TGACATTGTA 7740 

TTGTGTTACC TAAGTCATTA ACTTTGTTTC AGCATGTAAT TTTAACTTTT GTGGAAAATA 7800 

GAAATACCTT CATTTTGAAA GAAGTTTTTA TGAGAATAAC ACCTTACCAA ACATTGTTCA 7860 

AATGGTTTTT ATCCAAGGAA TTGCAAAAAT AAATATAAAT AT TGCCATTA AAAAAAAAAA 7 920 
AAAAAAAAAA AAAAAAAAAA AAAA 
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Seq ID NO: 573 Protein' sequence : 
Protein Accession #: Eos sequence 

5 1 11 21 31 41 51 

1 I I I I I 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNLKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 12 0 

FKASKITFHW GKCNMS SDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 

10 ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPWSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRWYD TMIEKFAVLY QQLDGEDQTK 360 

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKDIEEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

15 RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTEIiPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVS I TEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 72 0 

TEVTPHAFTP SSRQQDLVST VNWYSQTTQ PVYNGETPLQ PSYSSEVFPL VTPLIjLDNQI 780 

20 LNTTPAASSS DSALHATPVF PSVDVSFESI LSSYDGAPLL PFSSASFSSE LFRHLHTVSQ 840 

ILPQVTSATE SDKVPLHASL PVAGGDLLLE PSLAQYSDVL STTHAASETL EFGSESGVLY 900 

KTLMFSQVEP PSSDAMMHAR SSGPEPSYAL SDNEGSQHIF TVSYSSAIPV HDSVGVTYQG 960 

SLFSGPSHIP IPKSSLITPT ASLLQPTHAL SGDGEWSGAS SDSEFLLPDT DGLTALNISS 102 0 

PVSVAEFTYT TSVFGDDNKA LSKSEIIYGN ETELQIPSFN EMVYPSESTV MPNMYDNVNK 1080 

25 LNASLQETSV SISSTKGMFP GSLAHTTTKV FDHEISQVPE NNFSVQPTHT VSQASGDTSL 1140 

KPVLSANSEP ASSDPASSEM LSPSTQLLFY ETSASFSTEV LLQPSFQASD VDTLLKTVLP 1200 

AVPSDPILVE TPKVDKISST MLHLIVSNSA SSENMLHSTS VPVFDVSPTS HMHSASLQGL 12 60 

TISYASEKYE PVLLKSESSH QWPSLYSND ELFQTANLEI NQAHPPKGRH VFATPVLSID 132 0 

EPLNTLINKL IHSDEILTST KSSVTGKVFA GIPTVASDTF VSTDHSVPIG NGHVAI TAVS 1380 

30 PHRDGSVTST KLLFPSKATS ELSHSAKSDA GLVGGGEDGD TDDDGDDDDD DRGSDGLSIH 1440 

KCMSCSSYRE SQEKVMNDSD THENSLMDQN NPISYSLSEN SEEDNRVTSV SSDSQTGMDR 1500 

SPGKSPSANG LSQKHNDGKE ENDIQTGSAL LPLSPESKAW AVLTSDEESG SGQGTSDSLN 1560 

ENETSTDFSF ADTNEKDADG ILAAGDSEIT PGFPQSPTSS VTSENSEVFH VSEAEASNSS 1620 

HESRIGLAEG LESEKKAVIP LVIVSALTFI CLWLVGILI YWRKCFQTAH FYLEDSTSPR 1680 

35 VISTPPTPIF PISDDVGAIP I KHFPKHVAD LHAS SGFTEE FETLKEFYQE VQSCTVDLGI 1740 

TAD S SNHPDN KHKNRYINIV AYDH SRVKL A QLAEKDGKLT DYINANYVDG YNRPKAY I AA 1800 

QGPLKSTAED FWRMIWEHNV EVIVMITNLV EKGRRKCDQY WPADGSEEYG NFLVTQKSVQ 1860 

VLAYYTVRNF TLRNTKIKKG SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY 192 0 

AKRHAVGPW VHCSAGVGRT GTYIVLDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE 1980 

40 QYVFIHDTLV EAILSKETEV LDSHIHAYVN ALLIPGPAGK TKLEKQFQLL SQSNIQQSDY 2040 

SAALKQCNRE KNRTSSIIPV ERSRVGISSL SGEGTDYINA SYIMGYYQSN EFIITQHPLIi 2100 

HTIKDFWRMI WDHNAQLWM I PDGQNMAED EFVYWPNKDE PINCESFKVT LMAEEHKCLS 2160 

NEEKLIIQDF ILEATQDDYV LEVRHFQCPK WPNPDSPISK TFELISVIKE E AANRDGPM I 2220 

VHDEHGGVTA GTFCALTTLM HQLEKENSVD VYQVAKMINL MRPGVFAD I E QYQFLYKVIL 22 80 

45 SLVSTRQEEN PSTSLDSNGA ALPDGNIAES LESLV 

Seq ID NO: 574 DNA sequence 
Nucleic Acid Accession #: Eos sequence 
^ Coding sequence: 148-4518 

1 11 21 31 41 51 

1 I 1 I I I 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

55 CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC AT CATTGGAA 420 

60 AACACATTCA T T CAT AACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCAT CTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 72 0 

65 GATTT CAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TT AGAT CCAT TCATACTGTT GAACCTTCTG CCAAACT CAA CTGACAAGTA TTACATTTAC 840 

AATGGCT CAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TT TTAAAGAT 900 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 102 0 

70 TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 12 60 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 132 0 

75 TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAAT C CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACA CT AC AA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

80 AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

85 GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 
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GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 222 0 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

5 TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG CAGAGGCCAG TAATAGTAGC CATGAGT CTC GTATTGGTCT AGCTGAGGGG 2460 

TTGGAATCCG AGAAGAAGGC AGTTATACCC CTTGTGATCG TGTCAGCCCT GACTTTTATC 252 0 

TGTCTAGTGG TTCTTGTGGG TATTCTCATC TACTGGAGGA AATGCTTCCA GACTGCACAC 2580 

TTTTACTTAG AGGACAGTAC ATCCCCTAGA GTTATATCCA CACCTCCAAC ACCTATCTTT 2640 

10 CCAATTTCAG ATGATGTCGG AG CAATTCCA ATAAAGCACT TTCCAAAGCA TGTTGCAGAT 2700 

TTACATGCAA GTAGTGGGTT TACTGAAGAA TTTGAGACAC TGAAAGAGTT TTACCAGGAA 2760 

GTGCAGAGCT GTACTGTTGA CTTAGGTATT ACAGCAGACA GCTCCAACCA CCCAGACAAC 282 0 

AAGCACAAGA ATCGATACAT AAATATCGTT GCCTATGATC ATAGCAGGGT TAAGCTAGCA 2 880 

CAGCTTGCTG AAAAGGATGG CAAACTGACT GATTATATCA ATGCCAATTA TGTTGATGGC 2 940 

15 TACAACAGAC CAAAAGCTTA TATTGCTGCC CAAGGCCCAC TGAAATCCAC AGCTGAAGAT 3 0 00 

TTCTGGAGAA TGATATGGGA ACATAATGTG GAAGTTATTG TCATGATAAC AAACCTCGTG 3 060 

GAGAAAGGAA GGAGAAAATG TGAT CAGTAC TGGCCTGCCG ATGGGAGTGA GGAGTACGGG 3120 

AACTTTCTGG TCACTCAGAA GAGTGTGCAA GTGCTTGCCT ATTATACTGT GAGGAATTTT 3180 

ACTCTAAGAA ACACAAAAAT AAAAAAGGGC TCCCAGAAAG GAAGACCCAG TGGACGTGTG 3240 

20 GTCACACAGT ATCACTACAC GCAGTGGCCT GACATGGGAG TACCAGAGTA CTCCCTGCCA 3300 

GTGCTGACCT TTGTGAGAAA GGCAGCCTAT GCCAAGCGCC ATGCAGTGGG GCCTGTTGTC 33 60 

GTCCACTGCA GTGCTGGAGT TGGAAGAACA GGCACATATA TTGTGCTAGA CAGTATGTTG 3420 

CAGCAGATTC AACACGAAGG AACTGTCAAC ATATTTGGCT TCTTAAAACA CATCCGTTCA 3 480 

CAAAGAAATT ATT TGGTACA AACTGAGGAG CAATATGTCT TCATTCATGA TACACTGGTT 3540 

25 GAGGCCATAC TTAGTAAAGA AACTGAGGTG CTGGACAGTC ATATT CATGC CTATGTTAAT 360 0 

GCACTCCTCA TTCCTGGACC AGCAGGCAAA ACAAAGCTAG AGAAACAATT CCAGCTCCTG 3660 

AGCCAGTCAA ATATACAGCA GAGTGACTAT TCTGCAGCCC TAAAGCAATG CAACAGGGAA 3720 

AAGAATCGAA CTTCTTCTAT CATCCCTGTG GAAAGATCAA GGGTTGGCAT TTCATCCCTG 3780 

AGTGGAGAAG GCACAGACTA CATCAATGCC TCCTATATCA TGGGCTATTA CCAGAGCAAT 3840 

30 GAATTCATCA TTACCCAGCA CCCTCTCCTT CATACCATCA AGGATTTCTG GAGGATGATA 3900 

TGGGACCATA ATGCCCAACT GGTGGTTATG ATTCCTGATG GCCAAAACAT GGCAGAAGAT 3960 

GAATTTGTTT ACTGGCCAAA TAAAGATGAG CCTATAAATT GTGAGAGCTT TAAGGTCACT 4020 

CTTATGGCTG AAGAACACAA ATGTCTATCT AATGAGGAAA AACTTATAAT TCAGGACTTT 4080 

ATCTTAGAAG CTACACAGGA TGATTATGTA CTTGAAGTGA GGCACTTTCA GTGTCCTAAA 4140 

35 TGGCCAAATC CAGATAGCCC CATTAGTAAA ACTTTTGAAC TTATAAGTGT TATAAAAGAA 4200 

GAAGCTGCCA AT AGGGATGG GCCTATGATT GTT CATGATG AGCATGGAGG AGTGACGGCA 42 60 

GGAACTTTCT GTGCTCTGAC AACCCTTATG CACCAACTAG AAAAAGAAAA TTCCGTGGAT 4320 

GTTTACCAGG TAGCCAAGAT GATCAATCTG ATGAGGCCAG GAGT CTTTGC TGACATTGAG 4380 

CAGTATCAGT TTCTCTACAA AGTGATCCTC AGCCTTGTGA GCACAAGGCA GGAAGAGAAT 4440 

40 CCATCCACCT CTCTGGACAG TAATGGTGCA GCATTGCCTG ATGGAAATAT AGCTGAGAGC 450 0 

TTAGAGTCTT TAGTTTAACA CAGAAAGGGG TGGGGGGACT CACATCTGAG CATTGTTTTC 4560 

CTCTTCCTAA AATTAGGCAG GAAAATCAGT CTAGTTCTGT TAT CTGTTGA TTTCCCATCA 4620 

CCTGACAGTA ACTTTCATGA CATAGGATTC TGCCGCCAAA TTTATATCAT TAACAATGTG 4680 

TGCCTTTTTG CAAGACTTGT AATTTACTTA TTATGTTTGA ACTAAAATGA TTGAATTTTA 4740 

45 CAGTATTTCT AAGAATGGAA TTGTGGTATT TTTTTCTGTA TTGATTTTAA CAGAAAATTT 4800 

CAATTTATAG AGGTTAGGAA TTCCAAACTA CAGAAAATGT TTGTTTTTAG TGTCAAATTT 4860 

TTAGCTGTAT TTGTAGCAAT TAT CAGGTTT GCTAGAAATA TAACTTTTAA TACAGT AG CC 4 920 

TGTAAATAAA ACACTCTTCC ATATGATATT CAACATTTTA CAACTGCAGT ATTCACCTAA 4 980 

AGTAGAAATA ATCTGTTACT TATTGTAAAT ACTGCCCTAG TGTCTCCATG GACCAAATTT 5040 

50 ATATTTATAA TTGTAGATTT TTATATTTTA CTACTGAGTC AAGTTTTCTA GTTCTGTGTA 5100 

ATTGTTTAGT TTAATGACGT AGTT CATT AG CTGGTCTTAC TCTACCAGTT TTCTGACATT 5160 

GTATTGTGTT ACCTAAGTCA TTAACTTTGT TTCAGCATGT AATTTTAACT TTTGTGGAAA 5220 

ATAGAAATAC CTTCATTTTG AAAGAAGTTT TTATGAGAAT AACACCTTAC CAAACATTGT 5280 

TCAAATGGTT TTTATCCAAG GAATTGCAAA AATAAATATA AATATTGCCA TTAAAAAAAA 53 40 

55 AAAAAAAAAA AAAAAAAAAA AAAAAAA 
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Seq ID NO: 575 Protein sequence: 
Protein Accession #: Eos sequence 



MRILKRFLAC 
QSPINIDEDL 
FKASKITFHW 
ILFEVGTEEN 
TDTVDWIVFK 
TGKEEIHEAV 
HEFLTDGYQD 
LIGTEEIIKE 
RSPTRGSEFS 
GSKTVLRSPH 
ENISQGYIFS 
TAQPDVGSGR 
TEVTPHAFTP 
PLVIVSALTF 
PIKHFPKHVA 
VAYDHSRVKL 
VEVIVMITNL 
GSQKGRPSGR 
TGTYIVLDSM 
VLDSHIHAYV 
VERSRVGISS 
MIPDGQNMAE 
VLEVRHFQCP 
MHQLEKENSV 
AALPDGNIAE 



11 
I 

IQLLCVCRLD 
TQVNVNLKKL 
GKCNMSSDGS 
LDFKAIIDGV 
DTVSISESQL 
CSSEPENVQA 
LGAILNNLLP 
EEEGKD I EEG 
GKGDVPNTSL 
MNLSGTAESL 
SENPETITYD 
ESFLQTNYTE 
SSRQQDLVST 
ICLWLVGIL 
DLHASSGFTE 
AQLAEKDGKL 
VEKGRRKCDQ 
WTQYHYTQW 
LQQIQHEGTV 
NALLI PGPAG 
LSGEGTDYIN 
DEFVYWPNKD 
KWPNPDSPIS 
DVYQVAKMIN 
SLESLV 



21 

I 

WANGYYRQQR 
KFQGWDKTSL 
EHSLEGQKFP 
ESVSRFGKQA 
AVFCEVLTMQ 
DPENYTSLLV 
NMSYVLQIVA 
AIVNPGRDSA 
NSTSQPVTKL 
NTVSITEYEE 
VLIPESARNA 
IRVDESEKTT 
VNWYSQTTQ 
IYWRKCFQTA 
EFETLKEFYQ 
TDYINANYVD 
YWPADGSEEY 
PDMGVPEYSL 
NIFGFLKHIR 
KTKLEKQFQL 
ASYIMGYYQS 
EPINCESFKV 
KTFELISVIK 
LMRPGVFADI 



31 

1 

KLVEEIGWSY 
ENTFIHNTGK 
LEMQIYCFDA 
ALDPFILLNL 
QSGYVMLMDY 
TWERPRWYD 
I CTNGLYGKY 
TNQIRKKEPQ 
ATEKDISLTS 
ESLLTSFKLD 
SEDSTSSGSE 
KSFSAGPVMS 
PVYNAEASNS 
HFYLEDSTSP 
EVQSCTVDLG 
GYNRPKAYIA 
GNFLVTQKSV 
PVLTFVRKAA 
SQRNYLVQTE 
LSQSNIQQSD 
NEFIITQHPL 
TLMAEEHKCL 
EEAANRDGPM 
EQYQFLYKVI 



41 
I 

TGALNQKNWG 
TVE INLTNDY 
DRFSSFEEAV 
LPNSTDKYYI 
LQNNFREQQY 
TMIEKFAVLY 
SDQLIVDMPT 
ISTTTHYNRI 
QTVTELPPHT 
TGAEDSSGSS 
ESLKDPSMEG 
QGPSVTDLEM 
SHESRIGLAE 
RVISTPPTPI 
ITADSSNHPD 
AQGPLKSTAE 
QVLAYYTVRN 
YAKRHAVGPV 
EQYVFIHDTL 
YSAALKQCNR 
LHTIKDFWRM 
SNEEKLIIQD 
IVHDEHGGVT 
LSLVSTRQEE 



51 

1 

KKYPTCNSPK 
RVSGGVSEMV 
KGKGKLRALS 
YNGSLTSPPC 
KFSRQVFSSY 
QQLDGEDQTK 
DNPELDLFPE 
GTKYNEAKTN 
VEGTSASliND 
PATSAIPFIS 
NVWFPSSTDI 
PHYSTFAYFP 
GLESEKKAVI 
FPISDDVGAI 
NKHKNRY INI 
DFWRMIWEHN 
FTLRNTKIKK 
WHCSAGVGR 
VEAILSKETE 
EKNRTSSIIP 
IWDHNAQIiW 
FILEATQDDY 
AGTFCALTTL 
NPSTSLDSNG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
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Seq ID NO: 576 DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 148-4494 

1 11 21 31 41 51 

111!)! 

CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 12 0 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAAT AT CCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 3 60 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC ATCATTGGAA 420 

AACAC AT T C A T T CAT AACAC TGGGAAAACA GTGGAAATTA ATCT CACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 60 0 

GAGATGCAAA TCTACTGCTT TGATGCAGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 

GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGGCTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 90 0 

ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 102 0 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 10 80 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT T CTTGT T AC A 114 0 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 12 0 0 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 132 0 

TGCACTAATG GCTTATATGG AAAAT AC AG C GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAG G AAGTG A ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGT CGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 2460 

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 252 0 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 2580 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2 640 

ATTT CAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGGAAGTGC AGAGCTGTAC TGTTGACTTA 2760 

GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC ACAAGAATCG ATACATAAAT 2 82 0 

ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC TTGCTGAAAA GGATGGCAAA 2880 

CTGACTGATT ATATCAATGC CAAT TATGTT GATGGCTACA ACAGACCAAA AGCTTATATT 2940 

GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT GGAGAATGAT ATGGGAACAT 3000 

AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA AAGGAAGGAG AAAATGTGAT 3060 

CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT TTCTGGTCAC TCAGAAGAGT 3120 

GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTC TAAGAAACAC AAAAATAAAA 3180 

AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA CACAGTATCA CTACACGCAG 3240 

TGGCCTGACA TGGGAGTACC AG AGTACT CC CTGCCAGTGC TGACCTTTGT GAGAAAGGCA 3300 

GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC ACTGCAGTGC TGGAGTTGGA 3360 

AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC AGATTCAACA CGAAGGAACT 3420 

GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA GAAATTATTT GGTACAAACT 3480 

G AGG AG CAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG CCATACTTAG TAAAGAAACT 3540 

GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC TCCTCATTCC TGGACCAGCA 3600 

GGCAAAACAA AGCTAGAGAA ACAATT CCAG CTCCTGAGCC AGTCAAATAT ACAGCAGAGT 3 660 

GACTATT CTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA ATCGAACTTC TTCTATCATC 3720 

CCTGTGGAAA GATCAAGGGT TGGCATTTCA TCCCTGAGTG GAGAAGGCAC AGACTACATC 3 780 

AATGCCTCCT AT AT CATGGG CTATTACCAG AGCAATGAAT TCATCATTAC CCAGCACCCT 3840 

CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG ACCATAATGC CCAACTGGTG 3 900 

GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT TTGTTTACTG GCCAAATAAA 3 960 

GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA TGGCTGAAGA ACACAAATGT 402 0 

CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT TAGAAGCTAC ACAGGATGAT 4080 

TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC CAAATCCAGA TAGCCCCATT 4140 

AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG CTGCCAATAG GGATGGGCCT 4200 

ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA CTTTCTGTGC TCTGACAACC 4260 

CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT ACCAGGTAGC CAAGATGATC 4320 

AATCTGATGA GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT ATCAGTTTCT CTACAAAGTG 4380 

ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT CCACCTCTCT GGACAGTAAT 4440 

GGTGCAGCAT TGCCTGATGG AAAT AT AG CT GAGAGCTTAG AGTCTTTAGT TTAACACAGA 450 0 

AAGGGGTGGG GGGACTCACA TCTGAGCATT GTTTTCCTCT TCCTAAAATT AGGCAGGAAA 4560 

ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG ACAGTAACTT TCATGACATA 4620 

GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC TTTTTGCAAG ACTTGTAATT 4680 

TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT ATTTCTAAGA ATGGAATTGT 4740 

GGTATTTTTT T CTGTATTGA TTTTAACAGA AAATTTCAAT TTATAGAGGT TAGGAATT CC 4800 



406 



WO 02/086443 

AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG CTGTATTTGT AGCAATT AT C 4 860 

AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA AATAAAACAC TCTTCCATAT 492 0 

GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA GAAATAATCT GTTACTTATT 4980 

GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT TTATAATTGT AGATTTT TAT 5040 

ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG TTTAGTTTAA TGACGTAGTT 5100 

CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT TGTGTTACCT AAGTCATTAA 5160 

CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG AAATACCTTC ATTTTGAAAG 522 0 

AAGTTTTTAT GAGAATAACA CCTTACCAAA CAT TGTTCAA ATGGTTTTTA TCCAAGGAAT 52 80 

TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 5340 
AAA 

Seq ID NO: 577 Protein sequence: 
Protein Accession #: EOS sequence 



MRILKRFLAC 
QSPINIDEDL 
FKASKITFHW 
ILFEVGTEEN 
TDTVDW1VFK 
TGKEEIHEAV 
HEFLTDGYQD 
LIGTEEIIKE 
RSPTRGSEFS 
GSKTVLRSPH 
ENISQGYIFS 
TAQPDVGSGR 
TEVTPHAFTP 
LVIVSALTFI 
I KHFPKHVAD 
KLAQLAEKDG 
NLrVEKGRRKC 
GRWTQYHYT 
SMLQQIQHEG 
YVNALLIPGP 
SSLSGEGTDY 
AEDEFVYWPN 
CPKWPNPDSP 
SVDVYQVAKM 
AESLESLV 



11 
I 

IQLLCVCRLD 
TQVNVNLKKL 
GKCNMSSDGS 
LDFKAIIDGV 
DTVSISESQL 
CSSEPENVQA 
LGAILNNLLP 
EEEGKDIEEG 
GKGD VPNTS L> 
MNLSGTAESL 
SENPETITYD 
ESFLQTNYTE 
SSRQQDLVST 
CLWLVGILI 
LHASSGFTEE 
KLTDYINANY 
DQYWPADGSE 
QWPDMGVPEY 
TVNIFGFLKH 
AGKTKLEKQF 
INASYIMGYY 
KDEPINCESF 
ISKTFELISV 
INLMRPGVFA 



21 
I 

WANGYYRQQR 
KFQGWDKTSL 
EHSLEGQKFP 
ESVSRFGKQA 
AVFCEVLTMQ 
DPENYTSLLV 
NMSYVLQIVA 
AIVNPGRDSA 
NSTSQPVTKL 
NTVSITEYEE 
VLIPESARNA 
IRVDESEKTT 
VNWYSQTTQ 
YWRKCFQTAH 
FEEVQSCTVD 
VDGYNRPKAY 
EYGNFLVTQK 
SLPVLTFVRK 
IRSQRNYLVQ 
QtiLSQSNIQQ 
QSNEFIITQH 
KVTLMAEEHK 
IKEEAANRDG 
DIEQYQFLYK 



31 

I 

KLVEEIGWSY 
ENTF IHNTGK 
LEMQIYCFDA 
ALDPFILLNL 
QSGYVMLMDY 
TWERPRWYD 
ICTNGLYGKY 
TNQIRKKEPQ 
ATEKDISLTS 
ESLLTSFKLD 
SEDSTSSGSE 
KSFSAGPVMS 
PVYNEASNSS 
FYLEDSTSPR 
LGITADSSNH 
IAAQGPLKST 
SVQVLAYYTV 
AAYAKRHAVG 
TEEQYVFIHD 
SDYSAALKQC 
PLLHTIKDFW 
CLSNEEKLII 
PMIVHDEHGG 
VILSLVSTRQ 



41 

I 

TGALNQKNWG 
TVEINLTNDY 
DRFSSFEEAV 
LPNSTDKYYI 
LQNNFREQQY 
TMIEKFAVLY 
SDQLIVDMPT 
ISTTTHYNRI 
QTVTELPPHT 
TGAEDSSGSS 
ESLKDPSMEG 
QGPSVTDLEM 
HESRIGLAEG 
VISTPPTPIF 
PDNKHKNRYI 
AEDFWRMIWE 
RNFTLRNTKI 
PVWHCSAGV 
TI/VEAILSKE 
NREKNRTSSI 
RMIWDHNAQL 
QDFILEATQD 
VTAGTFCALT 
EENPSTSLDS 



51 
I 

KKYPTCNSPK 
RVSGGVSEMV 
KGKGKLRALS 
YNGSLTSPPC 
KFSRQVFSSY 
QQLDGEDQTK 
DNPELDLFPE 
GTKYNEAKTN 
VEGTSASLND 
PATSAIPFIS 
NVWFPSSTDI 
PHYSTFAYFP 
LESEKKAVIP 
PISDDVGAIP 
N I VAYDHSRV 
HNVEVIVMIT 
KKGSQKGRPS 
GRTGTY I VLD 
TEVLVSH I HA 
IPVERSRVGI 
WMIPDGQNM 
DYVLEVRHFQ 
TLMHQLEKEN 
NGAALPDGNI 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



Seq ID NO: 578 DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 501-4514 



CACACATACG 
CAAAAAAAAC 
CGGCGAGGGG 
CAGCTCCTCT 
CTTGTTGAAG 
AATATCCAAC 
AAGTAAATGT 
ACACATTCAT 
TCAGCGGAGG 
AATGCAATAT 
AGATGCAAAT 
GAAAAGGGAA 
ATTTCAAAGC 
TAGATCCATT 
ATGGCTCATT 
CAGTT AG CAT 
CTGGTTATGT 
TCTCTAGACA 
GTTCAGAACC 
GGGAAAGACC 
AGTTGGATGG 
GTGCTATTCT 
GCACTAATGG 
ATCCTGAACT 
AAGAGGGAAA 
ACCAAATCAG 
CGAAATACAA 
AGGGTGATGT 
CAGAAAAAGA 
AAGGTACTTC 
ACTTGTCGGG 
GTTTATTGAC 
CAACTTCTGC 
AAAACCCAGA 
AAGATTCAAC 
TGTGGTTTCC 
GCTTTCTCCA 
CCTTTTCTGC 



11 

I 

CACGCACGAT 
ATTTCCTTCG 
CCGCAGACCG 
GTGTTTGCCG 
AGATTGGCTG 
ATGTAATAGC 
GAATCTTAAG 
TCATAACACT 
AGTTTCAGAA 
GTCATCTGAT 
CTACTGCTTT 
GTTAAGAGCT 
GATTATTGAT 
CATACTGTTG 
GACATCTCCT 
CTCTGAAAGC 
CATGCTGATG 
GGTGTTTTCC 
AGAAAATGTT 
TCGAGTCGTT 
AGAGGACCAA 
CAATAATTTG 
CTTATATGGA 
TGATCTTTTC 
AGACATTGAA 
GAAAAAGGAA 
TGAAGCCAAG 
TCCCAATACA 
TATTTCCTTG 
AGCCTCTTTA 
GACTGCAGAA 
CAGTTTCAAG 
TATCCCATTC 
GACAATAACA 
TTCATCAGGT 
TAGCTCTACA 
GACTAATTAC 
AGGCCCAGTG 



21 
I 

CTCACTTCGA 
CTCCCCCTCC 
TCTGGAAATG 
CCTGGATTGG 
GTCCTATACA 
CCAAAACAAT 
AAACTTAAAT 
GGGAAAACAG 
ATGGTGTTTA 
GGATCAGAGC 
GATGCGGACC 
TTATCCATTT 
GGAGTCGAAA 
AACCTTCTGC 
CCCTGCACAG 
CAGTTGGCTG 
GACTACTTAC 
TCATACACTG 
CAGGCTGACC 
TATGATACCA 
ACCAAGCATG 
CTACCCAATA 
AAATACAGCG 
CCTGAATTAA 
GAAGGCGCTA 
CCCCAGATTT 
ACTAACCGAT 
TCTTTAAATT 
ACTTCTCAGA 
AATGATGGCT 
TCCTTAAATA 
CTTGATACTG 
ATCTCTGAGA 
TATGATGTCC 
TCAGAAGAAT 
GACATAACAG 
ACTGAGATAC 
ATGTCACAGG 



31 
I 

TCTATACACT 
CTCTCCACTC 
CGAATCCTAA 
GCTAATGGAT 
GGAGCACTGA 
CTCCTATCAA 
TTCAGGGTTG 
TGGAAATTAA 
AAGCAAGCAA 
ATAGTTTAGA 
GATTTTCAAG 
TGTTTGAGGT 
GTGTTAGTCG 
CAAACTCAAC 
ACACAGTTGA 
TTTTTTGTGA 
AAAACAATTT 
GAAAGGAAGA 
CAGAGAATTA 
TGATTGAGAA 
AATTTTTGAC 
TGAGTTATGT 
ACCAACTGAT 
TTGGAACTGA 
TTGTGAATCC 
CTACCACAAC 
CCCCAACAAG 
CCACTTCCCA 
CTGTGACTGA 
CTAAAACTGT 
CAGTTTCTAT 
GAGCTGAAGA 
ACATATCCCA 
TTATACCAGA 
CACTAAAGGA 
CACAGCCCGA 
GTGTTGATGA 
GTCCCTCAGT 



41 

1 

GGAGGATTAA 
TGAGAAGCAG 
AGCGTTTCCT 
ACTACAGACA 
ATCAAAAAAT 
TATTGATGAA 
GGATAAAACA 
TCTCACTAAT 
GATAACTTTT 
AGGACAAAAA 
TTTTGAGGAA 
TGGGACAGAA 
TTTTGGGAAG 
TGACAAGTAT 
CTGGATTGTT 
AGTTCTTACA 
T CG AG AG CAA 
GATTCATGAA 
TACCAGCCTT 
GTTTGCAGTT 
AG ATGG CT AT 
TCTTCAGATA 
TGTCGACATG 
AGAAATAATC 
TGGTAGAGAC 
ACACTACAAT 
AGGAAGTGAA 
ACCAGTCACT 
ACTGCCACCT 
TCTTAGATCT 
AACAGAATAT 
TTCTTCAGGC 
AGGGTATATA 
AT CTGCTAGA 
TCCTTCTATG 
TGTTGGATCA 
ATCTGAGAAG 
TACAGATCTG 



51 

I 

AACAAACAAA 
AGGAGCCGCA 
CGCTTGCATT 
ACAGAGAAAA 
TGGGGAAAGA 
GATCTTACAC 
TCATTGGAAA 
GACTACCGTG 
CACTGGGGAA 
TTTCCACTTG 
GCAGTCAAAG 
GAAAATTTGG 
CAGGCTGCTT 
TACATTTACA 
TTTAAAGATA 
ATGCAACAAT 
CAGTACAAGT 
GCAGTTTGTA 
CTTGTTACAT 
TTGTACCAGC 
CAAGACTTGG 
GTAGCCATAT 
CCTACTGATA 
AAGGAGGAGG 
AGTGCTACAA 
CGCATAGGGA 
TTCTCTGGAA 
AAATTAGCCA 
CACACTGTGG 
CCACATATGA 
GAGGAGGAGA 
TCCAGTCCCG 
TTTTCCTCCG 
AATGCTTCCG 
GAGGGAAATG 
GGCAGAGAGA 
ACAACCAAGT 
GAAATGCCAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
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ATTATTCTAC CTTTGCCTAC TTCCCAACTG AGGTAACACC TCATGCTTTT ACCCCATCCT 2340 

CCAGACAACA GGATTTGGTC TCCACGGTCA ACGTGGTATA CTCGCAGACA ACCCAACCGG 2400 

TATACAATGA GGCCAGTAAT AGTAGCCATG AGT CTCGTAT TGGTCTAGCT GAGGGGTTGG 2460 

AATCCGAGAA GAAGGCAGTT ATACCCCTTG TGATCGTGTC AGCCCTGACT TTTATCTGTC 252 0 

TAGTGGTTCT TGTGGGTATT CTCATCTACT GGAGGAAATG CTTCCAGACT GCACACTTTT 2580 

ACTTAGAGGA CAGTACATCC CCTAGAGTTA TATCCACACC TCCAACACCT ATCTTTCCAA 2640 

TTTCAGATGA TGTCGGAGCA ATTCCAATAA AGCACTTTCC AAAGCATGTT GCAGATTTAC 2 700 

ATGCAAGTAG TGGGTTTACT GAAGAATTTG AGACACTGAA AGAGT TTTAC CAGGAAGTGC 2760 

AGAGCTGTAC TGTTGACTTA GGTATTACAG CAGACAGCTC CAACCACCCA GACAACAAGC 2820 

ACAAGAATCG ATACATAAAT ATCGTTGCCT ATGATCATAG CAGGGTTAAG CTAGCACAGC 2 880 

TTGCTGAAAA GGATGGCAAA CTGACTGATT ATATCAATGC CAATTATGTT GATGGCTACA 2940 

ACAGACCAAA AGCTTATATT GCTGCCCAAG GCCCACTGAA ATCCACAGCT GAAGATTTCT 3000 

GGAGAATGAT ATGGGAACAT AATGTGGAAG TTATTGTCAT GATAACAAAC CTCGTGGAGA 3060 

AAGGAAGGAG AAAATGTGAT CAGTACTGGC CTGCCGATGG GAGTGAGGAG TACGGGAACT 3120 

TTCTGGTCAC TCAGAAGAGT GTGCAAGTGC TTGCCTATTA TACTGTGAGG AATTTTACTG 3180 

TAAGAAACAC AAAAATAAAA AAGGGCTCCC AGAAAGGAAG ACCCAGTGGA CGTGTGGTCA 3240 

CACAGT AT CA CTACACGCAG TGGCCTGACA TGGGAGTACC AGAGTACTCC CTGCCAGTGC 3300 

TGACCTTTGT GAGAAAGGCA GCCTATGCCA AGCGCCATGC AGTGGGGCCT GTTGTCGTCC 3360 

ACTGCAGTGC TGGAGTTGGA AGAACAGGCA CATATATTGT GCTAGACAGT ATGTTGCAGC 3 42 0 

AGATTCAACA CGAAGGAACT GTCAACATAT TTGGCTTCTT AAAACACATC CGTTCACAAA 3480 

GAAATTATTT GGTACAAACT GAGGAGCAAT ATGTCTTCAT TCATGATACA CTGGTTGAGG 3540 

CCATACTTAG TAAAGAAACT GAGGTGCTGG ACAGTCATAT TCATGCCTAT GTTAATGCAC 360 0 

TCCTCATTCC TGGACCAGCA GGCAAAACAA AGCTAGAGAA ACAATT CCAG CTCCTGAGCC 3660 

AGTCAAATAT ACAGCAGAGT GACTATTCTG CAGCCCTAAA GCAATGCAAC AGGGAAAAGA 372 0 

ATCGAACTTC TTCTATCATC CCTGTGGAAA GAT CAAGGGT TGGCATTTCA TCCCTGAGTG 3780 

GAGAAGGCAC AGACTACATC AATGCCTCCT ATATCATGGG CTATTACCAG AGCAATGAAT 3840 

-TCATCATTAC CCAGCACCCT CTCCTTCATA CCATCAAGGA TTTCTGGAGG ATGATATGGG 3900 

ACCATAATGC CCAACTGGTG GTTATGATTC CTGATGGCCA AAACATGGCA GAAGATGAAT 3 960 

TTGTTTACTG GCCAAATAAA GATGAGCCTA TAAATTGTGA GAGCTTTAAG GTCACTCTTA 4020 

TGGCTGAAGA ACACAAATGT CTATCTAATG AGGAAAAACT TATAATTCAG GACTTTATCT 4080 

TAGAAGCTAC ACAGGATGAT TATGTACTTG AAGTGAGGCA CTTTCAGTGT CCTAAATGGC 4140 

CAAATCCAGA TAGCCCCATT AGTAAAACTT TTGAACTTAT AAGTGTTATA AAAGAAGAAG 4200 

CTGCCAATAG GGATGGGCCT ATGATTGTTC ATGATGAGCA TGGAGGAGTG ACGGCAGGAA 4260 

CTTTCTGTGC TCTGACAACC CTTATGCACC AACTAGAAAA AGAAAATTCC GTGGATGTTT 4320 

ACCAGGTAGC CAAGATGATC AATCTGATGA GGCCAGGAGT CTTTGCTGAC ATTGAGCAGT 4380 

ATCAGTTTCT CTACAAAGTG ATCCTCAGCC TTGTGAGCAC AAGGCAGGAA GAGAATCCAT 4440 

CCACCTCTCT GGACAGTAAT GGTGCAGCAT TGCCTGATGG AAATATAGCT GAGAGCTTAG 4500 

AGTCTTTAGT TTAACACAGA AAGGGGTGGG GGGACT CACA TCTGAGCATT GTTTTCCTCT 4560 

TCCTAAAATT AGGCAGGAAA ATCAGTCTAG TTCTGTTATC TGTTGATTTC CCATCACCTG 4620 

ACAGTAACTT TCATGACATA GGATTCTGCC GCCAAATTTA TATCATTAAC AATGTGTGCC 4680 

TTTTTGCAAG ACTTGTAATT TACTTATTAT GTTTGAACTA AAATGATTGA ATTTTACAGT 4740 

ATTTCTAAGA ATGGAATTGT GGTATTTTTT TCTGTATTGA TTTTAACAGA AAATTTCAAT 4800 

TTATAGAGGT TAGGAAT TCC AAACTACAGA AAATGTTTGT TTTTAGTGTC AAATTTTTAG 4860 

CTGTATTTGT AGCAATTATC AGGTTTGCTA GAAATATAAC TTTTAATACA GTAGCCTGTA 4 920 

AATAAAACAC TCTTCCATAT GATATTCAAC ATTTTACAAC TGCAGTATTC ACCTAAAGTA 4 980 

GAAATAATCT GTTACTTATT GTAAATACTG CCCTAGTGTC TCCATGGACC AAATTTATAT 5040 

TTATAATTGT AGATTTTTAT ATTTTACTAC TGAGTCAAGT TTTCTAGTTC TGTGTAATTG 5100 

TTTAGTTTAA TGACGTAGTT CATTAGCTGG TCTTACTCTA CCAGTTTTCT GACATTGTAT 5160 

TGTGTTACCT AAGTCATTAA CTTTGTTTCA GCATGTAATT TTAACTTTTG TGGAAAATAG 5220 

AAATACCTTC ATTTTGAAAG AAGTTTTTAT GAGAATAACA CCTTACCAAA CATTGTT CAA 52 80 

ATGGTTTTTA T CCAAGGAAT TGCAAAAATA AATATAAATA TTGCCATTAA AAAAAAAAAA 5340 
AAAAAAAAAA AAAAAAAAAA AAA 

Seq ID NO: 579 Protein sequence: 
Protein Accession # : EOS sequence 

1 11 21 31 41 51 

I I I I I I 

MVFKASKITF HWGKCNMSSD GSEHSLEGQK FPLEMQIYCF DADRFSSFEE AVKGKGKLRA 60 

LSILFEVGTE ENBDFKAIID GVESVSRFGK QAALDPFILL NLLPNSTDKY YIYNGSLTSP 120 

PCTDTVDWIV FKDTVSISES QLAVFCEVL.T MQQSGYVMLM DYLQNNFREQ QYKFSRQVFS 180 

SYTGKEEIHE AVCSSEPENV QADPENYTSL LVTWERPRW YDTMIEKFAV LYQQLDGEDQ 240 

TKHEFLTDGY QDLGAILNNL LPNMSYVLQI VAI CTNGLYG KYSDQLIVDM PTDNPELDLF 3 00 

PELIGTEEII KEEEEGKDIE EGAIVNPGRD SATNQIRKKE PQISTTTHYN RIGTKYNEAK 360 

TNRSPTRGSE FSGKGDVPNT SLNSTSQPVT KLATEKDISh TSQTVTELPP HTVEGTSASL 420 

NDGSKTVLRS PHMNLSGTAE SLNTVSITEY EEESLLTSFK LDTGAEDSSG SSPATSAIPF 480 

ISENISQGYI FSSENPETIT YDVLIPESAR NASEDSTSSG SEESLKDPSM EGNVWFPSST 540 

DITAQPDVGS GRESFLQTNY TEIRVDESEK TTKSFSAGPV MSQGPSVTDL EMPHYSTFAY 600 

FPTEVTPHAF TPSSRQQDLV STVNWYSQT TQPVYNEASN SSHESRIGLA EGLESEKKAV 660 

IPLVIVSALT FICLWLVGI LIYWRKCFQT AHFYLEDSTS PRVISTPPTP IFPISDDVGA 720 

IPIKHFPKHV ADLHASSGFT EEFETLKEFY QEVQSCTVDL GITADSSNHP DNKHKNR Y I N 780 

IVAYDHSRVK LAQLAEKDGK LTDYINANYV D GYNRPKAY I AAQGPLKSTA EDFWRMIWEH 840 

NVEVIVMITN LVE KGRRKCD QYWPADGSEE YGNFLVTQKS VQVLAYYTVR NFTLRNTKIK 900 

KGSQKGRPSG RWTQYHYTQ WPDMGVPEYS LPVLTFVRKA AYAKRHAVGP WVHCSAGVG 960 

RTGTYIVLDS MLQQIQHEGT VNIFGFLKHI RSQRNYLVQT .EEQYVFIHDT LVEAILSKET 1020 

EVLDSHIHAY VNALL I PGPA GKTKLEKQFQ LLSQSNIQQS DYSAALKQCN REKNRTSSII 1080 

PVERSRVGIS SLSGEGTDYI NASYIMGYYQ SNEFIITQHP LIiHTIKDFWR M I WDHNAQLV 1140 

VMIPDGQNMA EDEFVYWPNK' DEPINCESFK VTLMAEEHKC LSNEEKLIIQ DFILEATQDD 12 00 

YVLEVRHFQC PKWPNPDSPI SKTFELISVI KEEAANRDGP MIVHDEHGGV TAGTFCALTT 1260 

LMHQLEKENS VDVYQVAKM I NLMRPGVFAD IEQYQFLYKV ILSLVSTRQE ENPSTSLDSN 132 0 
GAALPDGNIA ESLESLV 

Seq ID NO: 580 DNA sequence 

Nucleic Acid Accession #: EOS sequence 

Coding sequence: 148-4632 

1 11 21 31 41 51 
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CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 
CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 
CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AACGTTTCCT CGCTTGCATT 180 
CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 
CTTGTTGAAG AGATTGGCTG GTCCTATACA GG AG CACTGA ATCAAAAAAA TTGGGGAAAG 300 
AAATAT CCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGAT CTTACA 3 60 
CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC AT CATTGGAA 420 
AACACATTCA TT CAT AACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 
GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 540 
AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTTCCACTT 600 
GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 
GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 720 
GATTTCAAAG CGATTATTGA TGGAGTCGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 
TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 
AATGGCT CAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 900 
ACAGTTAGCA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG T C ATGCTG AT GGACTACTTA CAAAACAATT TT CGAGAGCA ACAGTACAAG 1020 

TTCTCTAGAC AGGTGTTTTC CTCATACACT GGAAAGGAAG AGATTCATGA AGCAGTTTGT 1080 

AGTTCAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 12 00 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAAT TT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 13 20 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 13 80 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 150 0 

AACCAAATCA GGAAAAAGGA ACCCCAGATT TCTACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 162 0 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA ATCCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTTCAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 1920 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATATCCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATT CAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCTCTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 2220 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 22 80 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

TCCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG AGGCCAGTAA TAGTAGCCAT GAGTCTCGTA TTGGTCTAGC TGAGGGGTTG 2460 

GAATCCGAGA AGAAGGCAGT TATACCCCTT GTGATCGTGT CAGCCCTGAC TTTTATCTGT 2520 

CTAGTGGTTC TTGTGGGTAT TCTCATCTAC TGGAGGAAAT GCTTCCAGAC TGCACACTTT 2580 

TACTTAGAGG ACAGTACATC CCCTAGAGTT ATATCCACAC CTCCAACACC TATCTTTCCA 2 640 

ATTTCAGATG ATGTCGGAGC AATTCCAATA AAGCACTTTC CAAAGCATGT TGCAGATTTA 2700 

CATGCAAGTA GTGGGTTTAC TGAAGAATTT GAGACACTGA AAGAGTTTTA CCAGGAAGTG 2760 

CAGAGCTGTA CTGTTGACTT AGGTATTACA GCAGACAGCT CCAACCACCC AGACAACAAG 2820 

CACAAGAATC GATACATAAA TATCGTTGCC TATGATCATA GCAGGGTTAA GCTAGCACAG 2 880 

CTTGCTGAAA AGGATGGCAA ACTGACTGAT TATATCAATG CCAATTATGT TGATGGCTAC 2 940 

AACAGACCAA AAGCTTATAT TGCTGCCCAA GGCCCACTGA AATCCACAGC TGAAGATTTC 3000 

TGGAGAATGA TATGGGAACA TAATGTGGAA GTTATTGTCA TGATAACAAA CCTCGTGGAG 3 060 

AAAGGAAGGA GAAAATGTGA TCAGTACTGG CCTGCCGATG GGAGTGAGGA GTACGGGAAC 3120 

TTTCTGGTCA CTCAGAAGAG TGTGCAAGTG CTTGCCTATT ATACTGTGAG GAATTTTACT 3180 

CTAAGAAACA CAAAAATAAA AAAGGGCTCC CAGAAAGGAA GACCCAGTGG ACGTGTGGTC 3240 

ACACAGT AT C ACTACACGCA GTGGCCTGAC ATGGGAGTAC CAGAGTACTC CCTGCCAGTG 3300 

CTGACCTTTG TGAGAAAGGC AGCCTATGCC AAGCGCCATG CAGTGGGGCC TGTTGTCGTC 3360 

CACTGCAGTG CTGGAGTTGG AAGAACAGGC ACATATATTG TGCTAGACAG TATGTTGCAG 3420 

CAGATTCAAC ACGAAGGAAC TGTCAACATA TTTGGCTTCT TAAAACACAT CCGTTCACAA 3480 

AGAAATT AT T TGGTACAAAC TGAGGAGCAA TATGTCTTCA TTCATGATAC ACTGGTTGAG 3 540 

GCCATACTTA GTAAAGAAAC TGAGGTGCTG GACAGTCATA TTCATGCCTA TGTTAATGCA 3600 

CTCCTCATTC CTGGACCAGC AGGCAAAACA AAGCTAGAGA AACAATTCCA GGGTCTCACT 3660 

CTGTCACCCA GGCTGGAGTG CAGAGGCACA ATCTCGGCTC ACTGCAACCT TCCTCTCCCT 3 72 0 

GGCTTAACTG ATCCTCCTAC CTCAGCCTCC CGAGTGGCTG GGACTATACT CCTGAGCCAG 3780 

TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 3840 

CGAACTTCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 3900 

GAAGG CACAG ACTACAT CAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATGAATTC 3 960 

ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 4020 

CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA AGATGAATTT 4080 

GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTCTTATG 4140 

GCTGAAGAAC AC AAATGT CT ATCTAATGAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 4200 

GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATGGCCA 4260 

AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 4320 

GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT 4380 

TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATGTTTAC 4440 

CAGGTAGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TG AG CAGT AT 450 0 

CAGTTTCTCT ACAAAGTGAT CCTCAGCCTT GTGGGCACAA GGCAGGAAGA GAAT CCATCC 4560 

ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 4620 

TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 4680 

CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC ATCACCTGAC 4 740 

AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA TCATTAACAA TGTGTGCCTT 4800 

TTTGCAAGAC TTGTAATTTA CTTATTATGT TTGAACTAAA ATGATTGAAT TT TACAGT AT 4860 

TT CTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT T T AACAG AAA ATTT CAATTT 4 920 

ATAGAGGTTA GGAATT CCAA ACTACAGAAA ATGTTTGTTT TTAGTGTCAA ATTTTTAGCT 4 980 

GTATTTGTAG CAATTAT CAG GTTTGCTAGA AATATAACTT TTAATACAGT AGCCTGTAAA 5040 

TAAAACACTC TT CCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 5100 

AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 5160 
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ATAATTGTAG ATTTTTATAT TTTACTACTG AGTCAAGTTT TCTAGTTCTG TGTAATTGTT 5220 

TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTG 5280 

TGTTACCTAA GTCATTAACT TTGTTTCAGC ATGTAATTTT AACTTTTGTG GAAAATAGAA 5340 

ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACACC TTACCAAACA TTGTTCAAAT 5400 

GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA AAAAAAAAAA 5460 
AAAAAAAAAA AAAAAAAAAA A 

Seq ID NO: 581 Protein sequence: 
Protein Accession # : EOS sequence 

1 11 21 31 41 51 

I I 1 I 1 I 

MRILKRFLAC IQLLCVCRLD WANGYYRQQR KLVEEIGWSY TGALNQKNWG KKYPTCNSPK 60 

QSPINIDEDL TQVNVNIjKKL KFQGWDKTSL ENTFIHNTGK TVEINLTNDY RVSGGVSEMV 12 0 

FKASKITFHW GKCNMSSDGS EHSLEGQKFP LEMQIYCFDA DRFSSFEEAV KGKGKLRALS 180 

ILFEVGTEEN LDFKAIIDGV ESVSRFGKQA ALDPFILLNL LPNSTDKYYI YNGSLTSPPC 240 

TDTVDWIVFK DTVSISESQL AVFCEVLTMQ QSGYVMLMDY LQNNFREQQY KFSRQVFSSY 300 

TGKEEIHEAV CSSEPENVQA DPENYTSLLV TWERPRWYD TMIEKFAVLY QQLDGEDQTK 360 

HEFLTDGYQD LGAILNNLLP NMSYVLQIVA ICTNGLYGKY SDQLIVDMPT DNPELDLFPE 420 

LIGTEEIIKE EEEGKD I EEG AIVNPGRDSA TNQIRKKEPQ ISTTTHYNRI GTKYNEAKTN 480 

RSPTRGSEFS GKGDVPNTSL NSTSQPVTKL ATEKDISLTS QTVTELPPHT VEGTSASLND 540 

GSKTVLRSPH MNLSGTAESL NTVSITEYEE ESLLTSFKLD TGAEDSSGSS PATSAIPFIS 600 

ENISQGYIFS SENPETITYD VLIPESARNA SEDSTSSGSE ESLKDPSMEG NVWFPSSTDI 660 

TAQPDVGSGR ESFLQTNYTE IRVDESEKTT KSFSAGPVMS QGPSVTDLEM PHYSTFAYFP 720 

TEVTPHAFTP SSRQODLVST VNWYSQTTQ PVYNEASNSS HESRIGLAEG LESEKKAVIP 780 

LVIVSALTFI CLWLVGILI YWRKCFQTAH FYLEDSTSPR VISTPPTPIF PISDDVGAIP 840 

IKHFPKHVAD LHAS SGFTEE FETLKEFYQE VQSCTVDLGI TADSSNHPDN KHKNRYINIV 90 0 

AYDHSRVKLA QLAEKDGKLT DYINANYVDG YNRPKAYIAA QGPLKSTAED FWRMIWEHNV 960 

EVIVMITNLV EKGRRKCDQY WPADGSEEYG NFLVTQKSVQ VLAYYTVRNF TLRNTKIKKG 1020 

SQKGRPSGRV VTQYHYTQWP DMGVPEYSLP VLTFVRKAAY AKRHAVGPW VHCSAGVGRT 1080 

GTYIVLDSML QQIQHEGTVN IFGFLKHIRS QRNYLVQTEE QYVFIHDTLV EAILSKETEV 1140 

LDSHIHAYVN ALLIPGPAGK TKLEKQFQGL TLSPRLECRG TISAHCNLPL PGLTDPPTSA 1200 

SRVAGTILLS QSNIQQSDYS AALKQCNREK NRTSSIIPVE RSRVGISSLS GEGTDYINAS 1260 

YIMGYYQSNE FIITQHPDLH TIKDFWRMIW DHNAQLWMI PDGQNMAEDE FVYWPNKDEP 1320 

INCESFKVTL MAEEHKCLSN EEKLIIQDFI LEATQDDYVL EVRHFQCPKW PNPDSPISKT 13 80 

FELISVIKEE AANRDGPMIV HDEHGGVTAG TFCALTTLMH QLEKENSVDV YQVAKMINLM 1440 
RPGVFADIEQ YQFLYKVILS LVGTRQEENP STSLDSNGAA LPDGNIAESL ESLV 
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CACACATACG CACGCACGAT CTCACTTCGA TCTATACACT GGAGGATTAA AACAAACAAA 60 

CAAAAAAAAC ATTTCCTTCG CTCCCCCTCC CTCTCCACTC TGAGAAGCAG AGGAGCCGCA 120 

CGGCGAGGGG CCGCAGACCG TCTGGAAATG CGAATCCTAA AGCGTTTCCT CGCTTGCATT 180 

CAGCTCCTCT GTGTTTGCCG CCTGGATTGG GCTAATGGAT ACTACAGACA ACAGAGAAAA 240 

CTTGTTGAAG AGATTGGCTG GTCCTATACA GGAGCACTGA ATCAAAAAAA TTGGGGAAAG 300 

AAATATCCAA CATGTAATAG CCCAAAACAA TCTCCTATCA ATATTGATGA AGATCTTACA 360 

CAAGTAAATG TGAATCTTAA GAAACTTAAA TTTCAGGGTT GGGATAAAAC AT C ATTGGAA 420 

AACACATTCA TTCATAACAC TGGGAAAACA GTGGAAATTA ATCTCACTAA TGACTACCGT 480 

GTCAGCGGAG GAGTTTCAGA AATGGTGTTT AAAGCAAGCA AGATAACTTT TCACTGGGGA 54 0 

AAATGCAATA TGTCATCTGA TGGATCAGAG CATAGTTTAG AAGGACAAAA ATTT CCACTT 600 

GAGATGCAAA TCTACTGCTT TGATGCGGAC CGATTTTCAA GTTTTGAGGA AGCAGTCAAA 660 

GGAAAAGGGA AGTTAAGAGC TTTATCCATT TTGTTTGAGG TTGGGACAGA AGAAAATTTG 72 0 

GATTTCAAAG CGATTATTGA TGGAGT CGAA AGTGTTAGTC GTTTTGGGAA GCAGGCTGCT 780 

TTAGATCCAT TCATACTGTT GAACCTTCTG CCAAACTCAA CTGACAAGTA TTACATTTAC 840 

AATGG CTCAT TGACATCTCC TCCCTGCACA GACACAGTTG ACTGGATTGT TTTTAAAGAT 90 0 

ACAGTT AG CA TCTCTGAAAG CCAGTTGGCT GTTTTTTGTG AAGTTCTTAC AATGCAACAA 960 

TCTGGTTATG TCATGCTGAT GGACTACTTA CAAAACAATT TTCGAGAGCA ACAGTACAAG 1020 

TTCT CTAGAC AGGTGTTTTC CT CATACACT GGAAAGGAAG AGATT CATGA AGCAGTTTGT 1080 

AGTT CAGAAC CAGAAAATGT TCAGGCTGAC CCAGAGAATT ATACCAGCCT TCTTGTTACA 1140 

TGGGAAAGAC CTCGAGTCGT TTATGATACC ATGATTGAGA AGTTTGCAGT TTTGTACCAG 1200 

CAGTTGGATG GAGAGGACCA AACCAAGCAT GAATTTTTGA CAGATGGCTA TCAAGACTTG 1260 

GGTGCTATTC TCAATAATTT GCTACCCAAT ATGAGTTATG TTCTTCAGAT AGTAGCCATA 132 0 

TGCACTAATG GCTTATATGG AAAATACAGC GACCAACTGA TTGTCGACAT GCCTACTGAT 1380 

AATCCTGAAC TTGATCTTTT CCCTGAATTA ATTGGAACTG AAGAAATAAT CAAGGAGGAG 1440 

GAAGAGGGAA AAGACATTGA AGAAGGCGCT ATTGTGAATC CTGGTAGAGA CAGTGCTACA 1500 

AACCAAATCA GGAAAAAGGA ACCCCAGATT T CT ACCACAA CACACTACAA TCGCATAGGG 1560 

ACGAAATACA ATGAAGCCAA GACTAACCGA TCCCCAACAA GAGGAAGTGA ATTCTCTGGA 1620 

AAGGGTGATG TTCCCAATAC ATCTTTAAAT TCCACTTCCC AACCAGTCAC TAAATTAGCC 1680 

ACAGAAAAAG ATATTTCCTT GACTTCTCAG ACTGTGACTG AACTGCCACC TCACACTGTG 1740 

GAAGGTACTT CAGCCTCTTT AAATGATGGC TCTAAAACTG TTCTTAGATC TCCACATATG 1800 

AACTTGTCGG GGACTGCAGA AT CCTTAAAT ACAGTTTCTA TAACAGAATA TGAGGAGGAG 1860 

AGTTTATTGA CCAGTTT CAA GCTTGATACT GGAGCTGAAG ATTCTTCAGG CTCCAGTCCC 192 0 

GCAACTTCTG CTATCCCATT CATCTCTGAG AACATAT CCC AAGGGTATAT ATTTTCCTCC 1980 

GAAAACCCAG AGACAATAAC ATATGATGTC CTTATACCAG AATCTGCTAG AAATGCTTCC 2040 

GAAGATTCAA CTTCATCAGG TTCAGAAGAA TCACTAAAGG ATCCTTCTAT GGAGGGAAAT 2100 

GTGTGGTTTC CTAGCT CTAC AGACATAACA GCACAGCCCG ATGTTGGATC AGGCAGAGAG 2160 

AGCTTTCTCC AGACTAATTA CACTGAGATA CGTGTTGATG AATCTGAGAA GACAACCAAG 222 0 

TCCTTTTCTG CAGGCCCAGT GATGTCACAG GGTCCCTCAG TTACAGATCT GGAAATGCCA 2280 

CATTATTCTA CCTTTGCCTA CTTCCCAACT GAGGTAACAC CTCATGCTTT TACCCCATCC 2340 

T CCAGACAAC AGGATTTGGT CTCCACGGTC AACGTGGTAT ACTCGCAGAC AACCCAACCG 2400 

GTATACAATG GTGAGACACC TCTTCAACCT TCCTACAGTA GTGAAGTCTT TCCTCTAGTC 2460 

ACCCCTTTGT TGCTTGACAA TCAGATCCTC AACACTACCC CTGCTGCTTC AAGTAGTGAT 2 520 
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TCGGCCTTGC ATGCTACGCC TGTATTTCCC AGTGTCGATG TGTCATTTGA ATCCATCCTG 2580 

TCTTCCTATG ATGGTGCACC TTTGCTTCCA TTTTCCTCTG CTTCCTTCAG TAGTGAATTG 2 640 

TTTCGCCATC TGCATACAGT TTCTCAAATC CTTCCACAAG TTACTTCAGC TACCGAGAGT 2700 

GATAAGGTGC CCTTGCATGC TTCTCTGCCA GTGGCTGGGG GTGATTTGCT ATTAGAGCCC 2760 

AGCCTTGCTC AGTATT CTGA TGTGCTGTCC ACTACTCATG CTGCTTCAGA GACGCTGGAA 2820 

TTTGGTAGTG AATCTGGTGT TCTTTATAAA ACGCTTATGT TTTCTCAAGT TGAACCACCC 2 880 

AGCAGTGATG CCATGATGCA TGCACGTTCT TCAGGGCCTG AACCTTCTTA TGCCTTGTCT 2 940 

GATAATGAGG GCTCCCAACA CATCTTCACT GTTTCTTACA GTTCTGCAAT ACCTGTGCAT 3000 

GATTCTGTGG GTGTAACTTA TCAGGGTTCC TTATTTAGCG GCCCTAGCCA TATACCAATA 30 60 

CCTAAGTCTT CGTTAATAAC CCCAACTGCA TCATTACTGC AGCCTACTCA TGCCCTCTCT 3120 

GGTGATGGGG AATGGTCTGG AGCCTCTTCT GATAGTGAAT TTCTTTTACC TGACACAGAT 3180 

GGGCTGACAG CCCTTAACAT TTCTTCACCT GTTTCTGTAG CTGAATTTAC ATATACAACA 3240 

TCTGTGTTTG GTGATGATAA TAAGGCGCTT TCTAAAAGTG AAATAATATA TGGAAATGAG 3300 

ACTGAACTGC AAATTCCTTC TTTCAATGAG ATGGTTTACC CTTCTGAAAG CACAGTCATG 33 60 

CCCAACATGT ATGATAATGT AAATAAGTTG AATGCGTCTT TACAAGAAAC CTCTGTTTCC 3420 

ATTTCTAGCA CCAAGGGCAT GTTTCCAGGG TCCCTTGCTC ATACCACCAC TAAGGTTTTT 3480 

GATCATGAGA TTAGTCAAGT TCCAGAAAAT AACTTTTCAG TTCAACCTAC ACATACTGTC 3540 

TCTCAAGCAT CTGGTGACAC TTCGCTTAAA CCTGTGCTTA GTGCAAACTC AGAGCCAGCA 3 60 0 

TCCTCTGACC CTGCTTCTAG TGAAATGTTA TCTCCTTCAA CTCAGCTCTT ATTTTATGAG 3 660 

ACCTCAGCTT CTTTTAGTAG TGAAGTATTG CTACAACCTT CCTTTCAGGC TTCTGATGTT 3720 

GACACCTTGC TTAAAACTGT TCTTCCAGCT GTGCCCAGTG ATCCAATATT GGTTGAAACC 3780 

CCCAAAGTTG ATAAAATTAG TTCTACAATG TTGCATCTCA TTGTAT CAAA TTCTGCTTCA 3840 

AGTGAAAACA TGCTGCACTC TACATCTGTA CCAGTTTTTG ATGTGTCGCC TACTTCTCAT 3 900 

ATGCACTCTG CTTCACTTCA AGGTTTGACC ATTTCCTATG CAAGTGAGAA ATATGAACCA 3 960 

GTTTTGTTAA AAAGTGAAAG TTCCCACCAA GTGGTACCTT CTTTGTACAG TAATG AT GAG 4 02 0 

TTGTTCCAAA CGGCCAATTT GGAGATTAAC CAGGCCCATC CCCCAAAAGG AAGG CATGT A 4080 

TTTGCTACAC CTGTTTTATC AATTGATGAA CCATTAAATA CACTAATAAA TAAGCTTATA 4140 

CATTCCGATG AAATTTTAAC CTCCACCAAA AGTTCTGTTA CTGGTAAGGT ATTTGCTGGT 42 00 

ATT CCAACAG TTGCTTCTGA TACATTTGTA TCTACTGATC ATTCTGTTCC TATAGGAAAT 4260 

GGGCATGTTG CCATTACAGC TGTTTCTCCC CACAGAGATG GTTCTGTAAC CT CAACAAAG 4320 

TTGCTGTTTC CTTCT AAGG C AACTTCTGAG CTGAGT CAT A GTGCCAAATC TGATGCCGGT 4380 

TTAGTGGGTG GTGGTGAAGA TGGTGACACT GATGATGATG GTGATGATGA TGATGACAGA 4440 

GATAGTGATG GCTTATCCAT TCATAAGTGT ATGTCATGCT CATCCTATAG AGAATCACAG 4500 

GAAAAGGTAA TG AATGATT C AGACACCCAC GAAAACAGTC TTATGGATCA GAATAATCCA 4560 

ATCTCATACT CACTATCTGA GAATTCTGAA GAAGATAATA GAGTCACAAG TGTATCCTCA 4620 

GACAGTCAAA CTGGTATGGA CAGAAGTCCT GGTAAATCAC CATCAGCAAA TGGGCTATCC 4680 

CAAAAGCACA ATGATGGAAA AGAGGAAAAT GACATTCAGA CTGGTAGTGC TCTGCTTCCT 4740 

CTCAGCCCTG AATCTAAAGC ATGGGCAGTT CTGACAAGTG ATGAAGAAAG TGGATCAGGG 480 0 

CAAGGTACCT CAGATAGCCT TAATGAGAAT GAGACTTCCA CAGATTTCAG TTTTGCAGAC 4860 

ACTAATGAAA AAGATGCTGA TGGGATCCTG GCAGCAGGTG ACTCAGAAAT AACTCCTGGA 4920 

TTCCCACAGT CCCCAACATC ATCTGTTACT AGCGAGAACT CAGAAGTGTT CCACGTTTCA 4980 

GAGGCAGAGG CCAGTAATAG TAGCCATGAG TCTCGTATTG GTCTAGCTGA GGGGTTGGAA 5040 

TCCGAGAAGA AGGCAGTTAT ACCCCTTGTG ATCGTGTCAG CCCTGACTTT TATCTGTCTA 5100 

GTGGTTCTTG TGGGTATTCT CATCTACTGG AGGAAATGCT TCCAGACTGC ACACTTTTAC 5160 

TTAGAGGACA GTACATCCCC TAGAGTTATA TCCACACCTC CAACACCTAT CTTTCCAATT 5220 

TCAGATGATG TCGGAGCAAT TCCAATAAAG CACTTTCCAA AGCATGTTGC AGATTTACAT 52 80 

GCAAGTAGTG GGTTTACTGA AGAATTTGAG ACACTGAAAG AGTTTTACCA GGAAGTGCAG 5340 

AGCTGTACTG TTGACTTAGG TATTACAGCA GACAGCT CCA ACCACCCAGA CAACAAGCAC 5400 

AAGAATCGAT ACATAAATAT CGTTGCCTAT GATCATAGCA GGGTTAAGCT AGCACAGCTT 5460 

GCTGAAAAGG ATGGCAAACT GACTGATTAT ATCAATGCCA ATTATGTTGA TGGCTACAAC 552 0 

AGACCAAAAG CTTATATTGC TGCCCAAGGC CCACTGAAAT CCACAGCTGA AGATTTCTGG 5580 

AGAATGATAT GGGAACATAA TGTGGAAGTT ATTGTCATGA TAACAAACCT CGTGGAGAAA 5 640 

GGAAGGAGAA AATGTGATCA GTACTGGCCT GCCGATGGGA GTGAGGAGTA CGGGAACTTT 5700 

CTGGTCACTC AGAAGAGTGT GCAAGTGCTT GCCTATTATA CTGTGAGGAA TTTTACTCTA 5760 

AGAAACACAA AAATAAAAAA GGGCTCCCAG AAAGGAAGAC CCAGTGGACG TGTGGTCACA 582 0 

CAGTATCACT ACACGCAGTG GCCTGACATG GGAGTACCAG AGTACTCCCT GCCAGTGCTG 5880 

ACCTTTGTGA GAAAGGCAGC CTATGCCAAG CGCCATGCAG TGGGGCCTGT TGTCGTCCAC 5 940 

TGCAGTGCTG GAGTTGGAAG AACAGGCACA TATATTGTGC TAGACAGTAT GTTGCAGCAG 6000 

ATT CAACACG AAGGAACTGT CAACATATTT GGCTTCTTAA AACACATCCG TTCACAAAGA 6060 

AATTATTTGG TACAAACTGA GGAGCAATAT GTCTTCATTC ATGATACACT GGTTGAGGCC 6120 

ATACTTAGTA AAGAAACTGA GGTGCTGGAC AGT CATATTC ATGCCTATGT TAATGCACTC 6180 

CTCATTCCTG GACCAGCAGG CAAAACAAAG CTAGAGAAAC AATTCCAGCT CCTGAGCCAG 6240 

TCAAATATAC AGCAGAGTGA CTATTCTGCA GCCCTAAAGC AATGCAACAG GGAAAAGAAT 6300 

CGAACTTCTT CTATCATCCC TGTGGAAAGA TCAAGGGTTG GCATTTCATC CCTGAGTGGA 6360 

GAAGGCACAG ACTACATCAA TGCCTCCTAT ATCATGGGCT ATTACCAGAG CAATGAATTC 6420 

ATCATTACCC AGCACCCTCT CCTTCATACC ATCAAGGATT TCTGGAGGAT GATATGGGAC 6480 

CATAATGCCC AACTGGTGGT TATGATTCCT GATGGCCAAA ACATGGCAGA AGATGAATTT 6540 

GTTTACTGGC CAAATAAAGA TGAGCCTATA AATTGTGAGA GCTTTAAGGT CACTCTTATG 6600 

GCTGAAGAAC ACAAATGTCT AT CT AATGAG GAAAAACTTA TAATTCAGGA CTTTATCTTA 6660 

GAAGCTACAC AGGATGATTA TGTACTTGAA GTGAGGCACT TTCAGTGTCC TAAATGGCCA 672 0 

AATCCAGATA GCCCCATTAG TAAAACTTTT GAACTTATAA GTGTTATAAA AGAAGAAGCT 6780 

GCCAATAGGG ATGGGCCTAT GATTGTTCAT GATGAGCATG GAGGAGTGAC GGCAGGAACT 6840 

TTCTGTGCTC TGACAACCCT TATGCACCAA CTAGAAAAAG AAAATTCCGT GGATGTTTAC 6900 

CAGGTAGCCA AGATGATCAA TCTGATGAGG CCAGGAGTCT TTGCTGACAT TG AG CAGT AT 6960 

CAGTTTCTCT ACAAAG TG AT CCTCAGCCTT GTGAGCACAA GGCAGGAAGA GAATCCATCC 7020 

ACCTCTCTGG ACAGTAATGG TGCAGCATTG CCTGATGGAA ATATAGCTGA GAGCTTAGAG 7080 

TCTTTAGTTT AACACAGAAA GGGGTGGGGG GACTCACATC TGAGCATTGT TTTCCTCTTC 7140 

CTAAAATTAG GCAGGAAAAT CAGTCTAGTT CTGTTATCTG TTGATTTCCC AT CACCTGAC 7200 

AGTAACTTTC ATGACATAGG ATTCTGCCGC CAAATTTATA T CAT T AACAA TGTGTGCCTT 72 60 

TTTGCAAGAC TTGTAATT T A CTTATTATGT TTGAACTAAA ATGATTGAAT TTTACAGTAT 7320 

TTCTAAGAAT GGAATTGTGG TATTTTTTTC TGTATTGATT TTAACAGAAA ATTTCAATTT 73 80 

ATAGAGGTTA GGAATTCCAA ACTACAGAAA ATGTTTGTTT TTAGTGTCAA ATTTTTAGCT 7440 

GTATTTGTAG CAATT AT CAG GTTTGCTAGA AATATAACTT TTAATACAGT AGCCTGTAAA 7500 

TAAAACACTC TTCCATATGA TATTCAACAT TTTACAACTG CAGTATTCAC CTAAAGTAGA 7560 

AATAATCTGT TACTTATTGT AAATACTGCC CTAGTGTCTC CATGGACCAA ATTTATATTT 7620 

ATAATTGTAG ATTTTTATAT TTTACTACTG AGT CAAGTTT TCTAGTTCTG TGTAATTGTT 7680 

TAGTTTAATG ACGTAGTTCA TTAGCTGGTC TTACTCTACC AGTTTTCTGA CATTGTATTG 7740 
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TGTTACCTAA GTCATTAACT TTGTTTCAGC ATGTAATTTT AACTTTTGTG GAAAATAGAA 78 0 0 
ATACCTTCAT TTTGAAAGAA GTTTTTATGA GAATAACACC TTACCAAACA TTGTTCAAAT 7860 
GGTTTTTATC CAAGGAATTG CAAAAATAAA TATAAATATT GCCATTAAAA AAAAAAAAAA 792 0 
AAAAAAAAAA AAAAAAAAAA A 

Seq ID NO: 583 Protein sequence 
Protein Accession #; NP__002 842.1 
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10 
15 

20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



MRILKRFLAC 
QSPINIDEDL 
FKAS KIT FHW 
I I/FE VGTEEN 
TDTVDW I VFK 
TGKEEIHEAV 
HEFLTDGYQD 
LIGTEEIIKE 
RSPTRGSEFS 
GSKTVLRSPH 
ENISQGYIFS 
TAQPDVGSGR 
TEVTPHAFTP 
LNTTPAASSS 
ILPQVTSATE 
KTLMFSQVEP 
SLFSGPSHIP 
PVSVAEFTYT 
LNASLQETSV 
KPVLSANSEP 
AVPSDPILVE 
TISYASEKYE 
EPLNTLINKL 
PHRDGSVTST 
CMSCSSYRES 
PGKSPSANGL 
NETSTDFSFA 
ESRIGLAEGL 
ISTPPTPIFP 
ADSSNHPDNK 
GPLKSTAEDF 
LAYYTVRNFT 
KRHAVGPVW 
YVF IHDTLVE 
AALKQCNREK 
TIKDFWRMIW 
EEKLIIQDFI 
HDEHGGVTAG 
LVSTRQEENP 



11 
I 

IQLLCVCRLD 
TQVNVNLKKL 
GKCNMSSDGS 
LDFKAIIDGV 
DTVSISESQL 
CSSEPENVQA 
LGAILNNLLP 
EEEGKD I EEG 
GKGDVPNTSL 
MNLSGTAESL 
SENPETITYD 
ESFLQTNYTE 
SSRQQDLVST 
DSALHATPVF 
SDKVPLHASL 
PSSDAMMHAR 
IPKSSLITPT 
TSVFGDDNKA 
SISSTKGMFP 
ASSDPASSEM 
TPKVDKISST 
PVLLKSESSH 
IHSDEILTST 
KLLFPSKATS 
QEKVMNDSDT 
SQKHNDGKEE 
DTNEKDADGI 
ESEKKAVI PL 
ISDDVGAIPI 
HKNRYINIVA 
WRMIWEHNVE 
LRNTKIKKGS 
HCSAGVGRTG 
AILSKETEVL 
NRTSSIIPVE 
DHNAQIj WM I 
LEATQDDYVL 
TFCALTTLMH 
STSLDSNGAA 



21 
I 

WANGYYRQQR 
KFQGWDKTSL 
EHSLEGQKFP 
ESVSRFGKQA 
AVFCEVLTMQ 
DPENYTSLLV 
NMSYVLQIVA 
AIVNPGRDSA 
NSTSQPVTKL 
NTVS I TEYEE 
VLIPESARNA 
IRVDESEKTT 
VNWYSQTTQ 
PSVDVSFESI 
PVAGGDLLLE 
SSGPEPSYAL 
ASLLQPTHAL 
LSKSEIIYGN 
GSLAHTTTKV 
LSPSTQLLFY 
MLHLIVSNSA 
QWPSLYSND 
KSSVTGKVFA 
ELSHSAKSDA 
HENSLMDQNN 
NDIQTGSALtL 
LAAGDSEITP 
VIVSALTFIC 
KHFPKHVADL 
YDHSRVKLAQ 
VIVMITNLVE 
QKGRPSGRW 
TYIVLDSMLQ 
DSHIHAYVNA 
RSRVGISSLS 
PDGQNMAEDE 
EVRHFQCPKW 
QLEKENSVDV 
LPDGNIAESL 



31 

I 

KLVEEIGWSY 
ENTF IHNTGK 
LEMQIYCFDA 
ALDPFILLNL 
QSGYVMLMDY 
TWERPRWYD 
ICTNGLYGKY 
TNQIRKKEPQ 
ATEKDISLTS 
ESLLTSFKLD 
SEDSTSSGSE 
KSFSAGPVMS 
PVYNGETPLQ 
LSSYDGAPLL 
PSLAQYSDVL 
SDNEGSQHIF 
SGDGEWSGAS 
ETELQIPSFN 
FDHEISQVPE 
ETSASFSTEV 
SSENMLHSTS 
ELFQTANLEI 
GIPTVASDTF 
GLVGGGEDGD 
PISYSLSENS 
PLSPESKAWA 
GFPQSPTSSV 
LWLVGILIY 
HASSGFTEEF 
LAEKDGKLTD 
KGRRKCDQYW 
TQYHYTQWPD 
QIQHEGTVNI 
LLIPGPAGKT 
GEGTDYINAS 
FVYWPNKDEP 
PNPDSPISKT 
YQVAKMINLM 
ESLV 



41 

I 

TGALNQKNWG 
TVEINLTNDY 
DRFSSFEEAV 
LPNSTDKYYI 
LQNNFREQQY 
TMIEKFAVIiY 
SDQLIVDMPT 
ISTTTHYNRI 
QTVTELPPHT 
TGAEDSSGSS 
ESLKDPSMEG 
QGPSVTDLEM 
PSYSSEVFPL 
PFSSASFSSE 
STTHAASETL 
TVSYSSAIPV 
SDSEFLLPDT 
EMVYPSESTV 
NNFSVQPTHT 
LLQPSFQASD 
VPVFDVSPTS 
NQAHPPKGRH 
VSTDHSVPIG 
TDDDGDDDDD 
EEDNRVTSVS 
VLTSDEESGS 
TSENSEVFHV 
WRKCFQTAHF 
ETLKEFYQEV 
YINANYVDGY 
PADGSEEYGN 
MGVPEYSLPV 
FGFLKHIRSQ 
KLEKQFQLLS 
YIMGYYQSNE 
INCESFKVTL 
FELISVIKEE 
RPGVFADIEQ 



Seq ID NO: 584 DNA sequence 

Nucleic Acid Accession #: NM_005688.1 

Coding sequence: 126.. 4439 



CCGGGCAGGT 
AGGGGCGCAG 
AGAAGATGAA 
GTGTGAGGGA 
GGAGAACTCG 
TCTCTCTTGA 
GAAAGTACCA 
ACCCAGTGGA 
CCCGTGTGGC 
ACGAGTCTTC 
AAGTTGGGCC 
TCATCCTGTC 
TCATGGTGAA 
TGTTGTTAGT 
CTTGGGCATT 
TTAAGAAGAT 
TTTGCTCCAA 
GAGGACCCGT 
GCTTCCTGGG 
T C ACAGCAT A 
ATGAAGTTCT 
AGAGTGTTCA 
AGGGTATCAC 
CTGTTCATAT 
TCTTCAATTC 
AAGCCT CAGT 
TAAAGAACAA 
GGGACTCCTC 
ACAAGAGGGC 
AGGCGGTGCT 
CCGAAGAGGA 



11 

! 

GGCTCATGCT 
GAATTCTGAT 
GGATAT CGAC 
GAGAACCAGC 
ACCGTTGGAA 
TGCCTCCATG 
TCATGGCTTG 
CAATGCTGGG 
CCACAAGAAG 
TGACGTGAAC 
AGACGCTGCT 
CAT CGTGTGC 
ACACCTCTTG 
GCTGGGCCTC 
GAATTACCGA 
CCTTAAGTTA 
CGATGGGCAG 
TGTTGCCATC 
ATCAGCTGTT 
TTTCAGGAGA 
TACTTACATT 
AAAAATCCGC 
TGTGGGTGTG 
GACCCTGGGC 
CATGACTTTT 
GGCTGTTGAC 
ACCAGCCAGT 
CCACTCCAGT 
TTCCAGGGGC 
GGCAGAGCAG 
AGAAGGCAAG 



21 
I 

CGGGAGCGTG 
GTGAAACTAA 
ATAGGAAAAG 
ACTTCTGGGA 
TGCCAAGATG 
CATTCTCAGC 
AGTGCTCTGA 
CTTTTTTCCT 
GGGGAGCTCT 
TGCAGAAGAC 
TCCCTGCGAA 
CTGATGATCA 
GAGTATACCC 
CTCCTGACGG 
ACCGGTGTCC 
AAGAACATTA 
AGAATGTTTG 
TTAGGCATGA 
TTTATCCTCT 
AAATGCGTGG 
AAATTTATCA 
GAGGAGGAGC 
GCTCCCATTG 
TTCGATCTGA 
GCTTTGAAAG 
AGATTTAAGA 
CCTCACATCA 
ATCCAGAACT 
AAGAAAGAGA 
AAAGGCCACC 
CACATCCACC 



31 

I 

GTTGAGCGGC 
CAGTCTGTGA 
AGTATATCAT 
CGCACAGAGA 
CCTTGGAAAC 
TCAGAATCCT 
AGCCCATCCG 
GTATGACTTT 
CAATGGAAGA 
TAGAGAGACT 
GGGTTGTGTG 
CGCAGCTGGC 
AGGCAACAGA 
AAAT CGTGCG 
GCTTGCGGGG 
AAGAGAAATC 
AGGCAGCAGC 
TTTATAATGT 
TTTACCCAGC 
CCGCCACGGA 
AAATGTATGC 
GTCGGATATT 
TGGTGGTGAT 
CAGCAGCACA 
TAACACCGTT 
GTTTGTTTCT 
AGATAGAGAT 
CGCCCAAGCT 
AGGTGAGGCA 
TCCTCCTGGA 
TGGGCCACCT 



41 
I 

TGGCGCGGTT 
GCCCTGGAAC 
CCCCAGTCCT 
CCGTGAAGAT 
AGCAGCCCGA 
GG ATG AG GAG 
GACTACTTCC 
TTCGTGGCTT 
CGTGTGGTCT 
GTGGCAAGAA 
GATCTTCTGC 
TGGCTTCAGT 
GTCTAACCTG 
GTCTTGGTCG 
GGCCATCCTA 
CCTGGGTGAG 
CGTTGGCAGC 
AATTATTCTG 
AATGATGTTT 
TGAACGTGTC 
CTGGGT CAAA 
GGAAAAAGCC 
TGCCAGCGTG 
GGCTTTCACA 
TTCAGTAAAG 
AATGGAAGAG 
GAAAAATGCC 
GACCCCCAAA 
GCTGCAGCGC 
CAGTGACGAG 
GCGCTTACAG 



51 
I 

KKYPTCNSPK 
RVSGGVSEMV 
KGKGKLRALS 
YNGSLTSPPC 
KFSRQVFSSY 
QQLDGEDQTK 
DNPELDLFPE 
GTKYNEAKTN 
VEGTSASLND 
PATSAIPFIS 
NVWFPSSTDI 
PHYSTFAYFP 
VTPLLLDNQI 
LFRHLHTVSQ 
EFGSESGVLY 
HDSVGVTYQG 
DGLTALNISS 
MPNMYDNVNK 
VSQASGDTSL 
VDTLLKTVLP 
HMHSASLQGL 
VFATPVLSID 
NGHVAITAVS 
RDSDGLSIHK 
SDSQTGMDRS 
GQGTSDSLNE 
SEAEASNSSH 
YLEDSTSPRV 
QSCTVDLGIT 
NRPKAYIAAQ 
FLVTQKSVQV 
LTFVRKAAYA 
RNYLVQTEEQ 
QSNIQQSDYS 
FIITQHPLLH 
MAEEHKCLSN 
AANRDGPMIV 
YQFLYKVILS 



51 
I 

GTCCTGGAGC 
CTCCGCTCAG 
GGGTATAGAA 
TCCAAGTTCA 
GCCGAGGGCC 
CATCCCAAGG 
AAACACCAGC 
TCTTCTCTGG 
CTGTCCAAGC 
GAGCTGAATG 
CGCACCAGGC 
GGACCAGCCT 
CAGTACAGCT 
CTTGCACTGA 
ACCATGGCAT 
CT CAT CAACA 
CTGCTGGCTG 
GGACCAACAG 
GCATCACGGC 
CAGAAGATGA 
GCATTTTCTC 
GGGTACTTCC 
GTGACCTTCT 
GTGGTGACAG 
TCCCTCTCAG 
GTTCACATGA 
ACCTTGGCAT 
ATGAAAAAAG 
ACTGAGCATC 
CGGCCCAGTC 
AGGACACTGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
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ACAGCAT CGA TCTGGAGATC CAAGAGGGTA AACTGGTTGG AATCTGCGGC AGTGTGGGAA 192 0 

GTGGAAAAAC CTCTCTCATT TCAGCCATTT TAGGCCAGAT GACGCTTCTA GAGGGCAGCA 1980 

TTG CAATCAG TGGAACCTTG GCTTATGTGG CCCAGCAGGC CTGGATCCTC AATGCTACTC 2 040 

TGAGAGACAA CATCCTGTTT GGGAAGGAAT ATGATGAAGA AAGATACAAC TCTGTGCTGA 2100 

5 ACAGCTGCTG CCTGAGGCCT GACCTGGCCA TTCTTCCCAG CAGCGACCTG ACGGAGATTG 2160 

GAGAGCGAGG AGCCAACCTG AGCGGTGGGC AGCGCCAGAG GATCAGCCTT GCCCGGGCCT 222 0 

TGTATAGTGA CAGGAGCATC TACATCCTGG ACGACCCCCT CAGTGCCTTA GATGCCCATG 2280 

TGGG CAACCA CATCTTCAAT AGTGCTATCC GGAAACATCT CAAGTCCAAG ACAGTTCTGT 2340 

TTGTTACCCA CCAGTTACAG TACCTGGTTG ACTGTGATGA AGTGATCTTC ATGAAAGAGG 2400 

10 GCTGTATTAC GGAAAGAGGC ACCCATGAGG AACTGATGAA TTTAAATGGT GACTATGCTA 2460 

CCATTTTTAA TAACCTGTTG CTGGGAGAGA CACCGCCAGT TGAGATCAAT TCAAAAAAGG 252 0 

AAACCAGTGG TTCACAGAAG AAGTCACAAG ACAAGGGTCC TAAAACAGGA TCAGTAAAGA 25 80 

AGGAAAAAGC AGTAAAGCCA GAGGAAGGGC AGCTTGTGCA GCTGGAAGAG AAAGGGCAGG 2640 

GTTCAGTGCC CTGGTCAGTA TATGGTGTCT ACATCCAGGC TGCTGGGGGC CCCTTGGCAT 2700 

15 TCCTGGTTAT TATGGCCCTT TTCATGCTGA ATGTAGGCAG CACCGCCTTC AGCACCTGGT 2 760 

GGTTGAGTTA CTGGATCAAG CAAGGAAGCG GGAACACCAC TGTGACTCGA G GG AACG AG A 2 82 0 

CCTCGGTGAG TGACAGCATG AAGGACAATC CTCATATGCA GTACTATGCC AGCATCTACG 2 880 

CCCTCTCCAT GGCAGTCATG CTGATCCTGA AAGCCATTCG AGGAGTTGTC TTTGTCAAGG 2 940 

GCACGCTGCG AGCTTCCTCC CGGCTGCATG ACGAGCTTTT CCGAAGGATC CTTCGAAGCC 3000 

20 CTATGAAGTT TTTTGACACG ACCCCCACAG GGAGGATTCT CAACAGGTTT T CCAAAGAC A 3060 

TGGATGAAGT TGACGTGCGG CTGCCGTTCC AGGCCGAGAT GTTCATCCAG AACGTTATCC 312 0 

TGGTGTTCTT CTGTGTGGGA ATGATCGCAG GAGTCTTCCC GTGGTTCCTT GTGGCAGTGG 3180 

GGCCCCTTGT CATCCTCTTT TCAGTCCTGC ACATTGTCTC CAGGGTCCTG ATTCGGGAGC 3240 

TGAAGCGTCT GGACAATATC ACGCAGT CAC CTTTCCTCTC CCACATCACG TCCAGCATAC 3300 

25 AGGGCCTTGC CACCATCCAC GCCTACAATA AAGGGCAGGA GTTTCTGCAC AGATACCAGG 3360 

AGCTGCTGGA TGACAACCAA GCTCCTTTTT TTTTGTTTAC GTGTGCGATG CGGTGGCTGG 3420 

CTGTGCGGCT GGACCTCATC AGCATCGCCC TCATCACCAC CACGGGGCTG ATGATCGTTC 3480 

TTATGCACGG GCAGATTCCC CCAGCCTATG CGGGTCTCGC CATCTCTTAT GCTGTCCAGT 3540 

TAACGGGGCT GTTCCAGTTT ACGGTCAGAC TGGCATCTGA GACAGAAGCT CGATTCACCT 3600 

30 CGGTGGAGAG GATCAATCAC TACATTAAGA CTCTGTCCTT GGAAGCACCT GCCAGAATTA 3660 

AGAACAAGGC TCCCTCCCCT GACTGGCCCC AGGAGGGAGA GGTGACCTTT GAGAACGCAG 3 720 

AGATGAGGTA CCGAGAAAAC CTCCCTCTTG TCCTAAAGAA AGT AT CCTTC ACGATCAAAC 3780 

CTAAAGAGAA GATTGGCATT GTGGGGCGGA CAGGATCAGG GAAGT CCTCG CTGGGGATGG 3 840 

CCCTCTTCCG TCTGGTGGAG TTATCTGGAG GCTGCATCAA GATTGATGGA GTGAGAATCA 3900 

35 GTGATATTGG CCTTGCCGAC CTCCGAAGCA AACTCTCTAT CATTCCTCAA GAGCCGGTGC 3960 

TGTTCAGTGG CACTGTCAGA TCAAATTTGG ACCCCTTCAA CCAGTACACT GAAGACCAGA 4020 

TTTGGGATGC CCTGGAGAGG ACACACATGA AAGAATGTAT TGCTCAGCTA CCTCTGAAAC 40 80 

TTGAATCTGA AGTGATGGAG AATGGGGATA ACTTCTCAGT GGGGGAACGG CAGCTCTTGT 4140 

GCATAGCTAG AGCCCTGCTC CGCCACTGTA AGATTCTGAT TTTAGATGAA GCCACAGCTG 4200 

40 CCATGGACAC AGAGACAGAC TTATTGATTC AAGAGACCAT CCGAGAAGCA TTTGCAGACT 42 60 

GTACCATGCT GACCATTGCC CATCGCCTGC ACACGGTTCT AGGCTCCGAT AGGATTATGG 432 0 

TGCTGGCCCA GGGACAGGTG GTGGAGTTTG ACACCCCATC GGTCCTTCTG TCCAACGACA 4380 

GTTCCCGATT CTATGCCATG TTTGCTGCTG CAGAGAACAA GGTCGCTGTC AAGGGCTGAC 4440 

TCCTCCCTGT TGACGAAGTC TCTTTTCTTT AGAGCATTGC CATTCCCTGC CTGGGGCGGG 4500 

45 CCCCTCATCG CGTCCTCCTA CCGAAACCTT GCCTTTCTCG ATTTTATCTT TCGCACAGCA 4560 

GTTCCGGATT GGCTTGTGTG TTTCACTTTT AGGGAGAGTC ATATTTTGAT TATTGTATTT 462 0 

ATTCCATATT CATGTAAACA AAATTTAGTT TTTGTTCTTA ATTGCACTCT AAAAGGTTCA 4680 

GGGAACCGTT ATTATAATTG TATCAGAGGC CTATAATGAA GCTTTATACG TGTAGCTATA 4740 

T CT AT AT AT A ATTCTGTACA TAGCCTATAT TTACAGTGAA AATGTAAGCT GTTTATTTTA 4800 

50 TAT TAAAATA AGCACTGTGC TAATAACAGT GCATATTCCT TTCTATCATT TTTGTACAGT 4 860 

TTGCTGTACT AGAGATCTGG TTTTGCTATT AGACTGTAGG AAGAGTAGCA TTTCATTCTT 4 92 0 

CTCTAGCTGG TGGTTTCACG GTGCCAGGTT TTCTGGGTGT CCAAAGGAAG ACGTGTGGCA 4980 

ATAGTGGGCC CTCCGACAGC CCCCTCTGCC GCCTCCCCAC AGCCGCTCCA GGGGTGGCTG 5040 

GAGACGGGTG GGCGGCTGGA GACCATGCAG AGCGCCGTGA GTTCTCAGGG CTCCTGCCTT 510 0 

55 CTGTCCTGGT GTCACTTACT GTTTCTGTCA GGAGAGCAGC GGGGCGAAGC CCAGGCCCCT 5160 

TTTCACTCCC TCCATCAAGA ATGGGGATCA CAGAGACATT CCTCCGAGCC GGGGAGTTTC 5220 

TTTCCTGCCT TCTTCTTTTT GCTGTTGTTT CTAAACAAGA ATCAGTCTAT CCACAGAGAG 52 8 0 

TCCCACTGCC TCAGGTTCCT ATGGCTGGCC ACTGCACAGA GCTCTCCAGC TCCAAGACCT 5340 

GTTGGTTCCA AGCCCTGGAG CCAACTGCTG CTTTTTGAGG TGGCACTTTT TCATTTGCCT 5400 

60 ATTCCCACAC CTCCACAGTT CAGTGGCAGG GCTCAGGATT TCGTGGGTCT GTTTTCCTTT 5460 

CTCACCGCAG TCGTCGCACA GTCTCTCTCT CTCTCTCCCC TCAAAGTCTG CAACTTTAAG 5520 

CAGCTCTTGC TAATCAGTGT CTCACACTGG CGTAGAAGTT TTTGTACTGT AAAGAGACCT 5580 

ACCTCAGGTT GCTGGTTGCT GTGTGGTTTG GTGTGTTCCC GCAAACCCCC TTTGTGCTGT 5640 

GGGGCTGGTA GCTCAGGTGG GCGTGGTCAC TGCTGTCATC AGTTGAATGG TCAGCGTTGC 5700 

65 ATGTCGTGAC CAACTAGACA TTCTGTCGCC TTAGCATGTT TGCTGAACAC CTTGTGGAAG 5760 

CAAAAATCTG AAAATGTGAA TAAAATTATT TTGGATTTTG TAAAAAAAAA AAAAAAAAAA 582 0 

AAAAAAAAAA AAAAAAAA 

Seq ID NO: 585 Protein sequence 
70 Protein Accession #: NP_005679.1 

1 11 21 31 41 51 

I I I I I I 

MKDIDIGKEY IIPSPGYRSV RERTSTSGTH RDREDSKFRR TRPLECQDAL ETAARAEGLS 60 

75 LDASMHSQLR ILDEEHPKGK YHHGLSALKP IRTTSKHQHP VDNAGLFS CM TFSWLSSLAR 120 

VAHKKGELSM EDVWSLSKHE SSDVNCRRLE RLWQEELNEV GPDAASLRRV VWIFCRTRLI 180 

LSIVCLMITQ LAGFSGPAFM VKHLLEYTQA TESNLQYSLIj LVLGLLLTE I VRSWSLALTW 240 

ALNYRTGVRL RGAILTMAFK KILKLKNIKE KSLGELINIC SNDGQRMFEA AAVGSLLAGG 300 

PWAILGMIY NVI ILGPTGF LGSAVFILFY PAMMFASRLT AYFRRKCVAA TDERVQKMNE 3 60 

80 VLTYIKFIKM YAWVKAFSQS VQKIREEERR ILEKAGYFQG ITVGVAPIW VIASWTFSV 42 0 

HMTLGFDLTA AQAFTWTVF NSMTFALKVT PFSVKSLSEA SVAVDRFKSL FLMEEVHMIK 480 

NKPASPHIKI EMKNATLAWD SSHSSIQNSP KLTPKMKKDK RASRGKKEKV RQLQRTEHQA 540 

VLAEQKGHLL LDSDERPSPE EEEGKHIHLG HLRLQRTLHS IDLEIQEGKL VGICGSVGSG 600 

KTSLISAILG QMTLLEGSIA I SGTFAYVAQ QAWILNATLR DNILFGKEYD EERYNSVLNS 660 

85 CCLRPDLAIL PSSDLTEIGE RGANLSGGQR QRISLARALY SDRSIYILDD PLSALDAHVG 72 0 

NHIFNSAIRK HLKSKTVLFV THQLQYLVDC DEVIFMKEGC ITERGTHEEL MNLNGDYATI 780 

FNNLLLGETP PVEINSKKET SGSQKKSQDK GPKTGSVKKE KAVKPEEGQL VQLEEKGQGS 840 



413 



10 



WO 02/086443 

VPWSVYGVYI QAAGGPLAFL VIMALFMLNV GSTAFSTWWIi SYWIKQGSGN TTVTRGNETS 900 

VSDSMKDNPH MQYYASIYAL SMAVMLILKA IRGWFVKGT LRASSRLHDE LFRRILRSPM 960 

KFFDTTPTGR ILNRFSKDMD EVDVRLPFQA EMFIQNVILV FFCVGMIAGV FPWFLVAVGP 1020 

LVILFSVLHI VSRVLIRELK RLDNITQSPF LSHXTSSIQG LATIHAYNKG QEFLHRYQEL 1080 

LDDNQAPFFL FTCAMRWLAV RLDLISIALI TTTGLMIVLM HGQIPPAYAG LAISYAVQLT 1140 

GLFQFTVRLA SETEARFTSV ERINHYIKTL SLEAPARIKN KAPSPDWPQE GEVTFENAEM 12 00 

RYRENLPLVli KKVSFTIKPK EKIGIVGRTG SGKSSLGMAL FRLVELSGGC IKIDGVRISD 12 60 

IGLADLRSKL SIIPQEPVLF SGTVRSNLDP FNQYTEDQIW DALERTHMKE CIAQLPLKLE 1320 

SEVMENGDNF SVGERQLLCI ARALLRHCKI LILDEATAAM DTETDLLIQE TIREAFADCT 1380 
MLTIAHRLHT VLGSDRIMVL AQGQWEFDT PSVLLSNDSS RFYAMFAAAE NKVAVKG 



PCT/US02/12476 



15 

20 
25 
30 



Seq ID NO: 586 DNA sequence 

Nucleic Acid Accession #: NM_001327.1 

Coding sequence: 89.. 631 



AGCAGGGGGC 
CTGAGAGCCG 
GACGGGCGAT 
TGGCGGCCCA 
AAGGGCCTCG 
GCTGAATGGA 
CCTCGCCATG 
GGATGCCCCA 
CATACTGACT 
CTGTCTCCAG 
GGCTCAGCCT 
GCCTCCTCCC 
GTTTGTCGCT 



11 

I 

GCTGTGTGTA 
GGCAGAGGCT 
GCTGATGGCC 
GGAGAGGCGG 
GGGCCGGGAG 
TG CTGCAG AT 
CCTTTCGCGA 
CCGCTTCCCG 
AT CCGACTG A 
CAGCTTTCCC 
CCCTCAGGGC 
CTAGGGAATG 
GGAGGAGGAC 



21 
I 

CCGAGAATAC 
CCGGAGCCAT 
CAGGAGGCCC 
GTGCCACGGG 
GAGGCGCCCC 
GCGGGGCCAG 
CACCCATGGA 
TGCCAGGGGT 
CTGCTGCAGA 
TGTTGATGTG 
AGAGGCGCTA 
GTCCCAGCAC 
GGCTTACATG 



31 
I 

GAGAATACCT 
GCAGGCCGAA 
TGGCATTCCT 
CGGCAGAGGT 
GCGGGGTCCG 
GGGGCCGGAG 
AGCAGAGCTG 
GCTTCTGAAG 
CCACCGCCAA 
GATCACGCAG 
AGCCCAGCCT 
GAGTGGCCAG 
TTTGTTTCTG 



41 
I 

CGTGGGCCCT 
GGCCGGGGCA 
GATGGCCCAG 
CCCCGGGGCG 
CATGGCGGCG 
AGCCGCCTGC 
GCCCGCAGGA 
GAGTTCACTG 
CTGCAGCTCT 
TGCTTTCTGC 
GGCGCCCCTT 
TTCATTGTGG 
TAGAAAATAA 



51 
I 

GACCTTCTCT 
CAGGGGGTTC 
GGGGCAATGC 
CAGGGGCAGC 
CGGCTTCAGG 
TTGAGTTCTA 
GCCTGGCCCA 
TGTCCGGCAA 
CCATCAGCTC 
CCGTGTTTTT 
CCTAGGTCAT 
GGGCCTGATT 
AACTGAGCTA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



35 



40 



45 



50 



55 



Seq ID NO: 5 87 Protein sequence 
Protein Accession #: NP_001318.1 

1 11 21 31 41 51 

I I I I 1 I 

MQAEGRGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPGGGA 
PRGPHGGAAS GLNGCCRCGA RGPESRLLEF YLAMPFATPM EAELARRSLA QDAPPLPVPG 
VLLKEFTVSG NILTIRLTAA DHRQLQLSIS SCLQQLSLLM WITQCFLPVF LAQPPSGQRR 

Seq ID NO: 588 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 52.. 459 



CCTCGTGGGC 
GAAGGCCAGG 
GCTGATGGCC 
GGTCCCCGGG 
CCGCATGGCG 
GACAGCCGCC 
ATCAGCTCCT 
GTGTTTTTGG 
TAGGTCATGC 
GCCTGATTGT 
CTGAGCTA 



11 
I 

CCTGACCTTC 
GCACAGGGGG 
CAGGGGGCAA 
GCGCAGGGGC 
GTGCCGCTTC 
TGCTTCAGTT 
GTCTCCAGCA 
CTCAGGCTCC 
CTCCTCCCCT 
TTGTCGCTGG 



21 
I 

TCTCTGAGAG 
TTCGACGGGC 
TGCTGGCGGC 
AGCAAGGGCC 
TGCGCAGGAT 
CCGACTGACT 
GCTTTCCCTG 
CTCAGGGCAG 
AGGGAATGGT 
AGGAGGACGG 



31 
I 

CCGGGCAGAG 
GATGCTGATG 
CCAGGAGAGG 
TCGGGGCCGA 
GGAAGGTGCC 
GCTGCAGACC 
TTGATGTGGA 
AGGCGCTAAG 
CCCAGCACGA 
CTTACATGTT 



41 
I 

GCTCCGGAGC 
GCCCAGGAGG 
CGGGTGCCAC 
GAGGAGGCGC 
CCTGCGGGGC 
ACCGCCAACT 
TCACGCAGTG 
CCCAGCCTGG 
GTGGCCAGTT 
TGTTTCTGTA 



51 

I 

CATGCAGGCC 
CCCTGGCATT 
GGGCGGCAGA 
CCCGCGGGGT 
CAGGAGGCCG 
GCAGCTCTCC 
CTTTCTGCCC 
CGCCCCTTCC 
CATTGTGGGG 
GAAAATAAAG 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



60 



Seq ID NO: 589 Protein sequence 
Protein Accession #: Eos sequence 



11 



21 



31 



41 



51 



MQAEGQGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPRGGA 60 
65 PRGPHGGAAS AQDGRCPCGA RRPDSRLLQF RLTAADHRQL QLSISSCLQQ LSLLMWITQC 12 0 

FLPVFLAQAP SGQRR 

Seq ID NO: 5 90 DNA sequence 
Nucleic Acid Accession #: NM_005562.1 
70 Coding sequence: 90.. 3671 



75 



80 



85 



ACAGCGGAGC 
AGACAGAGAC 
GCTTCTCGCT 
ATGGGAAGTC 
TCCGCTGCCT 
GCTTTTACCG 
CTCTTAGTGC 
CCAGATGCGA 
ACCAGAGACT 
ACGCGGGCCG 
CAGGTTACTA 
GGCATTCAGC 
TTCATCAAGA 
AATGGT CACA 



11 
1 

GCAGAGTGAG 
TGAGCGGCCC 
CCTCCTGCCC 
CAGGCAGTGT 
CAACTG CAAT 
GCACAGAGAA 
TCGATGTGAC 
CCGATGTCTG 
GCTAGACTCC 
CTGTGTCTGC 
TAATCTGGAT 
CAGCTGCCGC 
TGTTGATGGC 
GCGCCATCAA 



21 
I 

AACCACCAAC 
GGCACCGCCA 
GCAGCCCGGG 
ATCTTTGATC 
GACAACACTG 
AGGGACCGCT 
AACTCTGGAC 
CCAGGCTTCC 
AAGTGTGACT 
AAGCCAGCTG 
GGGGGGAACC 
AGCTCTGCAG 
TGGAAGGCTG 
GATGTGTTTA 



31 

I 

CGAGGCGCCG 
TGCCTGCGCT 
CCACCTCCAG 
GGGAACTTCA 
ATGGCATTCA 
GTTTGCCCTG 
GGTGCAGCTG 
ACATGCTCAC 
GTGACCCAGC 
TTACTGGAGA 
CTGAGGGCTG 
AATACAGTGT 
TCCAACGAAA 
GCTCAGCCCA 



41 

I 

GGCAGCGACC 
CTGGCTGGGC 
GAGGGAAGTC 
CAGACAAACT 
CTGCGAGAAG 
CAATTGTAAC 
TAAACCAGGT 
GGATGCGGGG 
TGGCATCGCA 
ACGCTGTGAT 
TACCCAGTGT 
CCATAAGATC 
TGGGTCTCCT 
ACGACTAGAC 



51 

I 

CCTGCAGCGG 
TGCTGCCTCT 
TGTGATTGCA 
GGTAATGGAT 
TGCAAGAATG 
TCCAAAGGTT 
GTGACAGGAG 
TGCACCCAAG 
GGGCCCTGTG 
AGGTGTCGAT 
TTCTGCTATG 
ACCTCTACCT 
GCAAAGCTCC 
CCTGTCTATT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



414 



WO 02/086443 

TTGTGGCTCC TGCCAAATTT CTTGGGAATC AACAGGTGAG CTATGGGCAA AGCCTGTCCT 90 0 

TTGACTACCG TGTGGACAGA GGAGGCAGAC ACCCATCTGC CCATGATGTG ATTCTGGAAG 960 

GTGCTGGTCT ACGGAT CACA GCTCCCTTGA TGCCACTTGG CAAGACACTG CCTTGTGGGC 1020 

TCACCAAGAC TTACACATTC AGGTTAAATG AGCATCCAAG CAATAATTGG AGCCCCCAGC 1080 

TGAGTT ACT T TGAGTATCGA AGGTTACTGC GGAATCTCAC AGCCCTCCGC ATCCGAGCTA 1140 

C AT ATG GAGA ATACAGTACT GGGTACATTG ACAATGTGAC CCTGATTTCA GCCCGCCCTG 1200 

TCTCTGGAGC CCCAGCACCC TGGGTTGAAC AGTGTATATG TCCTGTTGGG TACAAGGGGC 12 60 

' AATTCTGCCA GG AT TGTGCT TCTGGCTACA AGAGAGATTC AGCGAGACTG GGGCCTTTTG 1320 

GCACCTGTAT TCCTTGTAAC TGTCAAGGGG GAGGGGCCTG TGATCCAGAC ACAGGAGATT 1380 

GTTATTCAGG GGATGAGAAT CCTGACATTG AGTGTGCTGA CTGCCCAATT GGTTTCTACA 1440 

ACGATCCGCA CGACCCCCGC AGCTGCAAGC CATGTCCCTG TCATAACGGG TTCAGCTGCT 1500 

CAGTGATGCC GGAGACGGAG GAGGTGGTGT GCAATAACTG CCCTCCCGGG GTCACCGGTG 1560 

CCCGCTGTGA GCTCTGTGCT GATGGCTACT TTGGGGACCC CTTTGGTGAA CATGGCCCAG 1620 

TGAGGCCTTG TCAGCCCTGT CAATGCAACA ACAATGTGGA CCCCAGTGCC TCTGGGAATT 1680 

GTGACCGGCT GACAGGCAGG TGTTTGAAGT GTATCCACAA CACAGCCGGC ATCTACTGCG 1740 

ACCAGTGCAA AGCAGGCTAC TTCGGGGACC CATTGGCTCC CAACCCAGCA GACAAGTGTC 1800 

GAGCTTGCAA CTGTAACCCC ATGGGCTCAG AGCCTGTAGG ATGTCGAAGT GATGGCACCT 1860 

GTGTTTGCAA GCCAGGATTT GGTGGCCCCA ACTGTGAGCA TGGAGCATTC AGCTGTCCAG 1920 

CTTGCTATAA TCAAGTGAAG ATT CAGATGG ATCAGTTTAT GCAGCAGCTT CAGAGAATGG 1980 

AGGCCCTGAT TTCAAAGGCT CAGGGTGGTG ATGGAGTAGT ACCTGATACA GAGCTGGAAG 2040 

GCAGGATGCA GCAGGCTGAG CAGGCCCTTC AGGACATTCT GAGAGATGCC CAGATTTCAG 2100 

AAGGTG CTAG CAGATCCCTT GGTCTCCAGT TGGCCAAGGT GAGGAGCCAA GAGAACAGCT 2160 

ACCAGAGCCG CCTGGATGAC CTCAAGATGA CTGTGGAAAG AGTTCGGGCT CTGGGAAGTC 2220 

AGTACCAGAA CCGAGTTCGG GATACTCACA GGCTCATCAC TCAGATGCAG CTGAGCCTGG 22 80 

CAGAAAGTGA AGCTTCCTTG GGAAACACTA ACATTCCTGC CTCAGACCAC TACGTGGGGC 2340 

CAAATGGCTT TAAAAGTCTG GCTCAGGAGG CCACAAGATT AGCAGAAAGC CACGTTGAGT 2400 

CAGCCAGTAA CATGGAGCAA CTGACAAGGG AAACTGAGGA CTATTCCAAA CAAGCCCTCT 2460 

CACTGGTGCG CAAGGCCCTG CATGAAGGAG TCGGAAGCGG AAGCGGTAGC CCGGACGGTG 2520 

CTGTGGTGCA AGGGCTTGTG GAAAAATTGG AGAAAACCAA GTCCCTGGCC CAGCAGTTGA 2580 

CAAGGGAGGC CACTCAAGCG GAAATTGAAG CAGATAGGTC TTATCAGCAC AGTCTCCGCC 2640 

TCCTGGATTC AGTGTCTCGG CTTCAGGGAG TCAGTGATCA GTCCTTTCAG GTGGAAGAAG 2700 

CAAAGAGGAT CAAACAAAAA GCGGATTCAC TCTCAACGCT GGTAACCAGG CATATGGATG 2760 

AGTTCAAGCG TACACAAAAG AAT CTGGGAA ACTGGAAAGA AGAAGCACAG CAGCTCTTAC 2 82 0 

AGAATGGAAA AAGTGGGAGA GAGAAAT CAG ATCAGCTGCT TTCCCGTGCC AATCTTGCTA 28 80 

AAAGCAGAGC ACAAGAAGCA CTGAGTATGG GCAATGCCAC TTTTTATGAA GTTGAGAGCA 2 940 

TCCTTAAAAA CCTCAGAGAG TTTGACCTGC AGGTGGACAA CAGAAAAGCA GAAGCTGAAG 3 000 

AAGCCATGAA GAGACTCTCC TACATCAGCC AGAAGGTTTC AGATGCCAGT GACAAGACCC 3 060 

AGCAAGCAGA AAGAGCCCTG GGGAGCGCTG CTGCTGATGC ACAGAGGGCA AAGAATGGGG 312 0 

CCGGGGAGGC CCTGGAAATC TCCAGTGAGA TTGAACAGGA GATTGGGAGT CTGAACTTGG 3180 

AAGCCAATGT GACAGCAGAT GGAGCCTTGG CCATGGAAAA GGGACTGGCC TCTCTGAAGA 3240 

GTGAGATGAG GGAAGTGGAA GGAGAGCTGG AAAGGAAGGA GCTGGAGTTT GACACGAATA 3300 

TGGATGCAGT ACAGATGGTG ATTACAGAAG CCCAGAAGGT TGATACCAGA GCCAAGAACG 3 3 60 

CTGGGGTTAC AATCCAAGAC ACACTCAACA CATTAGACGG CCTCCTGCAT CTGATGGACC 3420 

AGCCTCTCAG TGTAGATGAA GAGGGGCTGG TCTTACTGGA GCAGAAGCTT TCCCGAGCCA 3 480 

AGACCCAGAT CAACAGCCAA CTGCGGCCCA TGATGTCAGA GCTGGAAGAG AGGGCACGTC 3 540 

AGCAGAGGGG CCACCTCCAT TTGCTGGAGA CAAGCATAGA TGGGATTCTG GCTGATGTGA 3 600 

AGAACTTGGA GAACATTAGG GACAACCTGC CCCCAGGCTG CTACAATACC CAGGCTCTTG 3 660 

AGCAACAGTG AAGCTGCCAT AAATATTTCT CAACTGAGGT TCTTGGGATA CAGAT CTCAG 3 720 

GGCTCGGGAG CCATGT CATG TGAGTGGGTG GGATGGGGAC ATTTGAACAT GTTTAATGGG 3 7 80 

TATGCTCAGG TCAACTGACC TGACCCCATT CCTGATCCCA TGGCCAGGTG GTTGTCTTAT 3 840 

TGCACCATAC TCCTTGCTTC CTGATGCTGG GCAATGAGGC AGATAGCACT GGGTGTGAGA 3 900 

ATGATCAAGG ATCTGGACCC CAAAGAATAG ACTGGATGGA AAGACAAACT GCACAGGCAG 3 960 

ATGTTTGCCT CATAATAGTC GTAAGTGGAG TCCTGGAATT TGGACAAGTG CTGTTGGGAT 402 0 

ATAGTCAACT TATTCTTTGA GTAATGTGAC TAAAGGAAAA AACTTTGACT TTGCCCAGGC 40 80 

ATGAAATTCT TCCTAATGTC AGAACAGAGT GCAACCCAGT CACACTGTGG CCAGTAAAAT 4140 

ACTATTGCCT CATATTGTCC TCTGCAAGCT TCTTGCTGAT CAGAGTTCCT CCTACTTACA 4200 

ACCCAGGGTG TGAACATGTT CTCCATTTTC AAGCTGGAAG AAGTGAGCAG TGTTGGAGTG 42 60 

AGGACCTGTA AGGCAGGCCC ATTCAGAGCT ATGGTGCTTG CTGGTGCCTG CCACCTTCAA 432 0 

GTTCTGGACC TGGGCATGAC ATCCTTTCTT TTAATGATGC CATGGCAACT TAGAGATTGC 43 80 

ATTTTTATTA AAGCATTTCC TACCAGCAAA GCAAATGTTG GGAAAGTATT TACTTTTTCG 4440 

GTTTCAAAGT GATAGAAAAG TGTGGCTTGG GCATTGAAAG AGGTAAAATT CTCTAGATTT 4500 

ATTAGTCCTA ATTCAATCCT ACTTTTCGAA CACCAAAAAT GATGCGCATC AATGTATTTT 4560 

ATCTTATTTT CTCAATCTCC TCTCTCTTTC CTCCACCCAT AATAAGAGAA TGTTCCTACT 4620 

CACACTTCAG CTGGGTCACA TCCATCCCTC CATTCATCCT TCCATCCATC TTTCCATCCA 4680 

TTACCTCCAT CCATCCTTCC AACATATATT TATTGAGTAC CTACTGTGTG CCAGGGGCTG 4740 

GTGGGACAGT GGTGACATAG TCTCTGCCCT CATAGAGTTG ATTGTCTAGT GAGGAAGACA 4 800 

AGCATTTTTA AAAAATAAAT TTAAACTTAC AAACTTTGTT TGT CACAAGT GGTGTTTATT 4860 

GCAATAACCG CTTGGTTTGC AACCTCTTTG CTCAACAGAA CATATGTTGC AAGACCCTCC 4 92 0 

CATGGGGGCA CTTGAGTTTT GGCAAGGCTG ACAGAGCTCT GGGTTGTGCA CATTTCTTTG 4 980 

CATTCCAGCT GTCACTCTGT GCCTTTCTAC AACTGATTGC AACAGACTGT TGAGTTATGA 5040 

TAACACCAGT GGGAATTGCT GGAGGAACCA GAGGCACTTC CACCTTGGCT GGGAAGACTA 5100 

TGGTGCTGCC TTGCTTCTGT ATTTCCTTGG ATTTTCCTGA AAGTGTTTTT AAATAAAGAA 5160 
CAATTGTTAG ATGCC 

Seq ID NO: 591 Protein sequence 
Protein Accession #: NP_005553.1 



MPALWLGCCL 
DGIHCEKCKN 
HMLTDAGCTQ 
PEGCTQCFCY 
SSAQRLDPVY 
MPLGKTLPCG 
DNVTLISARP 
GGACD PDTGD 



11 
I 

CFSL3JLPAAR 
GFYRHRERDR 
DQRLIjDSKCD 
GHSASCRSSA 
FVAPAKFLGN 
LTKTYTFRLN 
VSGAPAPWVE 
CYSGDENPDI 



21 
I 

ATSRREVCDC 
CLPCNCNSKG 
CDPAGIAGPC 
EYSVHKITST 
QQVSYGQSLS 
EHPSNNWSPQ 
QCICPVGYKG 
ECADCPIGFY 



31 

1 

NGKSRQCIFD 
SLSARCDNSG 
DAGRCVCKPA 
FHQDVDGWKA 
FDYRVDRGGR 
LSYFEYRRLL 
QFCQDCASGY 
NDPHDPRSCK 



41 
I 

RELHRQTGNG 
RCSCKPGVTG 
VTGERCDRCR 
VQRNGSPAKIj 
HPSAHDVILE 
RNLTALRIRA 
KRDSARLGPF 
PCPCHNGFSC 



51 
I 

FRCLNCNDNT 
ARCDRCLPGF 
SGYYNLDGGN 
QWSQRHQDVF 
GAGLRITAPL 
TYGEYSTGYI 
GTCIPCNCQG 
SVMPETEEW 



60 
120 
180 
240 
300 
360 
420 
480 



415 



WO 02/086443 

CNNCPPGVTG ARCELCADGY FGDPFGEHGP VRPCQPCQCN NNVDPSASGN CDRLTGRCLK 540 

CIHNTAGIYC DQCKAGYFGD PIjAPNPADKC RACNCNPMGS EPVGCRSDGT CVCKPGFGGP 600 

NCEHGAFSCP ACYNQVKIQM DQFMQQLQRM EALISKAQGG DGWPDTELE GRMQQAEQAL S60 

QDILRDAQIS EGASRSLGLQ LAKVRSQENS YQSRLDDLKM TVERVRALGS QYQNRVRDTH 720 

5 RLITQMQLSL AESEASLGNT NIPASDHYVG PNGFKSLAQE ATRLAESHVE SASNMEQLTR 780 

ETEDYSKQAL SLVRKALHEG VGSGSGSPDG AWQGLVEKL EKTKSLAQQL TREATQAEIE 840 

ADRSYQHSLR LLDSVSRLQG VSDQSFQVEE AKRIKQKADS DSTDVTRHMD EFKRTQKNLG 900 

NWKEEAQQLB QNGKSGREKS DQLLSRANLA KSRAQEAIiSM GNATFYEVES ILKNLREFDL 960 

QVDNRKAEAE EAMKRLSYIS QKVSDASDKT QQAERALGSA AADAQRAKNG AGEALEISSE 1020 

10 IEQEIGSLNL EANVTADGAL AMEKGLASLK SEMREVEGEIi ERKEkEFDTN MDAVQMVITE 1080 

AQKVDTRAKN AGVTIQDTLN TLDGIiLHLMD QPLSVDEEGL VLLEQKLSRA KTQ1NSQLRP 1140 
MMSELEERAR QQRGHLHLLE TSIDGILADV KNLENlRDNli PPGCYNTQAL EQQ 
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Seq ID NO i 592 DNA sequence 

Nucleic Acid Accession #: AF101051.1 

Coding sequence: 221.85 6 



GAGCAACCTC 
CGACCCAGAG 
GCGGGGCCCA 
ACCTGCCACC 
GCTGTTGGGC 
GCCCCAGTGG 
CGAGGGGCTG 
TGACTCCTTG 
CATCCTCCTG 
CTTGGAAGAC 
TCTTGCAGGT 
ATTCTATGAC 
TGGCTGGGCT 
CCGAAAAACA 
GAAAGACTAC 
GGACATTGAG 
GTATGGTATT 
AAACATGGCT 
TTGTATTACT 
TATATATAGA 
CTCATTATGT 

CCATATTGAT 
CAGT CAAATA 
CTAATTTACC 
TTATTTTTTA 
TTTCATTGGT 
AGCCAAGAAG 
GTGATAAATT 
TTTGCTTTGA 
CACAACTTTA 
ACCTTTTTGT 
TATATCTTCC 
GATAATCTGG 
TCTTTTTTCT 
AATATTAATT 
TTTATTTGCT 
CTTCATGTGA 
ACACATACCT 
AAACCTACGC 
ATTCTTTCAG 
TTTCCAGTCT 
GCACTGGTGT 
AGCAAGGCAT 
CTGATCTTCC 
GTGGTTTTGT 
CGTTTTGGTG 
GCCTTAACCA 
AAGATTCTGA 
ACAGATGTAA 
TTTTGAATCA 
TGTTAGCTGG 
CTACACAAGG 
TGCCTTCCAA 
ATACATAGAT 
ACAAAAAAAT 
TTTGATCTTT 
TTATAATGGG 
AAAAGGAAAA 



11 

I 

AGCTTCTAGT 
CTTCTCCAGC 
GCCACCTTCG 
CCTGAGCCAG 
TTCATTCTCG 
AGGATTTACT 
TGGATGTCCT 
CTGAATCTGA 
GGAGTGATAG 
GATGAGGTGC 
CTGGCTATTT 
CCTATGACCC 
GCTGCTTCTC 
ACCTCTTACC 
GTGTGACACA 
ATACTATCAT 
ACAAAACAAA 
TAATCTTATT 
GCTTCCCATT 
TATGTATATA 
TGATACTAGC 
GAAGATGTTT 
TCATTTACTC 
AAGGATGAAT 
CCATAATCTT 
CTCTATCTCC 
AATTTATTAC 
CCTGTTGACC 
AAATATTTGT 
TTGATTGAAT 
TCCCCATTCC 
TAATAAGGTG 
TGACAAATAT 
ATCTGCCAAA 
AGTTTATATT 
CAGCTGGCTG 
TTCACTGCCT 
TCATGTGGTT 
ACATACCTTC 
CTGTGTCTGA 
GTACAGAATG 
CTGGAGACCT 
TTGGCTGCTG 
CACCTCACAG 
AATTTGAAAA 
TTGCTTTTCA 
GTCTCTCAAG 
GGAAGTCTTA 
TGGGAAGAAA 
TAATAACTCA 
CAGCTGACGC 
AAAGTCAGCC 
ACCTGAGAAT 
CTTCATGATG 
TTTATGGCCC 
TTATATTCTT 
AATTTGTATA 
AAAAAAAAAA 



21 
I 

ATCCAGACTC 
GGCGGCGCAG 
GGAGTCCGGG 
CGCGGGCGCC 
CCTTCCTGGG 
CCTATGCCGG 
GCGTGTCGCA 
GCAGCACATT 
CAATCTTTGT 
AGAAGATGAG 
TAGTTGCCAC 
CAGTCAATGC 
TCTGCCTTCT 
CAACACCAAG 
GAGGCAAAAG 
TAACATTAGG 
CAAACAAACA 
TTATCTTCTT 
GAGTAATCAT 
TACATGTTTT 
ATACTTAAAA 
ATTGGTATAT 
TTCTTCATTA 
TCTTTCAATT 
ATAGCACTTG 
TGAATCTAAC 
AAATCAGAAC 
TTCCCACACA 
CCAATTGAGT 
TTTTAAGCTA 
TTAATTGTAT 
TGGTCTGTTT 
TCTCTCTGTA 
TTGAGATAAT 
ACTCTCATTC 
AGACACTGAA 
TCCTCTCTCT 
CAGTGCCTTC 
ATGTGGCTCA 
CATGTTTGTG 
CTATTTCACT 
GGATTTGAGT 
TAAGCTTATT 
TGATGTTGTG 
GTGCTATACT 
AATGTTTGAA 
TGATGAGACA 
TCTTCTGCAG 
TAAAAGCCTA 
TAAGGTGCTA 
TGCTAGGATA 
ACCGTGTCTT 
ATATG CTTTT 
TGTGAGTGTA 
AAAATGACCA 
CTACCACACC 
AAGCATTACT 
AAA 



31 

! 

CAGCGCCGCC 
CGAGCAGGGC 
TTGCCCACCT 
CGAGCGAGTC 
ATGGATCGGC 
CGACAACATC 
GAGCACCGGG 
GCAAGCAACC 
GGCCACCGTT 
GATGGCTGTC 
AGCATGGTAT 
CAGGTACGAA 
GGGAGGTGCC 
GCCCTATCCA 
GAGAAAATCA 
ACCTTAGAAT 
AAAAACCCAT 
TCCT CAAT AT 
ACTCAAATGG 
TCTATTAAAA 
TATCTCTAAA 
TTTCTTTTTC 
GCTTTGGGTG 
CTT CATGCGT 
CATCGTTATT 
ACATTTCATA 
TTTGGAGGCA 
ATCCCTGTAC 
AGCTGCATGC 
CTTATTCATA 
TGTTTTCCCA 
GTCTGAACAA 
GCTGTAAGCA 
GATACTTAAC 
TTTGAACATG 
GAAGTCACTG 
ACCAGTCTAT 
CTCTCTCTAC 
GTGCCTTCCT 
CTCTGTTCCA 
TGAGCAAGAT 
CTTGGTGCTA 
GCTTCATCTG 
GGGAT CCAGT 
AAGGGAAAGA 
AATAAAAAAA 
GTGAAGTAAA 
TGAGTATGGC 
CGTGTTGGTA 
TCTGTTCAGT 
GTTAGTTTGG 
ATGAGGAATT- 
GGAAGTTAAA 
ATTCCATGTG 
ACGAAATTGT 
TGGAAACAGA 
CTTTT T CAAT 



41 

I 

CCGGGCGCGG 
TCCCCGCCTT 
GCAAACTCTC 
ATGGCCAACG 
GCCATCGTCA 
GTGACCGCCC 
CAGAT CCAGT 
CGTGCCTTGA 
GGCATGAAGT 
ATTGGGGGTG 
GGCAATAGAA 
TTTGGTCAGG 
CTACTTTGCT 
AAACCTGCAC 
TGTTGAAACA 
TTTGGGTATT 
GTGTTAAAAT 
AGGAGGGAAG 
GGGAAGGGGT 
ATAGACAGTA 
ATAGGTAAAT 
GTCCTTATAT 
CCTTTGCCAC 
GCCCTTTTCA 
AAGCCCTTAT 
GCCTACATTT 
AATCTTTCTG 
TCTGACCCAT 
TGTTCCCCCA 
GTTTTATATC 
AGTGTAATTA 
AGTGCTAGAC 
AGTCACTTAA 
CAGTTAGAAG 
AACTATG CCT 
AACAAAACCT 
TTCCACTGAA 
CAGTCTATTT 
CTCTCTACCA 
TTTTAACAAC 
GATGTATGGA 
TCAATCACCG 
TAAGCGGTGG 
GAGATAGAAT 
ATTGAGGAAT 
TGTTAAGAAA 
ATTGAGTGCA 
CCAATGCTTT 
AATCCAACAG 
GATGCCCTCA 
AAATGGTACT 
GGACCTAATA 
ATTTAAATGG 
GATATCAGTT 
TACAATAGAA 
CCAATAGACA 
AAATTGTTTT 



51 
I 

ACCCCAACCC 
AACTTCCTCC 
CGCCTTCTGC 
CGGGGCTGCA 
GCACTGCCCT 
AGGCCATGTA 
GCAAAGTCTT 
TGGTGGTTGG 
GTATGAAGTG 
CGATATTTCT 
TCGTTCAAGA 
CTCTCTTCAC 
GTTCCTGTCC 
CTTCCAGCGG 
AACCGAAAAT 
GTAATCTGAA 
ACTCAGTGCT 
ATTTTACCAT 
GCTCCTTAAA 
AAATACTATT 
GTATTTAATT 
ACATATGTAA 
AAGACCTAGC 
TATACTTATT 
TTGTTTTGTG 
TAGTTTCTAA 
CAT G AC CAAA 
AGCACTCTTG 
GGTGTTGTAA 
CCCCTAAACT 
TCATGCGTTT 
TTTCTGGAGT 
TCTTTCTACC 
AGGTAGTGTG 
ATGTAGTGTC 
ACACACGTAC 
CAAAACCTAC 
CCACTGAACA 
GTCTATTTCC 
TGCTCTTACT 
AAGGGTGTTG 
TCTGTGTTTG 
TTTGTAATTC 
ACATGTAAGT 
TAACTGCATA 
TGGGTTTCTT 
CTAAACGAAT 
CTGTGGCTAA 
CAAGGGAGAT 
GAGCTCTTGC 
TCATAATAAA 
AATTTTAGTG 
CTTTTGCCAC 
ACCAAACATT 
TTTATCCAAT 
TTTTGGGGTT 
TTAATTTAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 



Seq ID NO: 593 Protein sequence 
Protein Accession #: AAD16433.1 



1 11 21 31 41 51 

1 I I I I I 

MANAGLQLLG FILAFLGWIG AIVSTALPQW RIYSYAGDNI VTAQAMYEGL WMSCVSQSTG 
QIQCKVFDSL LNLSSTLQAT RALMWG ILL GVIAIFVATV GMKCMKCLED DEVQKMRMAV 
IGGAIFLLAG LA I LVATAW Y GNRIVQEFYD PMTPVNARYE FGQALFTGWA AASLCLLGGA 
LLCCSCPRKT TSYPTPRPYP KPAP.SSGKDY V 



60 
120 
180 
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Seq ID NO: 594 DNA sequence 

Nucleic Acid Accession #: NM_00 6180.1 

Coding sequence: 352.. 2820 

1 11 21 31 41 51 

I 1 I 1 I I 

CCCCCATTCG CAT CTAAC AA GGAATCTGCG CCCCAGAGAG TCCCGGACGC CGCCGGTCGG 60 

TGCCCGGCGC GCCGGGCCAT GCAGCGACGG CCGCCGCGGA GCTCCGAGCA GCGGTAGCGC 120 

CCCCCTGTAA AGCGGTTCGC TATGCCGGGA CCACTGTGAA CCCTGCCGCC TGCCGGAACA 180 

CTCTTCGCTC CGGACCAGCT CAGCCTCTGA TAAGCTGGAC TCGGCACGCC CGCAACAAGC 240 

ACCGAGGAGT TAAGAGAGCC GCAAGCGCAG GGAAGGCCTC CCCGCACGGG TGGGGGAAAG 300 

CGGCCGGTGC AGCGCGGGGA CAGGCACTCG GGCTGGCACT GGCTGCTAGG GATGTCGTCC 360 

TGGATAAGGT GGCATGGACC CGCCATGGCG CGGCTCTGGG GCTTCTGCTG GCTGGTTGTG 420 

GGCTTCTGGA GGGCCGCTTT CGCCTGTCCC ACGTCCTGCA AATGCAGTGC CTCTCGGATC 480 

TGGTGCAGCG ACCCTTCTCC TGGCATCGTG GCATTTCCGA GATTGGAGCC TAACAGTGTA 540 

GATCCTGAGA ACATCACCGA AATTTTCATC GCAAACCAGA AAAGGT TAG A AAT CAT CAAC 600 

GAAGATGATG TTGAAGCTTA TGTGGGACTG AGAAAT CTGA CAATTGTGGA TTCTGGATTA 660 

AAATTTGTGG CTCATAAAGC ATTTCTGAAA AACAGCAACC TGCAGCACAT CAATTTTACC 720 

CGAAACAAAC TGACGAGTTT GTCTAGGAAA CATTTCCGTC ACCTTGACTT GTCTGAACTG 780 

ATCCTGGTGG GCAATCCATT TACATGCTCC TGTGACATTA TGTGGATCAA GACTCTCCAA 840 

GAGGCTAAAT CCAGTCCAGA CACTCAGGAT TTGTACTGCC TGAATGAAAG CAGCAAGAAT 90 0 

ATTCCCCTGG CAAACCTGCA GATACCCAAT TGTGGTTTGC CATCTGCAAA TCTGGCCGCA 960 

CCTAACCTCA CTGTGGAGGA AGGAAAGTCT AT CACATTAT CCTGTAGTGT GGCAGGTGAT 102 0 

CCGGTTCCTA ATATGTATTG GGATGTTGGT AACCTGGTTT CCAAACATAT GAATGAAACA 1080 

AGCCACACAC AGGGCTCCTT AAGGATAACT AACATTTCAT CCGATGACAG TGGGAAGCAG 1140 

ATCTCTTGTG TGGCGGAAAA TCTTGTAGGA GAAGATCAAG ATTCTGTCAA CCTCACTGTG 1200 

CATTTTGCAC CAAC TAT C AC ATTTCTCGAA TCTCCAACCT CAGACCACCA CTGGTGCATT 12 60 

CCATTCACTG TGAAAGGCAA CCCCAAACCA GCGCTTCAGT GGTTCTATAA CGGGGCAATA 1320 

TTGAATGAGT CCAAATACAT CTGTACTAAA ATACATGTTA CCAATCACAC GGAGTACCAC 1380 

GGCTGCCTCC AGCTGGATAA TCCCACTCAC ATGAACAATG GGGACTACAC TCTAATAGCC 1440 

AAGAATGAGT ATGGGAAGGA TGAGAAACAG ATTTCTGCTC ACTT CATGGG CTGGCCTGGA 1500 

ATTGACGATG GTGCAAACCC AAATTATCCT GATGTAATTT ATGAAGATTA TGGAACTGCA 1560 

GCGAATGACA TCGGGGACAC CACGAACAGA AGTAATGAAA TCCCTTCCAC AGACGTCACT 162 0 

GATAAAACCG GTCGGGAACA TCTCTCGGTC TATGCTGTGG TGGTGATTGC GTCTGTGGTG 1680 

GGATTTTGCC TTTTGGTAAT GCTGTTTCTG CTTAAGTTGG CAAG AC ACT C CAAGTTTGGC 1740 

ATGAAAGGCC CAGCCTCCGT TATCAGCAAT GATGATGACT CTGCCAGCCC ACTCCATCAC 1800 

ATCTCCAATG GGAGTAACAC TCCATCTTCT TCGGAAGGTG GCCCAGATGC TGTCATTATT 1860 

GGAATGACCA AGATCCCTGT CATTGAAAAT CCCCAGTACT TTGGCAT C AC CAACAGTCAG 1920 

CTCAAGCCAG ACACATTTGT TCAGCACATC AAGCGACATA ACATTGTTCT GAAAAGGGAG 1980 

CTAGGCGAAG GAGCCTTTGG AAAAGTGTTC CTAGCTGAAT GCTATAACCT CTGTCCTGAG 2040 

CAGGACAAGA TCTTGGTGGC AGTGAAGACC CTGAAGGATG CCAGTGACAA TGCACGCAAG 2100 

GACTTCCACC GTGAGGCCGA GCTCCTGACC AACCTCCAGC ATGAGCACAT CGTCAAGTTC 2160 

TATGGCGTCT GCGTGGAGGG CGACCCCCTC ATCATGGTCT TTGAGTACAT GAAGCATGGG 222 0 

GACCTCAACA AGTTCCTCAG GGCACACGGC CCTGATGCCG TGCTGATGGC TGAGGGCAAC 2280 

CCGCCCACGG AACTGACGCA GTCGCAGATG CTGCATATAG CCCAGCAGAT CGCCGCGGGC 2340 

ATGGTCTACC TGGCGTCCCA GCACTTCGTG CACCGCGATT TGGCCACCAG GAACTGCCTG 240 0 

GTCGGGGAGA ACTTGCTGGT G AAAAT CGGG GACTTTGGGA TGTCCCGGGA CGTGTACAGC 2460 

ACTGACTACT ACAGGGTCGG TGGCCACACA ATGCTGCCCA TTCGCTGGAT GCCTCCAGAG 2520 

AGCATCATGT ACAGGAAATT CACGACGGAA AGCGACGTCT GGAGCCTGGG GGT CGTGTTG 2580 

TGGGAGATTT TCACCTATGG CAAACAGCCC TGGTACCAGC TGTCAAACAA TGAGGTGATA 2640 

GAGTGTATCA CTCAGGGCCG AGTCCTGCAG CGACCCCGCA CGTGCCCCCA GGAGGTGTAT 2700 

GAG CTGATGC TGGGGTGCTG GCAGCGAGAG CCCCACATGA GGAAGAACAT CAAGGGCATC 2760 

CATACCCTCC TTCAGAACTT GGCCAAGGCA TCTCCGGTCT ACCTGGACAT TCTAGGCTAG 2820 

GGCCCTTTTC CCCAGACCGA TCCTTCCCAA CGTACTCCTC AGACGGGCTG AGAGGATGAA 2880 

CATCTTTTAA CTGCCGCTGG AGGCCACCAA GCTGCTCTCC TTCACTCTGA CAGTATTAAC 2 940 

ATCAAAGACT CCGAGAAGCT CTCGAGGGAA GCAGTGTGTA CTTCTTCATC CATAGACACA 3000 

GTATTGACTT CTTTTTGGCA TTATCTCTTT CTCTCTTTCC ATCTCCCTTG GTTGTTCCTT 3060 

TTTCTTTTTT TAAATTTTCT TTTTCTTCTT TTTTTTCGTC TTCCCTGCTT CACGATT CTT 3120 

ACCCTTTCTT TTGAAT CAAT CTGGCTTCTG CATTACTATT AACTCTGCAT AGACAAAGGC 3180 

CTTAACAAAC GTAATTTGTT ATATCAGCAG ACACTCCAGT TTGCCCACCA CAACTAACAA 3240 

TGCCTTGTTG TATTCCTGCC TTTGATGTGG ATGAAAAAAA GGGAAAACAA ATATTTCACT 33 00 

TAAACTTTGT CACTTCTGCT GTACAGATAT CGAGAGTTTC TATGGATTCA CTTCTATTTA 33 60 

TTTAT TATTA TTACTGTTCT TATTGTTTTT GGATGGCTTA AGCCTGTGTA TAAAAAAGAA 3420 

AACTTGTGTT CAATCTGTGA AGCCTTTATC TATGGGAGAT TAAAACCAGA GAGAAAGAAG 3480 

ATTTATTATG AACCGCAATA TGGGAGGAAC AAAGACAACC ACTGGGATCA GCTGGTGTCA 3540 

GTCCCTACTT AGGAAATACT CAGCAACTGT TAGCTGGGAA GAATGTATTC GGCACCTTCC 3600 

CCTGAGGACC TTTCTGAGGA GTAAAAAGAC TACTGGCCTC TGTGCCATGG ATGATTCTTT 3 660 
TCCCATCACC AGAAATGATA GCGTGCAGTA GAGAGCAAAG ATGGCTT 

Seq ID NO: 595 Protein sequence 
Protein Accession #: NP_006171.1 

1 11 21 31 41 51 

I I I I I I 

MSSWIRWHGP AMARLWGFCW LWGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP 60 

NSVDPENITE IFIANQKRIiE I INEDDVEAY VGLRNLTIVD SGLKFVAHKA FLKNSNLQHI 120 

NFTRNKLTSL SRKHFRHLDL SELILVGNPF TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180 

SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 

NETSHTQGSL RI TNI SSDDS GKQISCVAEN LVGEDQDSVN LTVHFAPTIT FLESPTSDHH 300 

WCIPFTVKGN PKPALQWFYN GAILNESKYI CTKIHVTNHT EYHGCLQLDN PTHMNNGDYT 360 

LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY GTAANDIGDT TNRSNE I PST 420 

DVTDKTGREH LSVYAWVIA SWGFCLLVM LFLLKLARHS KFGMKGPASV ISNDDDSASP 480 

LHHISNGSNT PSSSEGGPDA VIIGMTKIPV IENPQYFGIT NSQLiKPDTFV QHIKRHNIVL 540 

KRELGEGAFG KVFLAECYNL CPEQDKILVA VKTL KD A SDN ARKDFHREAE LLTNLQHEHI 600 

VKFYGVCVEG DPLIMVFEYM KHGDLNKFLR AHGPDAVLMA EGNPPTELTQ SQMLHIAQQI 660 

AAGMVYLASQ HFVHRDLATR NCLVGENLLV KIGDFGMSRD VYS TDYYRVG GHTMLPIRWM 720 



417 
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PPESIMYRKF TTESDVWSLG WLWEIFTYG KQPWYQLSNN EVIECITQGR VLQRPRTCPQ 780 
EVYELMLGCW QREPHMRKNI KGIHTLLQNL AKASPVYLDI LG 

Seq ID NO: 596 DNA sequence 
Nucleic Acid Accession #: AF410899 
Coding sequence: 483.. 2 9 99 

1 11 21 31 41 51 

1 I I I I I 

GGGAGCAGGA GCCTCGCTGG CTGCTTCGCT CGCGCTCTAC GCGCTCAGTC CCCGGCGGTA 60 

GCAGGAGCCT GGACCCAGGC GCCGGCGGCG GGCGTGAGGC GCCGGAGCCC GGCCTCGAGG 120 

TGCATACCGG ACCCCCATTC GCATCTAACA AGGAATCTGC GCCCCAGAGA GTCCCGGACG 180 

CCGCCGGTCG GTGCCCGGCG CGCCGGGCCA TGCAGCGACG GCCGCCGCGG AGCTCCGAGC 240 

AGCGGTAGCG CCCCCCTGTA AAGCGGTTCG CTATGCCGGG ACCACTGTGA ACCCTGCCGC 3 00 

CTGCCGGAAC ACTCTTCGCT CCGGACCAGC TCAGCCTCTG ATAAGCTGGA CTCGGCACGC 3 60 

CCGCAACAAG CACCGAGGAG TTAAGAGAGC CGCAAGCGCA GGGAAGGCCT CCCCGCACGG 420 

GTGGGGGAAA GCGGCCGGTG CAGCGCGGGG ACAGGCACTC GGGCTGGCAC TGGCTGCTAG 480 

GGATGTCGTC CTGGATAAGG TGGCATGGAC CCGCCATGGC GCGGCTCTGG GGCTTCTGCT 540 

GGCTGGTTGT GGGCTTCTGG AGGGCCGCTT TCGCCTGTCC CACGTCCTGC AAATGCAGTG 600 

CCTCTCGGAT CTGGTGCAGC GACCCTTCTC CTGGCATCGT GGCATTT CCG AGATTGGAGC 660 

CTAACAGTGT AGATCCTGAG AACATCACCG AAATTTTCAT CGCAAACCAG AAAAGGTTAG 720 

AAATCATCAA CGAAGATGAT GTTGAAGCTT ATGTGGGACT GAGAAATCTG ACAATTGTGG 780 

ATT CTGGATT AAAATTTGTG GCT CAT AAAG CATTTCTGAA AAACAGCAAC CTGCAGCACA 840 

TCAATTTTAC CCGAAACAAA CTGACGAGTT TGTCTAGGAA ACATTTCCGT CACCTTGACT 900 

TGTCTGAACT GATCCTGGTG GGCAATCCAT- TTACATGCTC CTGTGACATT ATGTGGATCA 960 

AGACTCTCCA AGAGGCTAAA TCCAGTCCAG ACACTCAGGA TTTGTACTGC CTGAATGAAA 1020 

GCAGCAAGAA TATTCCCCTG GCAAACCTGC AGATACCCAA TTGTGGTTTG CCATCTGCAA 1080 

ATCTGGCCGC ACCTAACCTC ACTGTGGAGG AAGGAAAGTC TATCACATTA TCCTGTAGTG 1140 

TGGCAGGTGA TCCGGTTCCT AATATGTATT GGGATGTTGG TAACCTGGTT TCCAAACATA 12 00 

TGAATGAAAC AAGCCACACA CAGGGCTCCT TAAGGATAAC TAACATTTCA TCCGATGACA 1260 

GTGGGAAGCA GATCTCTTGT GTGGCGGAAA ATCTTGTAGG AGAAGATCAA GATTCTGTCA 1320 

ACCTCACTGT GCATTTTGCA CCAACTATCA CATTTCTCGA ATCTCCAACC TCAGACCACC 1380 

ACTGGTGCAT TCCATTCACT GTGAAAGGCA ACCCCAAACC AGCGCTTCAG TGGTTCTATA 1440 

ACGGGGCAAT ATTGAATGAG TCCAAATACA TCTGTACTAA AATACATGTT ACCAATCACA 1500 

CGGAGTACCA CGGCTGCCTC CAG CTGG AT A ATCCCACTCA CATGAACAAT GGGGACTACA 1560 

CTCTAATAGC CAAGAATGAG TATGGGAAGG ATGAGAAACA GATTTCTGCT CACTTCATGG 162 0 

GCTGGCCTGG AATTGACGAT GGTGCAAACC CAAATT AT CC TGATGTAATT TATGAAGATT 1680 

ATGGAACTGC AGCGAATGAC ATCGGGGACA CCACGAACAG AAGTAATGAA ATCCCTTCCA 1740 

CAGACGTCAC TGATAAAACC GGTCGGGAAC ATCTCTCGGT CTATGCTGTG GTGGTGATTG 1800 

CGTCTGTGGT GGGATTTTGC CTTTTGGTAA TGCTGTTTCT GCTTAAGTTG GCAAGACACT 1860 

CCAAGTTTGG CATGAAAGAT TTCTCATGGT TTGGATTTGG GAAAGTAAAA TCAAGACAAG 1920 

GTGTTGGCCC AGCCTCCGTT ATCAGCAATG ATG ATG ACT C TGCCAGCCCA CTCCATCACA 1980 

TCTCCAATGG GAGTAACACT CCATCTTCTT CGGAAGGTGG CCCAGATGCT GTCATTATTG 2 040 

GAATGACCAA GATCCCTGTC ATTGAAAATC CCCAGTACTT TGGCATCACC AACAGTCAGC 2100 

TCAAGCCAGA CACATTTGTT CAGCACATCA AGCGACATAA CATTGTTCTG AAAAGGGAGC 2160 

TAGGCGAAGG AGCCTTTGGA AAAGTGTTCC TAGCTGAATG CTATAACCTC TGTCCTGAGC 222 0 

AGGACAAGAT CTTGGTGGCA GTGAAGACCC TGAAGGATGC CAGTGACAAT GCACGCAAGG 22 80 

ACTTCCACCG TGAGGCCGAG CTCCTGACCA ACCTCCAGCA TGAGCACATC GTCAAGTTCT 2 340 

ATGGCGTCTG CGTGGAGGGC GACCCCCTCA TCATGGTCTT TGAGTACATG AAGCATGGGG 240 0 

ACCTCAACAA GTTCCTCAGG GCACACGGCC CTGATGCCGT GCTGATGGCT GAGGGCAACC 24 60 

CGCCCACGGA ACTGACGCAG TCGCAGATGC TGCATATAGC CCAGCAGATC GCCGCGGGCA 2 520 

TGGTCTACCT GGCGTCCCAG CACTTCGTGC ACCGCGATTT GGCCACCAGG AACTGCCTGG 2580 

TCGGGGAGAA CTTGCTGGTG AAAATCGGGG ACTTTGGGAT GTCCCGGGAC GTGTACAGCA 2 640 

CTGACTACTA CAGGGTCGGT GGCCACACAA TGCTGCCCAT TCGCTGGATG CCT CCAGAG A 2 700 

GCATCATGTA CAGGAAATTC ACGACGGAAA GCGACGTCTG GAGCCTGGGG GTCGTGTTGT 2760 

GGGAGATTTT CACCTATGGC AAACAGCCCT GGTACCAGCT GTCAAACAAT GAGGTGATAG 2820 

AGTGTAT CAC TCAGGGCCGA GTCCTGCAGC GACCCCGCAC GTGCCCCCAG GAGGTGTATG 2880 

AGCTGATGCT GGGGTGCTGG CAGCGAGAGC CCCACATGAG GAAGAACATC AAGGGCATCC 2940 

ATACCCTCCT TCAGAACTTG GCCAAGGCAT CTCCGGTCTA CCTGGACATT CTAGGCTAGG 3000 

GCCCTTTTCC CCAGACCGAT CCTTCCCAAC GTACTCCTCA GACGGGCTGA GAGGATGAAC 3 060 

ATCTTTTAAC TGCCGCTGGA GGCCACCAAG CTGCTCTCCT TCACTCTGAC AGTATTAACA 312 0 

TCAAAGACTC CGAGAAGCTC TCGAGGGAAG CAGTGTGTAC TTCTTCATCC ATAGACACAG 3180 

TATTGACTTC TTTTTGGCAT TATCTCTTTC TCT CTTTCCA TCTCCCTTGG TTGTTCCTTT 3240 

TTCTTTTTTT AAATTTTCTT TTTCTTCTTT TTTTTCGTCT TCCCTGCTTC ACGATTCTTA 3 300 

CCCTTTCTTT TGAATCAATC TGGCTTCTGC ATTACTATTA ACTCTGCATA GACAAAGGCC 3360 

TTAACAAACG TAATTTGTTA TATCAGCAGA CACTCCAGTT TGCCCACCAC AACTAACAAT 3420 

GCCTTGTTGT ATTCCTGCCT TTGATGTGGA TGAAAAAAAG GGAAAACAAA TATTTCACTT 3480 

AAACTTTGTC ACTTCTGCTG TACAGATATC GAGAGTTTCT ATGGATTCAC TTCTATTTAT 3540 

TT ATT AT TAT TACTGTTCTT ATTGTTTTTG GATGGCTTAA GCCTGTGTAT AAAAAAGAAA 3600 

ACTTGTGTTC AATCTGTGAA GCCTTTATCT ATGGGAGATT AAAACCAGAG AGAAAGAAGA 3 660 

TTTATTATGA ACCGCAATAT GGGAGGAACA AAGACAACCA CTGGGATCAG CTGGTGTCAG 3720 

TCCCTACTTA GG AAAT ACT C AGCAACTGTT AGCTGGGAAG AATGTATTCG GCACCTTCCC 3 780 

CTGAGGACCT TTCTGAGGAG TAAAAAGACT ACTGGCCTCT GTGCCATGGA TGATTCTTTT 3 840 

CCCATCACCA GAAATGATAG CGTGCAGTAG AGAGCAAAGA TGGCTTCCGT GAGACACAAG 3 900 

ATGGCGCATA GTGTGCTCGG ACACAGTTTT GTCTTCGTAG GTTGTGATGA TAGCACTGGT 3 960 

TTGTTTCTCA AGCGCTATCC ACAGAACCTT TGTCAACTTC AGTTGAAAAG AGGTGGATTC 4 02 0 
ATGTCCAGAG CTCATTTCGG GGTCAGGTGG GAAAGCC 

Seq ID NO: 597 Protein sequence 
Protein Accession #: AAL67965.1 

1 11 21 31 41 51 

I 1 1 I I I 

MSSWIRWHGP AMARLWGFCW LWGFWRAAF ACPTSCKCSA SRIWCSDPSP GIVAFPRLEP 60 

NSVDPENITE IFIANQKRLE I INEDDVEAY VGIiRNLTIVD SGLKFVAHKA FLKNSNLQHI 120 

NFTRNKLTSL SRKHFRHLDL SELILVGNPF TCSCDIMWIK TLQEAKSSPD TQDLYCLNES 180 

SKNIPLANLQ IPNCGLPSAN LAAPNLTVEE GKSITLSCSV AGDPVPNMYW DVGNLVSKHM 240 



418 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 



WO 02/086443 

NETSHTQGSL RITNISSDDS GKQISCVAEN LVGEDQDSVN 
WCIPFTVKGN PKPALQWFYN GAILNESKYI CTK I HVTNHT 
LIAKNEYGKD EKQISAHFMG WPGIDDGANP NYPDVIYEDY 
DVTDKTGREH LSVYAWVIA SWGFCLLVM LFLLKLARHS 
VGPASVISND DDSASPLHHI SNGSNTPSSS EGGPDAVIIG 
KPDTFVQHIK RHNIVLKREL GEGAFGKVFL AECYNLCPEQ 
FHREAELLTN LQHEHIVKFY GVCVEGDPLI MVFEYMKHGD 
PTELTQSQML HIAQQIAAGM VYLASQHFVH RDLATRNCLV 
DYYRVGGHTM LPIRWMPPES IMYRKFTTES DVWSLGWLW 
CITQGRVLQR PRTCPQEVYE LMLGCWQREP HMRKNIKGIH 

Seq ID NO: 5 98 DNA sequence 
Nucleic Acid Accession #: AB052906 
Coding sequence: 74.. 814 



PCT/US02/12476 



AAAACCTTGA 
CTCTGGGTCC 
GCTCCTGCTG 
CATCACCGTC 
GGATGAAAAG 
CCTGGGGAAG 
GGTGGTGGAC 
GGAACCCCTC 
TGGATCTTGG 
AATGTGGACA 
GGTTGTGGCC 
CTTCTTGATG 
CTCAGGCACA 
CATCCTCCCC 
AAGCTGATAC 
CCAGCTGCCC 
TGGACCCAAT 
TACCTAACAT 
TTCTGGCTGA 
GTACTTCTTT 
TAGACTTCAG 
ATAAGAAAAA 
TTTAAATAAA 



11 
I 

GGTGATTCAT 
TTAATGGCAG 
TCCGGCTGGT 
ATCCCTAAGT 
ACTTTTCTTC 
AAACTAAATG 
ATACTTACAG 
ACCCTGCAGG 
CAGTTCAGTT 
ACGGTTCATC 
ATGTCCTTCC 
GGCATGGACA 
ACCCAACTCA 
TGCTTCATCC 
CAAAAGGCTC 
ACGACCTACG 
AGCTCATTCA 
ATTATGCAAT 
CTAAACAAGA 
GAATGATGAT 
ACCTCTGGGG 
ATTTATATTA 
GAGTTCTATT 



21 
1 

CTTCCAGGCT 
CAGCCGCCGC 
CCCGGGCTGG 
TCAGACCTGG 
ACTATGACTG 
TCACAACGGC 
AGCAACTGCG 
CCAGGATGTC 
TCGATGGGCA 
CTGGAGCCAG 
ATTACTTCTC 
GCACCCTGGA 
GGGCCACAGC 
TCCCTGGCAT 
CTGTGAGCAC 
GTGTATGTCC 
CTGCCTTGAT 
TTTCTCTTGG 
TAT ATCAT TT 
CTCTTTCTTG 
ATTCTTTCCG 
ATGAT TGTTT 
TCCCAAAAAA 



31 
I 

CTCCTTCCAT 
TACCAAGATC 
GCGAGCCGAC 
ACCACGGTGG 
TGGCAACAAG 
CTGGAAAGCA 
TGACATTCAG 
TTGTGAGCAG 
GATCTTCCTC 
AAAGATGAAA 
AAT GGGAG AC 
GCCAAGTGCA 
CACCACCCTC 
CTGAGGAGAG 
GGTCTTGATC 
AGTGGCCTCC 
TCCTTTTGCC 
TGCTACCTGA 
TCTTTCTTCT 
CAAATGATAT 
TGTCCTGAAA 
CCTTTAGTAA 
AAAAAAAAAA 



U X vxiJr^iir ill 


ft" ip ^ pt sryHw 

i- jjjjOjt x ouiin. 


300 


E YHGCL O IjDM 


PTHMNNGDYT 


360 


GTAANDIGDT 


TNRSNEIPST 


420 


KFGMKDPSWF 


GFGKVKSRQG 


480 


MTKIPVIENP 


QYFGITNSQL 


540 


DKILVAVKTL 


KDASDNARKD 


600 


LNKFLRAHGP 


DAVLMAEGNP 


660 


GENLLVKIGD 


FGMSRDVYST 


720 


EIFTYGKQPW 


YQLSNNEVIE 


780 


TLLQNLAKAS 


PVYLDILG 




41 
1 


51 
1 




1 

pa agtptptp 


1 

CTPP.P.TAGCG 


60 


V— J. 1^— XVJXVJ>wl_ 


X Vw^C.\JJ *w X X V- X 


120 


V— X UA^w 1U1^ 


TTTGCTATGA 


180 




AAGGCCAGGT 


240 


apagtpa pa c 


CTGTPAGTCC 


300 


PAGAAPPPAG 


TAPTGAGAGA 


360 


PTGGAGAATT 

X VJVJrtVJfWl X X 


ACACACCCAA 


420 


AAAGCTGAAG 


GACACAGCAG 


480 


PTPTTTGAPT 


CAGAGAAGAG 


540 


gaaaagtggg 


AGAATGACAA 


600 


T<*2TATAf2GAT 


GGPTTGAGGA 


660 




TPGPPATGTP 
x ^uv^Ln x vj i. V— 


720 


nlv-LJ, X X x 


GPPTPPTPAT 


780 




TGAPAGGTTA 


840 


AAACTCGPCC. 


TTCTGTCTGG 


900 


AGC AGAT CAT 


GATGACATCA 


960 


AACAATTTTA 


CCAGCAGTTA 


1020 


TGGAATTCCT 


GCACTTAAAG 


1080 


CTTTTTGTTT 


GGAAAATCAA 


1140 


TGT CAGT AAA 


ATAATCACGT 


1200 


GAGAATTTTT 


AAATTATTTA 


1260 


TTTATTGTTC 


TGTACTGATA 


1320 


AA 







Seq ID NO: 599 Protein sequence 
Protein Accession #: BAB61048.1 



11 



21 



51 



31 41 

l l I I I I 

MAAAAATKIL LCLPLLLLLS GWSRAGRADP HSLCYDITVI PKFRPGPRWC AVQGQVDEKT 
FLHYDCGNKT VTPVSPLGKK LNVTTAWKAQ NPVLREWDI LTEQLRDIQL ENYTPKEPLT 
LQARMSCEQK AEGHSSGSWQ FSFDGQIFLL FDSEKRMWTT VHPGARKMKE KWENDKWAM 
SFHYFSMGDC IGWLEDFLMG MDSTLEPSAG APLAMSSGTT QLRATATTLI LCCLLI ILPC 
FILPGI 

Seq ID NO: 600 DNA sequence 

Nucleic Acid Accession #: NM_001898.1 

Coding sequence: 57.. 482 



GGCTCTCACC 
CCCAGTATCT 
GCCCCAAGGA 
AGTGGGTACA 
ACTACTACAG 
ATTACTTCTT 
ACACCTGTGC 
TCTACGAAGT 
AGGGATCTGT 
CCACCCCTGG 
GACAGACAGA 
CTTCCTTCTT 
AAACAGTAGC 



11 

I 

CTCCTCTCCT 
GAGTACCCTG 
GGAGGATAGG 
GCGTGCCCTT 
ACGTCCGCTG 
CGACGTAGAG 
CTTCCATGAA 
TCCCTGGGAG 
GCCAGGCCAT 
ACTGGTGGCC 
GAAGGCTGCA 
GCTTCTAATA 
ATCGCC 



21 
I 

GCAGCTCCAG 
CTGCTCCTGC 
ATAATCCCGG 
CACTTCGCCA 
CGGGTACTAA 
GTGGGCCGCA 
CAGCCAGAAC 
AACAGAAGGT 
TCGCACCAGC 
CCCACCCTGC 
GGAGTCCTTT 
GCCCTGGTAC 



31 
I 

CTTTGTGCTC 
TGGCCACCCT 
GTGGCATCTA 
TCAGCGAGTA 
GAGCCAGGCA 
CCATATGTAC 
TGCAGAAGAA 
CCCTGGTGAA 
CACCACCCAC 
GGGAGGCCTC 
GTTGCTCAGC 
ATGGTACACA 



41 
I 

TGCCTCTGAG 
AGCTGTGGCC 
TAACGCAGAC 
TAACAAGGCC 
ACAGACCGTT 
CAAGTCCCAG 
ACAGTTGTGC 
ATCCAGGTGT 
TCCCACCCCC 
CCCATGTGCC 
AGGGCGCTCT 
CCCCCCCACC 



51 
I 

GAGACCATGG 
CTGGCCTGGA 
CTCAATGATG 
ACCAAAGATG 
GGGGGGGTGA 
CCCAACTTGG 
TCTTTCGAGA 
CAAGAATCCT 
TGTAGTGCTC 
TGCGCCAAGA 
GCCCTCCCTC 
TCCTGCAATT 



60 
120 
180 
240 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



75 



80 



85 



Seq ID NO: 601 Protein sequence 
Protein Accession #: NP_001889.1 

1 11 21 31 41 51 

1 I 1 1 I 1 

MAQYLSTLLI* LLATLAVALA WSPKEEDRII PGGI YNADLN DEWVQRALHF AISEYNXATK 

DDYYRRPLRV LRARQQTVGG VNYFFDVEVG RTICTKSQPN LDTCAFHEQP ELQKKQLCSF 
EIYEVPWENR RSLVKSRCQE S 

Seq ID NO: 602 DNA sequence 

Nucleic Acid Accession #: NM_003976.2 

Coding sequence: 2 99.961 



11 

1 



21 



31 



41 



51 



60 
120 



419 



10 



15 



20 



25 



WO 02/086443 

CTCTGAGCTT CTCTGAGCCT TGTTTGCTCA TCTGGAAAAA GGGGATTAAA CCATTTACCT 60 

CATGGAGTTG TGAAAGAATA GCTGCAAAGC ACCTAACACA TAGTAAGGTT CCCAGTGCAG 120 

CTACTTCTGC TGGGTTGAGT CTAGCTGTGT AGGCCCCTTG TTCCTCACCT GGAGAAACTG 180 

GGGTGGCAGG CCGGTCCCCC ACAAAAGATA ACTCATCTCT TAATTTGCAA GCTGCCTCAA 240 

CAGGAGGGTG GGGGAACAGC TCAACAATGG CTGATGGGCG CTCCTGGTGT TGATAGAGAT 300 

GGAACTTGGA CTTGGAGGCC TCTCCACGCT GTCCCACTGC CCCTGGCCTA GGCGGCAGCC 360 

TGCCCTGTGG CCCACCCTGG CCGCTCTGGC TCTGCTGAGC AGCGTCGCAG AGGCCTCCCT 420 

GGGCTCCGCG CCCCGCAGCC CTGCCCCCCG CGAAGGCCCC CCGCCTGTCC TGGCGTCCCC 480 

CGCCGGCCAC CTGCCGGGGG GACGCACGGC CCGCTGGTGC AGTGGAAGAG CCCGGCGGCC 540 

GCCGCCGCAG CCTTCTCGGC CCGCGCCCCC GCCGCCTGCA CCCCCATCTG CTCTTCCCCG 600 

CGGGGGCCGC GCGGCGCGGG CTGGGGGCCC GGGCAGCCGC GCTCGGGCAG CGGGGGCGCG 660 

GGGCTGCCGC CTGCGCTCGC AGCTGGTGCC GGTGCGCGCG CTCGGCCTGG GCCACCGCTC 720 

CGACGAGCTG GTGCGTTTCC GCTTCTGCAG CGGCTCCTGC CGCCGCGCGC GCTCTCCACA 780 

CGACCTCAGC CTGGCCAGCC TACTGGGCGC CGGGGCCCTG CGACCGCCCC CGGGCTCCCG 840 

GCCCGTCAGC CAGCCCTGCT GCCGACCCAC GCGCTACGAA GCGGTCTCCT TCATGGACGT 900 

CAACAGCACC TGGAGAACCG TGGACCGCCT CTCCGCCACC GCCTGCGGCT GCCTGGGCTG 960 

AGGGCTCGCT CCAGGGCTTT G CAGACTGG A CCCTTACCGG TGGCTCTTCC TGCCTGGGAC 1020 

CCTCCCGCAG AGTCCCACTA GCCAGCGGCC TCAGCCAGGG ACGAAGGCCT CAAAGCTGAG 1080 

AGGCCCCTAC CGGTGGGTGA TGGAT AT CAT CCCCGAACAG GTGAAGGGAC AACTGACTAG 1140 

CAGCCCCAGA GCCCTCACCC TGCGGATCCC AGCCTAAAAG ACACCAGAGA CCTCAGCTAT 1200 

GGAGCCCTTC GGACCCACTT CTCACAGACT CTGGCACTGG CCAGGCCTCG AACCTGGGAC 1260 

CCCTCCTCTG ATGAACACTA CAGTGGCTGA GGCAT CAGCC CCCGCCCAGG CCCTGTAGGG 1320 

ACAGCATTTG AAGGACACAT ATTGCAGTTG CTTGGTTGAA AGTGCCTGTG CTGGAACTGG 13 80 
CCTGTACTCA CTCATGGGAG CTGGCCCC 

Seq ID NO: 603 Protein sequence 
Protein Accession #: NP 003967.1 



PCT/US02/12476 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



1 11 21 31 41 51 

I I ! 1 I I 

MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS 
PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 
RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR L S AT ACGCLG 

Seq ID NO: 604 DNA sequence 

Nucleic Acid Accession #: NM_057091.1 

Coding sequence: 783.. 1445 



60 
120 
180 



ACTGGCCGCT 
GGACCCCCAA 
TCGCTCCCCG 
CGCGTGTCTA 
CTCCATATCC 
CAAGCTAGGG 
CGGGGCAGGG 
CACCGGACGG 
CAGACAAGGC 
CCCGGGCCTG 
TAAAAGAGGC 
GCCCAGCACT 
TCAACAGGAG 
AGATGGAACT 
AGCCTGCCCT 
CCCTGGGCTC 
CCCCCGCCGG 
GGCCGCCGCC 
CCCGCGGGGG 
CGCGGGGCTG 
GCTCCGACGA 
CACACGACCT 
CCCGGCCCGT 
ACGTCAACAG 
GCTGAGGGCT 
GGACCCTCCC 
TGAGAGGCCC 
CTAGCAGCCC 
CTATGGAGCC 
GGACCCCTCC 
AGGGACAGCA 
CTGGCCTGTA 



11 
I 

GAGAGAAGAA 
ATCTGCACGT 
CCCTCACTCA 
CAAACTCAAC 
GAGGGGCCCC 
GGGACTGGAT 
GCGCTCCCAG 
CTGCGGCGGC 
CCGGGGGCTC 
GAGCCCCACA 
ACTGCCAGGT 
GGTCCCCGGA 
GGTGGGGGAA 
TGGACTTGGA 
GTGGCCCACC 
CGCGCCCCGC 
CCACCTGCCG 
GCAGCCTTCT 
CCGCGCGGCG 
CCGCCTGCGC 
GCTGGTGCGT 
CAGCCTGGCC 
CAGCCAGCCC 
CACCTGGAGA 
CGCTCCAGGG 
GCAGAGTCCC 
CTACCGGTGG 
CAGAGCCCTC 
CTTCGGACCC 
TCTGATGAAC 
TTTGAAGGAC 
CTCACTCATG 



21 

) 

TCGGGTGGAG 
ACCAGCAGTC 
CTTTCTCCCG 
TCCCGGTTTC 
TCCCAGCATC 
CCGACGGGTG 
CCCCACCCCG 
GGGCAGGAGG 
CGCCAGCAGC 
CCCGAGGGTG 
GTACAGTCCT 
AAGGTGCCTA 
CAGCTCAACA 
GGCCTCTCCA 
CTGGCCGCTC 
AGCCCTGCCC 
GGGGGACGCA 
CGGCCCGCGC 
CGGGCTGGGG 
TCGCAGCTGG 
TTCCGCTTCT 
AGCCTACTGG 
TGCTGCCGAC 
ACCGTGGACC 
CTTTGCAGAC 
ACTAGCCAGC 
GTGATGGATA 
ACCCTGCGGA 
ACTTCTCACA 
ACTACAGTGG 
ACATATTGCA 
GGAGCTGGCC 



31 

I 

CAGAGAGCAG 
AGCCGCCCCA 
CCCTCGGCCC 
CGTGCCTCTC 
TACCCCCCTC 
GAGCAGCCAG 
GGATCTGGTG 
CTGCTGAGGG 
AGGTCCCTCG 
CAGACTGGCT 
GGGCATGCGC 
GAAGAACAAG 
ATGGCTGATG 
CGCTGTCCCA 
TGGCTCTGCT 
CCCGCGAAGG 
CGGCCCGCTG 
CCCCGCCGCC 
GCCCGGGCAG 
TGCCGGTGCG 
GCAGCGGCTC 
GCGCCGGGGC 
CCACGCGCTA 
GCCTCTCCGC 
TGGACCCTTA 
GGCCTCAGCC 
TCATCCCCGA 
TCCCAGCCTA 
GACTCTGGCA 
CTGAGGCATC 
GTTGCTTGGT 
CC 



41 
I 

CTGCTGCAGG 
CGCAGGGACC 
GGCCTCCCAG 
CACCGCTCGA 
CCAACCTCGG 
GTGAGCCCCG 
ACGCTGGGGC 
ATGGAGTTGG 
GGCCCCAGCC 
GCCAAGGCCA 
TGTTTGAGCT 
GTGCAGGACC 
GGCGCTCCTG 
CTGCCCCTGG 
GAGCAGCGTC 
CCCCCCGCCT 
GTG CAGTGG A 
TGCACCCCCA 
CCGCGCTCGG 
CGCGCTCGGC 
CTGCCGCCGC 
CCTGCGACCG 
CGAAGCGGTC 
CACCGCCTGC 
CCGGTGGCTC 
AGGGACGAAG 
ACAGGTGAAG 
AAAGACACCA 
CTGGCCAGGC 
AGCCCCCGCC 
TGAAAGTGCC 



51 

I 

GCAGACAGCC 
GGCTTACCCC 
CTCTCTACTT 
GTTCTCTACT 
GGGACCTAGC 
AAAGGTGGGG 
TGGAATTTGA 
GCCCGGCCCC 
CTCGCTGCCA 
CACTTTTGGC 
TCGGGGGAGA 
CCGTGCTGCC 
GTGTTGATAG 
CCTAGGCGGC 
GCAGAGGCCT 
GTCCTGGCGT 
AGAGCCCGGC 
TCTGCTCTTC 
GCAGCGGGGG 
CTGGGCCACC 
GCGCGCTCTC 
CCCCCGGGCT 
TCCTTCATGG 
GGCTGCCTGG 
TTCCTGCCTG 
GCCTCAAAGC 
GGACAACTGA 
GAGACCTCAG 
CTCGAACCTG 
CAGGCCCTGT 
TGTGCTGGAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



Seq ID NO: 605 Protein sequence 
Protein Accession #: NP 003 967.1 



80 



85 



1 11 21 31 41 51 

I I I I 1 I 

MELGLGGLST LSHCPWPRRQ PALWPTLAAL ALLSSVAEAS LGSAPRSPAP REGPPPVLAS 

PAGHLPGGRT ARWCSGRARR PPPQPSRPAP PPPAPPSALP RGGRAARAGG PGSRARAAGA 

RGCRLRSQLV PVRALGLGHR SDELVRFRFC SGSCRRARSP HDLSLASLLG AGALRPPPGS 
RPVSQPCCRP TRYEAVSFMD VNSTWRTVDR LSATACGCLG 



Seq ID NO: 606 DNA sequence 

Nucleic Acid Accession #: NM_057160.1 



60 
120 
180 



420 



WO 02/086443 

Coding sequence : 1 . . 



PCT/US02/12476 



10 



15 



20 



25 



ATGCCCGGCC 
CACCTGGGTG 
TGGCCCACCC 
GCGCCCCGCA 
CACCTGCCGG 
CAGCCTTCTC 
CGCGCGGCGC 
CGCCTGCGCT 
CTGGTGCGTT 
AGCCTGGCCA 
AGCCAGCCCT 
ACCTGGAGAA 
GCTCCAGGGC 
CAGAGTCCCA 
TACCGGTGGG 
AGAGCCCTCA 
TTCGGACCCA 
CTGATGAACA 
TTGAAGGACA 
TCACTCATGG 



11 
I 

TGATCTCAGC 
CCCTCTTTCT 
TGGCCGCTCT 
GCCCTGCCCC 
GGGGACGCAC 
GGCCCGCGCC 
GGGCTGGGGG 
CGCAGCTGGT 
TCCGCTTCTG 
GCCTACTGGG 
GCTGCCGACC 
CCGTGGACCG 
TTTGCAGACT 
CTAGCCAGCG 
TGATGGATAT 
CCCTGCGGAT 
CTTCTCACAG 
CTACAGTGGC 
CATATTGCAG 
GAGCTGGCCC 



21 

1 

CCGAGGACAG 
CCCTGAGGCT 
GGCTCTGCTG 
CCGCGAAGGC 
GGCCCGCTGG 
CCCGCCGCCT 
CCCGGGCAGC 
GCCGGTGCGC 
CAGCGGCTCC 
CGCCGGGGCC 
CACGCGCTAC 
CCTCTCCGCC 
GGACCCTTAC 
GCCTCAGCCA 
CATCCCCGAA 
CCCAGCCTAA 
ACTCTGGCAC 
TGAGGCATCA 
TTGCTTGGTT 
C 



31 

! 

CCCCTCCTTG 
CCACTTGGTC 
AGCAGCGTCG 
CCCCCGCCTG 
TGCAGTGGAA 
GCACCCCCAT 
CGCGCTCGGG 
GCGCTCGGCC 
TGCCGCCGCG 
CTGCGACCGC 
GAAGCGGTCT 
ACCGCCTGCG 
CGGTGGCTCT 
GGGACGAAGG 
CAGGTGAAGG 
AAGACACCAG 
TGGCCAGGCC 
GCCCCCGCCC 
GAAAGTGCCT 



41 
I 

AGGTCCTTCC 
TCTCCGCGCA 
CAGAGGCCTC 
TCCTGGCGTC 
GAGCCCGGCG 
CTGCTCTTCC 
CAGCGGGGGC 
TGGGCCACCG 
CGCGCTCTCC 
CCCCGGGCTC 
CCTTCATGGA 
GCTGCCTGGG 
TCCTGCCTGG 
CCTCAAAGCT 
GACAACTGAC 
AGACCTCAGC 
TCGAACCTGG 
AGGCCCTGTA 
GTGCTGGAAC 



51 
I 

TCCCCAAGCC 
GCCTGCCCTG 
CCTGGGCTCC 
CCCCGCCGGG 
GCCGCCGCCG 
CCGCGGGGGC 
GCGGGGCTGC 
CTCCGACGAG 
ACACGACCTC 
CCGGCCCGTC 
CGTCAACAGC 
CTGAGGGCTC 
GACCCTCCCG 
GAGAGGCCCC 
TAGCAGCCCC 
TATGGAGCCC 
GACCCCTCCT 
GGGACAGCAT 
TGGCCTGTAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



Seq ID NO: 607 Protein sequence 
Protein Accession #: NP 476501.1 



30 



35 



40 



45 



50 



55 



60 



1 11 21 31 41 51 

I I I I I I 

MPGLISARGQ PLLEVLPPQA HLGALFLPEA PLGLSAQPAL WPTLAALALL SSVAEASLGS 
APRSPAPREG PPPVLASPAG HLPGGRTARW CSGRARRPPP QPSRPAPPPP APPSALPRGG 
RAARAGGPGS RARAAGARGC RLRSQLVPVR ALGLGHRSDE LVRFRFCSGS CRRARSPHDL 
SLASLLGAGA LRPPPGSRPV SQPCCRPTRY EAVSFMDVNS TWRTVDRLSA TACGCLG 

Seq ID NO: 608 DNA sequence 

Nucleic Acid Accession #: NM_0570 90.1 

Coding sequence: 29.. 715 



60 
120 
180 



1 
I 

CTGATGGGCG 
GTCCCACTGC 
GTGGCCCACC 
CGCGCCCCGC 
CCACCTGCCG 
GCAGCCTTCT 
CCGCGCGGCG 
CCGCCTGCGC 
GCTGGTGCGT 
CAGCCTGGCC 
CAGCCAGCCC 
CACCTGGAGA 
CGCTCCAGGG 
GCAGAGTCCC 
CTACCGGTGG 
CAGAGCCCTC 
CTTCGGACCC 
TCTGATGAAC 
TTTGAAGGAC 
CTCACTCATG 



11 

I 

CTCCTGGTGT 
CCCTGGCCTA 
CTGGCCGCTC 
AGCCCTGCCC 
GGGGGACGCA 
CGGCCCGCGC 
CGGGCTGGGG 
TCGCAGCTGG 
TTCCGCTTCT 
AGCCTACTGG 
TGCTGCCGAC 
ACCGTGGACC 
CTTTGCAGAC 
ACTAGCCAGC 
GTGATGGATA 
ACCCTGCGGA 
ACTTCTCACA 
ACTACAGTGG 
ACATATTGCA 
GGAGCTGGCC 



21 
I 

TGATAGAGAT 
GGCGGCAGGC 
TGGCTCTGCT 
CCCGCGAAGG 
CGGCCCGCTG 
CCCCGCCGCC 
GCCCGGGCAG 
TGCCGGTGCG 
GCAGCGGCTC 
GCGCCGGGGC 
CCACGCGCTA 
GCCTCTCCGC 
TGGACCCTTA 
GGCCTCAGCC 
TCATCCCCGA 
TCCCAGCCTA 
GACTCTGGCA 
CTGAGGCATC 
GTTGCTTGGT 
CC 



31 

! 

GGAACTTGGA 
TCCACTTGGT 
GAGCAGCGTC 
CCCCCCGCCT 
GTGCAGTGGA 
TGCACCCCCA 
CCGCGCTCGG 
CGCGCTCGGC 
CTGCCGCCGC 
CCTGCGACCG 
CGAAGCGGTC 
CACCGCCTGC 
CCGGTGGCTC 
AGGGACGAAG 
ACAGGTGAAG 
AAAGACACCA 
CTGGCCAGGC 
AGCCCCCGCC 
TGAAAGTGCC 



41 

I 

CTTGGAGGCC 
CTCTCCGCGC 
GCAGAGGCCT 
GTCCTGGCGT 
AGAGCCCGGC 
TCTGCTCTTC 
GCAGCGGGGG 
CTGGGCCACC 
GCGCGCTCTC 
CCCCCGGGCT 
TCCTTCATGG 
GGCTGCCTGG 
TTCCTGCCTG 
GCCTCAAAGC 
GGACAACTGA 
GAGACCTCAG 
CTCGAACCTG 
CAGGCCCTGT 
TGTGCTGGAA 



51 
I 

TCTCCACGCT 
AGCCTGCCCT 
CCCTGGGCTC 
CCCCCGCCGG 
GGCCGCCGCC 
CCCGCGGGGG 
CGCGGGGCTG 
GCTCCGACGA 
CACACGACCT 
CCCGGCCCGT 
ACGT CAACAG 
GCTGAGGGCT 
GGACCCTCCC 
TGAGAGGCCC 
CTAGCAGCCC 
CTATGGAGCC 
GGACCCCTCC 
AGGGACAGCA 
CTGGCCTGTA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



65 



70 



75 



80 



85 



Seq ID NO: 609 Protein sequence 
Protein Accession #: NP_476431.1 

1 11 21 31 41 51 

I I I I I I 

MELGLGGLST LSHCPWPRRQ APLGLSAQPA LWPTLAALAL LSSVAEASLG SAPRSPAPRE 

GPPPVLASPA GHLPGGRTAR WCSGRARRPP PQPSRPAPPP PAPPSALPRG GRAARAGGPG 

SRARAAGARG CRLRSQLVPV RALGLGHRSD ELVRFRFCSG SCRRARSPHD LSLASLLGAG 

ALRPPPGSRP VSQPCCRPTR YEAVS FMDVN STWRTVDRLS ATACGCLG 



Seq ID NO: 610 DNA sequence 
Nucleic Acid Accession #: Eos 
Coding sequence: 1..1746 



sequence 



ATGCCACTGA 
GCCTACCATG 
GGGGCACGCA 
CTCAACACGC 
GCCCTGAGGA 
GGCTCGCTGC 
TTCCAGGGCC 
CAGCCGGCCC 
CTGGAATACA 



11 
I 

AGCATTATCT 
GCTGCCCTAG 
TTGTGGCGGT 
ACATCACTGA 
TTGAGAAGAA 
GCTATCTCAG 
TGGACAGCCT 
ACTTCTCCCA 
TCCCTGACGG 



21 

1 

CCTTTTGCTG 
CGAGTGTACC 
GCCCACCCCT 
ACTCAATGAG 
TGAGCTGTCG 
CCTCGCCAAC 
TGAGTCTCTC 
GTGCAGCAAC 
AGCCTTCGAC 



31 
I 

GTGGGCTGCC 
TGCTCCAGGG 
CTGCCCTGGA 
TCCCCGTTCC 
CGCATCACGC 
AACAAGCTGC 
CTTCTGTCCA 
CTCAAGGAGC 
CACCTGGTAG 



41 
I 

AAGCCTGGGG 
CCTCCCAGGT 
ACGCCATGAG 
TCAATATCTC 
CTGGGGCCTT 
AGGTTCTGCC 
GTAACCAGCT 
TGCAGTTGCA 
GACTCACGAA 



51 

I 

TGCAGGGTTG 
GGAGTGCACC 
CCTGCAGATC 
AGCCCTCATC 
CCGAAACCTG 
CATCGGCCTC 
GTTGCAGATC 
CGGCAACCAC 
GCTCAATCTG 



60 
12 0 
180 



60 
120 
180 
240 
300 
360 
420 
480 
540 



421 
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GGCAAGAATA GCCTCACCCA CATCTCACCC 
GTCCTCCGGC TGTATGAGAA CAGGCTCACG 
GTTAACCTGC AGGAACTGGC TCTACAGCAG 
TTCCACAACA ACCACAACCT CCAGAGACTC 
5 CCACCCAGCA TCTTCATGCA GCTGCCCCAG 

CTGAAGGAGC TCTCTCTGGG GATCTTCGGG 
TATGACAACC ACATCTCTTC TCTACCCGAC 
GTCCTGATTC TTAGCCGCAA TCAGATCAGC 
ACGGAGCTTC GGGAGCTGTC CCTCCACACC 

10 TTCCGCATGT TGGCCAACCT GCAGAACATC 

CCAGGGAATA TCTTCGCCAA CGT CAATGGC 
CTGGAGAACT TGCCCCTCGG CATCTTCGAT 
TATGACAATC CCTGGAGGTG TGACTCAGAC 
AACCAGCCTA GGTTAGGGAC GGACACTGTA 

15 GGCCAGTCCC TCATTATCAT CAATGT CAAC 

GTGCCTAGTT ACCCAGAAAC ACCATGGTAC 
TCCGTCTCTT CTACCACTGA GCTAACCAGC 
ATTCAGGTCA CTGATGACCG CAGCGTTTGG 
ATTGCCGCCA TTGTAATTGG CATTGTCGCC 

20 TGTTGCTGCT GCAAGAAGAG GAGCCAAGCT 

TGTTAAAGAG GCAGGCTGGA GCAGGGCTGG 
TCATCTTTCT GCCTCCACCC CTGGGTCCAT 
CTAGATAAAG GTGTGCCTAC CTCTTCCTGA 
CGTGCCGGAC CTTCCTACAA TCAGGAAGAT 

25 GGATTTCCGA TTCATACCCC TGGGCTTCCT 

ACCTGTCCTC CAAGAACAGC CTTCCCTGCG 
AGTTAGT CC A CAGCCTGCTC ACTTCGTGGG 
CCTAAGTATT ATGTAAGTTG ATTTCCCTTC 
ACCCAGCATG TCCCCTCAAA TGAAAGTTCT 

30 TGAGTTCTCT CCTCAAAGAA GACTTCAAAC 

CAGCCTGGTT TTGGGGATGC TATGAAAGAG 
AGACAGAAGA GCCGTCATCA GTGTCTCACT 
CCCCAGCACA GCAAGCTGAG CCTTTTAGAG 
TGAAAAGTTT AGCCCTTTAA GGAATGAAAT 

35 AAAAT CAGCT TATTAATACG GGATAGAGAA 

CACCCCTAGA GTTTGTTTTA AAATTTTTAA 
GTGGGAACAT GATAGTGTAT GGCTTGGTGG 
CAGCATCTAG ACCCAGACCC AGAGCATCAC 
GGAGATGGGG GCTTCTGAAG ATGGACTTAC 

40 TCCCCCCACA GTCAGCCTGT GCAAAGGCCC 
TGTGGACAGG ATGGGAGACT GTGGCCTGAA 
AGAGACCCTG AGACCTGGGG CACCATGGCT 
GTCCGTGCAG CCACACCCTC TTCCCTGCCA 
TCCGCCTGGA GCCTTCTATG GACGTGATAT 

45 ACTTAGGGGA AGTGAAATCG CTCAGAGATG 

GAATCTAGTG TCTTTCTAAT GTGGTAAAAT 
TGAACTTCAG AATCTCACTT ACAGCAGGCG 
GTCTGGGGGC TCCCTGGAGC TCCTCCTGCG 
TCCAGGGTTA TTCTCCTCCT CGAGTCACAG 

50 CTGCTATACA CATATTCACA TGGCGCTCAA 

CTCTGGACAA CTGGCCCAGT TTACAGTGAA 
AGGAAAGAAC TTCAGCTGAC TCCACGGGGA 
TCTTATTAGC TCCCCGCTCC ACAAGACACC 
TCGGCTCTTA TTAGCTCCCC GCTCCACAAG 

55 CCCGATCGGC TCTTATTAGC TCCCCGCTCC 

CAGGAGCACG TGCTGACCAG TTTTCCCTTC 
TGTTTGCAAA CACTAGTGCA CTTTGTAGCT 
AGATGAGGCC CGTCAGAGTC AAGAGATGTC 
ACTATTGGTG GCACCTGGAG GACATGCACC 

60 CCAGAGCATG GCACATGAGC ATCACCCGCT 

GGGGCATCCC GGCCCGTACC CCTCCAGACA 
GGTGCTCCTG TGAGTGGCCT CCAGATGTCT 
GAGGGAGGTG GGAAACCTCA TCATCCGGTG 
GGTATTCCTG GCAGTAGCCA TGACATTGGA 

65 GAGGG CCACT GTCCTCAGAT GACACCACCC 

CCTTATGTGA ACCTCTTGCC TCTTCCTTTC 
GCCTCCTTTT CTTCAGCGGG CCCTTCAACC 
TACTAGAAAA GCTGAGTGGA GTCTCCTTTC 
GGGCTGGAAT GAGCCGGCTG GTCCCCCAGA 

70 CCTCTCTGTT TACAGCTCCT TGACAGTCCC 

GTGTTGGAGA AGAAACAACA AAAGCCAATT 
TGCACAGATA CTCTTCAAGC ACTGGACGTG 
GGTAGGAGTG CCGCCTCTAC CCACT TGTGA 
GGTGTTCAAT AGGCTGGGAG TTTTATTTAT 

75 TTGTCTTGGG CTTTCGTCAT TAAACCAAAG 

TTAGTCTTGG TCATCAGAAC CTCACTTGGT 
GGAAAAAATA AACTCTTCCA TCCCTTAAAG 
TGGGCTGTAT GTATATTGTT CTTCCTCCTT 
AACTTTTCAT GGACACAATT TCCACAACCT 

80 GAACTTCCAA ACTCAGGAAG TTTGCAGAGA 

AGTTGGTCGA CAGATGTTAG ATGT AT CCT A 
GCCCCCAGAT CCCACAGTCA GAACTGAATC 
GGAAGGAAGC CATGGCTGTG GTTCAGAGAG 
CTCCTTCCGC CCCAGGTTTC TTCTTCTCTT 

85 CTTCATGCTG CCTTCAAAGC T AGAT C ATGT 

GCCCCAGTGC TTGGCGATGC ATTTACAGAT 
AGCCCTGGTG GGCAGGGTTG GGGGGTCTGT 



AGGGTCTTCC AGCACCTGGG CAATCTCCAG 600 

GATATCCCCA TGGGCACTTT TGATGGGCTT 660 

AACCAGATTG GACTGCTCTC CCCTGGTCTC 720 

TACCTGTCCA ACAACCACAT CTCCCAGCTG 780 

CTCAACCGTC TTACTCTCTT TGGGAATTCC 840 

CCCATGCCCA ACCTGCGGGA GCTTTGGCTC 900 

AATGTCTTCA GCAACCTCCG CCAGTTGCAG 960 

TTCATCTCCC CGGGTGCCTT CAACGGGCTA 1020 

AACGCACTGC AGGACCTGGA CGGGAATGTC 1080 

TCCCTGCAGA ACAATCGCCT CAGACAGCTC 1140 

CTCATGGCCA TCCAGCTGCA GAACAACCAG 1200 

CACCTGGGGA AACTGTGTGA GCTGCGGCTG 12 60 

ATCCTTCCGC TCCGCAACTG GCTCCTGCTC 1320 

CCTGTGTGTT TCAGCCCAGC CAATGTCCGA 1380 

GTTGCTGTTC CAAGCGTCCA TGTCCCTGAG 1440 

CCAGACACAC CCAGTTACCC TGACACCACA 1500 

CCTGTGGAAG ACTACACTGA TCTGACTACC 1560 

GGCATGACCC AGGCCCAGAG CGGGCTGGCC 1620 

CTGGCCTGCT CCCTGGCTGC CTGCGTCGGC 1680 

GTCCTGATGC AGATGAAGGC ACCCAATGAG 1740 

GGAATGATGG GACTGGAGGA CCTGGGAATT 180 0 

GGAGCTTTCC CGTGATTGCT CTTTCTGGCC 1860 

CTTGCCTGAT TCTCCCGTAG AGAAGCAGGT 192 0 

AGATCCAACT GGCCATGGCA AAAGCCCTGG 1980 

TCGAGAGGGC TCTTCCTCCA AATCCTCCCC 2 040 

CCCAGGCCCC CTCCGGGCCT CTGTAGACTC 210 0 

AATAGTTCTC CGCTGAGATA GCCCCTCTCG 2160 

TTTTGTTTCT CTTGTTTGTG CTATGGCTTG 2220 

CCCCTTGATT TTCTGCTCCT GAAGGCAGGG 2280 

CATTTAACTG GTTTCTTAAG AGCCGTCAAT 2 340 

AGAAGGAAAA TCATGCCGCT CAGTTCCTGG 2400 

TGTGATTTTT ATCTGGAAAA GGAAGAAACA 2460 

AAGGATATTT CCAAACTGCA AACTTTGCTT 2 520 

CATGTAGAAT TTTGGACTTC TAAAAACATT 2580 

AGAAATCTGG TGCCTGGGGG TCCCTGTGTT 2 640 

TTGAAGCATG TGAAGTGTAC STGCAGAAAA 2 700 

ATTTTCACAA ACTGAACATA CCTGTGTAAT 2760 

AAATATCCCC CATCCTGGGC TTTTCCCAGA 2 820 

CTGGGACCTG CCCCCCATGA GCCAGGACGG 2 880 

CGTGGCCAGG GGTGGAGGAG AATATGTGGG 2 940 

CAGGAGATTT TATTATATCT GGAGACCCTG 3 000 

GGCCAGGTCA GAAGCATCCT GACTGCAGAG 3 060 

GCAAGTTGTC TGCGGCTCAT CGGAGGCCCC 3120 

GCCTGTATCT GTTTTTAATT TTCATTCTTC 3180 

AGATCCTTTA ATTGAAAACG AAGTGTAACG 3 240 

TCTCCATCAA CATCACAGTC AGCTGGCAGC 33 00 

ACACGGGGGT ACACCGATGG GTCACACTGG 3360 

TGTGGTCTGG TTAGGAGTTG AGTTGTTTGC 3420 

TCACACGAAT ACCTGCCTTC TCTGGCTTTC 3480 

GAAGTTAGGC TCATGGCAAC GTGTGTCTTT 3540 

ATGGAGAATT TCAGGTCTCC ACGTCTGCCC 3 600 

TCTGGAAATC CACGACCAAT CCCGATCGGC 3 660 

TGTGCTTTGG AAATCCACCA CCAATCCCGA 3 720 

ACACCTGTGA TCTGGAAATC TACCACCAAT 3 7 80 

ACAAGACACC TGTGACATCC TCCAGGGCCA 3 840 

CAGTTCCTGC ACAAAAAGTG T CC AGAGGGC 3 900 

TTTCACCCTC TGTCCCAGGG AAT CTAGGAG 3 960 

ATCCCCCCAG GGTCTCCAAG GCATTTCCAC 4020 

AAGGCTTGCC AGAGCCAACA GGAAGTGAGC 4 0 80 

GATGGTGGCC TGCTGTGCCT GGTGCCAACA 4140 

GGAAGCATGG GTTTGCCCAC AGACCTGTCG 42 00 

TTGTGCATAG GCACAAGTGG GCCAGGGCTG 42 60 

GGCCCTGCCA ATCTTAACCC AGAACCCTTA 4320 

GCACCTTCCT CTCCAGCCAG AGGCTGACCT 43 80 

AGGAGCACCC TAGGTGAGGG GTGAGGGCCC 4440 

TCCCATCAGA GTGGTTGGAT GGAGCCATTG 4500 

TCTCTGCACC ATGTTGTCTG GCTGAGGAGC 4560 

CAACAGGATG ATGCATTTGC TCAATTCTCA 462 0 

AAGCTGGAGT GGGGTACAGA GTTCAGTTTT 4680 

ACGCCCATCT GGAGTGGGAG CTGGGAGTTA 4740 

AGAACCACTA TTTTTAAAAA GTGCTTACTG 4800 

GATTCTCTCT CTAGCCCTCA GCACCCCTGC 4 860 

TGGGGTACAG AGGCACTTGC TCTTCTGCAT 492 0 

CTCTTCAAAC TTTGTACAAG AGCTCATGGC 4980 

GAAATGGAAG CCATTCCCCT GTTGCTCTCC 5040 

ACCATATAGA TCAAAAGCTT TGTAACCACA 5100 

AATAGAATAG TTTGTCCCTC TCATGGGAAT 5160 

AGAATTTAGA GATACAAGAG TTCTACTTAG 5220 

TTCAGATGCT GATGTAGAGC TATTGGGAAA 52 80 

GCAGACAGCT AGAGATAACT CGGGACCCAG 5340 

GCTTTTAGCC ATAAACCACT CAAAGATTCA 5400 

TGCGTTGTTG GGAAGCCAGC AGTGGCCTTG 5460 

GGTGGGCTGG CAAGCCACTT CCGGGGAAAA 5520 

AAGGAGAGAT TGTTCTCACC AACCCGCTGC 5580 

TTGCCTTGCT TAGAGAATTA CTGCAAATCA 5640 

TTCTAGGCCC TCAGGGTTTT GTAGAGTGTG 5700 

CTTCTGCTGG ATGCTGCTTG TAATCCATTT 5760 
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GGTGTACAGA ATCAACAATA AATAATATAC ATGTAT 
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Seq ID NO: 611 Protein sequence 
^ Protein Accession #: BAB84587.1 

1 11 21 

I I I 

MPLKHYLLLL VGCQAWGAGL AYHGCPSECT 

LNTHITELNE SPFLNISALI ALR I EKNELS 

10 FQGLDSLESL LLSSNQLLQI QPAHFSQCSN 

GKNSLTHISP RVFQHLGNLQ VLRLYENRLT 

FHNNHNLQRL YLSNNHISQL PPSIFMQLPQ 

YDNHISSLPD NVFSNLRQLQ VLILSRNQIS 

FRMLANLQNI SLQNNRLRQL PGNIFANVNG 

15 YDNPWRCDSD ILPLRNWLLL NQPRLGTDTV 

VPSYPETPWY PDTPSYPDTT SVSSTTELTS 

IAAIVIGIVA LACSLAACVG CCCCKKRSQA 

Seq ID NO: 612 DNA sequence 
20 Nucleic Acid Accession #: XM_098151 
Coding sequence: 1..447 

1 11 21 31 41 51 

I I I ! I I 

ZJ ATGATGCATT TGCTCAATTC TCAGGGCTGG AATGAGCCGG CTGGTCCCCC AGAAAGCTGG 60 

AGTGGGGTAC AGAGTT CAGT TTTCCTCTCT GTTTACAGCT CCTTGACAGT CCCACGCCCA 120 

TCTGGAGTGG GAGCTGGGAG TCAGTGTTGG AGAAGAAACA ACAAAAGCCA ATTAGAACCA 180 

CTATTTTTAA AAAGTGCTTA CTGTGCACAG ATACTCTTCA AGCACTGGAC GTGGATTCTC 240 

TCTCTAGCCC TCAGCACCCC TGCGGTAGGA GTGCCGCCTC TACCCACTTG TGATGGGGTA 3 00 

30 CAGAGGCACT TGCTCTTCTG CATGGTGTTC AATAGGCTGG GAGTTTTATT TATCTCTTCA 3 60 

AACTTTGTAC AAGAGCTCAT GGCTTGTCTT GGGCTTTCGT CATTAAACCA AAGGAAATGG 42 0 
AAGCCATTCC CCTGTTGCTC TCCTTAG 

Seq ID NO: 613 Protein sequence 
35 Protein Accession #t XP_098151 

1 11 21 31 41 51 

i l I I I I 

MMHLLNSQGW NEPAGPPESW SGVQSSVFLS VYSSLTVPRP SGVGAGSQCW RRNNKSQLEP 60 

40 LFLKSAYCAQ ILFKHWTWIL SLALSTPAVG VPPLPTCDGV QRHLLFCMVF NRLGVLFISS 120 

NFVQELMACL GLSSLNQRKW KPFPCCSP 

Seq ID NO: 614 DNA sequence 
Nucleic Acid Accession #: NMJ)02658.1 
45 Coding sequence: 77.. 1372 

1 ll 21 31 41 51 

I I I 1 I I 

GTCCCCGCAG CGCCGTCGCG CCCTCCTGCC GCAGGCCACC GAGGCCGCCG CCGTCTAGCG 60 

50 CCCCGACCTC GCCACCATGA GAGCCCTGCT GGCGCGCCTG CTTCTCTGCG TCCTGGTCGT 120 

GAGCGACTCC AAAGGCAGCA ATGAACTTCA TCAAGTTCCA TCGAACTGTG ACTGTCTAAA 180 

TGGAGGAACA TGTGTGTCCA ACAAGTACTT CTCCAACATT CACTGGTGCA ACTGCCCAAA 240 

GAAATTCGGA GGGCAGCACT GTGAAATAGA TAAGTCAAAA ACCTGCTATG AGGGGAATGG 300 

TCACTTTTAC CGAGGAAAGG CCAGCACTGA CACCATGGGC CGGCCCTGCC TGCCCTGGAA 3 60 

55 CTCTGCCACT GTCCTTCAGC AAACGTACCA TGCCCACAGA TCTGATGCTC TTCAGCTGGG 42 0 

CCTGGGGAAA CATAATTACT GCAGGAACCC AGACAACCGG AGGCGACCCT GGTGCTATGT 480 

GCAGGTGGGC CTAAAGCCGC TTGTCCAAGA GTGCATGGTG CATGACTGCG CAGATGGAAA 540 

AAAGCCCTCC TCTCCTCCAG AAGAATTAAA ATTTCAGTGT GGCCAAAAGA CTCTGAGGCC 600 

CCGCTTTAAG ATTATTGGGG GAGAATTCAC CACCATCGAG AACCAGCCCT GGTTTGCGGC 660 

60 CATCTACAGG AGGCACCGGG GGGGCT CTGT CACCTACGTG TGTGGAGGCA GCCTCATCAG 72 0 

CCCTTGCTGG GTGATCAGCG CCACACACTG CTTCATTGAT TACCCAAAGA AGGAGGACTA 780 

CATCGTCTAC CTGGGTCGCT CAAGGCTTAA CTCCAACACG CAAGGGGAGA TGAAGTTTGA 840 

GGTGGAAAAC CTCATCCTAC ACAAGGACTA CAGCGCTGAC ACGCTTGCTC AC CACAACG A 90 0 

CATTGCCTTG CTGAAGATCC GTT CCAAGGA GGGCAGGTGT GCGCAGCCAT CCCGGACTAT 960 

65 ACAGACCATC TGCCTGCCCT CGATGTATAA CGATCCCCAG TTTGGCACAA GCTGTGAGAT 1020 

CACTGGCTTT GGAAAAGAGA ATTCTACCGA CTATCTCTAT CCGGAGCAGC TGAAAATGAC 1080 

TGTTGTGAAG CTGATTTCCC ACCGGGAGTG TCAGCAGCCC CACTACTACG GCTCTGAAGT 1140 

CACCACCAAA ATGCTATGTG CTGCTGACCC CCAATGGAAA ACAGATTCCT GCCAGGGAGA 12 0 0 

CTCAGGGGGA CCCCTCGTCT GTTCCCTCCA AGGCCGCATG ACTTTGACTG GAATTGTGAG 1260 

70 CTGGGGCCGT GGATGTGCCC TGAAGGACAA GCCAGGCGTC TACACGAGAG TCTCACACTT 1320 

CTTACCCTGG ATCCGCAGTC ACACCAAGGA AGAGAATGGC CTGGCCCTCT GAGGGTCCCC 1380 

AGGGAGGAAA CGGGCACCAC CCGCTTTCTT GCTGGTTGTC ATTTTTGCAG TAGAGTCATC 1440 

TCCATCAGCT GTAAGAAGAG ACTGGGAAGA TAGGCTCTGC ACAGATGGAT TTGCCTGTGG 1500 

CACCACCAGG GTGAACGACA ATAGCTTTAC CCTCACGGAT AGGCCTGGGT GCTGGCTGCC 1560 

75 CAGACCCTCT GGCCAGGATG GAGGGGTGGT CCTGACTCAA CATGTTACTG ACCAGCAACT 1620 

TGTCTTTTTC TGGACTGAAG CCTGCAGGAG TTAAAAAGGG CAGGGCATCT CCTGTGCATG 1680 

GGCTCGAAGG GAGAGCCAGC TCCCCCGACC GGTGGGCATT TGTGAGGCCC ATGGTTGAGA 1740 

AATGAATAAT TTCCCAATTA GGAAGTGTAA GCAGCTGAGG TCTCTTGAGG GAGCTTAGCC 1800 

AATGTGGGAG CAGCGGTTTG GGGAGCAGAG AC AC T AACG A CTTCAGGGCA GGGCTCTGAT 1860 

80 AT T CCATGAA TGTATCAGGA AATATATATG TGTGTGTATG TTTGCACACT TGTTGTGTGG 192 0 

GCTGTGAGTG TAAGTGTGAG TAAGAGCTGG TGTCTGATTG TTAAGTCTAA ATATTTCCTT 1980 

AAACTGTGTG GACTGTGATG CCACACAGAG TGGTCTTTCT GGAGAGGTTA TAGGTCACTC 2 04 0 

CTGGGGCCTC TTGGGTCCCC CACGTGACAG TGCCTGGGAA TGTACTTATT CTGCAGCATG 2100 

ACCTGTGACC AGCACTGTCT CAGTTTCACT TT CACATAGA TGTCCCTTTC TTGGCCAGTT 2160 

85 ATCCCTTCCT TTTAGCCTAG TTCATCCAAT CCTCACTGGG TGGGGTGAGG ACCACTCCTT 2220 

ACACTGAATA TTTATATTTC ACTATTTTTA TTTATATTTT TGTAATTTTA AATAAAAGTG 22 80 
ATCAATAAAA TGTGATTTTT CTGA 



31 41 51 

I 1 I 

CSRASQVECT GARIVAVPTP LPWNAMSLQI 60 

R I T PGAFRNL GSLRYLSLAN NKLQVLPIGL 120 

LKELQLHGNH LEYIPDGAFD HLVGLTKLNL 180 

DIPMGTFDGL VNLQELALQQ NQIGLLSPGL 240 

LNRLTLFGNS LKELSLGIFG PMPNLRELWL 300 

FISPGAFNGL TELRELSLHT NALQDLDGNV 360 

LMAIQLQNNQ LENLPLGIFD HLGKLCELRL 42 0 

PVCFSPANVR GQSLIIINVN VAVPSVHVPE 480 

PVEDYTDLTT IQVTDDRSVW GMTQAQSGLA 540 

VLMQMKAPNE C 



423 
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Seq ID NO: 615 Protein sequence 
Protein Accession #: NP_002649.1 

1 11 21 31 41 51 

I I 1 I I I 

MRALLARKLL CVLWSDSKG SNELHQVPSN CDCLNGGTCV SNKYFSNIHW CNCPKKFGGQ 60 

HCEIDKSKTC YEGNGHFYRG KASTDTMGRP CLPWNSATVL QQTYHAHRSD ALQLGLGKHN 120 

YCRNPDNRRR PWCYVQVGLK PLVQECMVHD CADGKKPSSP PEELKFQCGQ KTLRPRFKI I 180 

GGEFTTIENQ PWFAAIYRRH RGGSVTYVCG GSLISPCWVI SATHCFIDYP KKEDYIVYLG 240 

RSRLNSNTQG EMKFEVENLI LHKDYSADTL AHHND I ALLK IRSKEGRCAQ PSRTIQTICL 300 

PSMYNDPQFG TSCEITGFGK ENSTDYLYPE QLKMTWKLI SHRECQQPHY YGSEVTTKML 360 

CAADPQWKTD SCQGDSGGPL VCSLQGRMTL TGIVSWGRGC ALKDKPGVYT RVSHFLPWIR 420 
SHTKEENGLA L 

Seq ID NO: 616 DNA sequence 

Nucleic Acid Accession #: NM_024422.1 

Coding sequence: 202,. 2907 

1 11 21 31 41 51 

I I 1 I I I 

CGCCAAAGGA AAAGCCC CTT GGATGAGAGG CAGGCGCTTC AGAGAAGCTA AGAAAAGCAC 60 

CTCTCCGCGC GCCCCACCTC CTCCGCCTCG CGCTCCTCCT GAGCAGCGGG CCCAGACTGC 12 0 

GCTCCGGCCG CGGCCCTCGC CCCGCGGAGC CCTCCTACCC CGGCCCGACG CTCGGCCCGC 180 

GACCTGCCCC GAGCCCTCTC CATGGAGGCA GCCCGCCCCT CCGGCTCCTG GAACGGAGCC 240 

CTCTGCCGGC TGCTCCTGCT GACCCTCGCG AT CTTAATAT TTG CCAGTG A TGCCTGCAAA 300 

AATGTGACAT TACATGTTCC CTCCAAACTA GATGCCGAGA AACTTGTTGG TAGAGTTAAC 3 60 

CTGAAAGAGT GCT TTACAGC TGCAAATCTA ATTCATTCAA GTGATCCTGA CTTCCAAATT 420 

TTGGAGGATG GTTCAGTCTA TACAACAAAT ACTATTCTAT TGTCCTCGGA GAAGAGAAGT 480 

TTTACCATAT TACTTTCCAA CACTGAGAAC CAAGAAAAGA AGAAAATATT TGTCTTTTTG 540 

GAGCATCAAA CAAAGGT CCT AAAGAAAAGA CATACTAAAG AAAAAGTTCT AAGGCGCGCC 600 

AAGAGAAGAT GGGCTCCAAT TCCTTGTTCG ATGCTAGAAA ACTCCTTGGG TCCTTTTCCA 660 

CTTTTCCTTC AACAGGTTCA ATCTGACACG GCCCAAAACT ATACCATATA CTATTCCATA 720 

AGAGGTCCTG GAGTTGACCA AGAACCT CGG AATTTATTTT ATGTGGAGAG AGACACTGGA 780 

AACTTGTATT GTACTCGTCC TGTAGATCGT GAGCAGTATG AATCTTTTGA GATAATTGCC 840 

TTTGCAACAA CTCCAGATGG GTATACTCCA GAACTTCCAC TGCCCCTAAT AATCAAAATA 900 

GAGGATGAAA ATGATAACTA CCCAATTTTT ACAGAAGAAA CTTATACTTT TACAATTTTT 960 

GAAAATTGCA GAGTGGGCAC TACTGTGGGA CAAGTGTGTG CTACTGACAh AGATGAGCCT 102 0 

GACACGATGC ACACACGCCT GAAGTACTCC ATCATTGGGC AGGTGCCACC ATCACCCACC 1080 

CTATTTTCTA TGCATCCAAC TACAGGCGTG ATCACCACAA CATCATCTCA GCTAGACAGA 1140 

GAGTTAATTG ACAAGTACCA GTTGAAAATA AAAGTACAAG ACATGGATGG TCAGTATTTT 1200 

GGTCTACAGA CAACTT CAAC TTGTATCATT AACATTGATG ATGTAAATGA CCACTTGCCA 1260 

ACAT TT ACT C GTACTTCTTA TGTGACATCA GTGGAAGAAA ATACAGTTGA TGTGGAAATC 1320 

TTACGAGTTA CTGTTGAGGA TAAGGACTTA GTGAATACTG CTAACTGGAG AGCTAATTAT 1380 

ACCATTTTAA AGGGCAATGA AAATGGCAAT TTTAAAATTG TAACAGATGC CAAAACCAAT 1440 

GAAGGAGTTC TTTGTGTAGT TAAGCCTTTG AATTATGAAG AAAAGCAACA GATGAT CTTG 1500 

CAAATTGGTG TAGTTAATGA AGCTCCATTT TCCAGAGAGG CTAGTCCAAG ATCAGCCATG 1560 

AGCACAGCAA CAGTTACTGT TAATGTAGAA GAT CAGGATG AGGGCCCTGA GTGTAACCCT 162 0 

CCAATACAGA CTGTTCGCAT GAAAGAAAAT GCAGAAGTGG GAACAACAAG CAATGGATAT 1680 

AAAG CAT ATG ACCCAGAAAC AAGAAGTAGC AGTGGCATAA GGTATAAGAA ATTAACTGAT 1740 

CCAACAGGGT GGGTCACCAT TGATGAAAAT ACAGGATCAA TCAAAGTTTT CAGAAGCCTG 1800 

GATAGAGAGG CAGAGACCAT CAAAAATGGC ATATATAATA TTACAGTCCT TGCATCAGAC 1860 

CAAGGAGGGA GAACATGTAC GGGGACACTG GGCATTATAC TTCAAGACGT GAATGATAAC 192 0 

AGCCCATTCA TACCTAAAAA GACAGTGATC ATCTGCAAAC CCACCATGTC ATCTGCGGAG 1980 

ATTGTTGCGG TTGATCCTGA TGAGCCTATC CATGGCCCAC CCTTTGACTT TAGTCTGGAG 204 0 

AGTTCTACTT CAGAAGTACA GAGAATGTGG AGACTGAAAG CAATTAATGA TACAGCAGCA 210 0 

CGTCTTTCCT ATCAGAATGA TCCTCCATTT GGCTCATATG TAGTACCTAT AACAGTGAGA 2160 

GATAGACTTG GCATGTCTAG TGTCACTTCA TTGGATGTTA CACTGTGTGA CTGCATTACC 2220 

GAAAATGACT GCACACATCG TGTAGATCCA AGGATTGGCG GTGGAGGAGT ACAACTTGGA 22 80 

AAGTGGGCCA TCCTTGCAAT ATTGTTGGGC ATAGCATTGC TCTTTTGCAT CCTGTTTACG 2 340 

CTGGTCTGTG GGGCTTCTGG GACGTCTAAA CAACCAAAAG TAATTCCTGA TGATTTAGCC 2400 

CAGCAGAACC TAATTGTATC AAACACAGAA GCTCCTGGAG ATGACAAAGT GTATTCTGCG 246 0 

AATGGCTTCA CAACCCAAAC TGTGGGCGCT TCTGCTCAGG GAGTTTGTGG CACCGTGGGA 2 52 0 

TCAGGAATCA AAAACGGAGG TCAGGAGACC ATCGAAATGG TGAAAGGAGG ACACCAGACC 2580 

TCGGAATCCT GCCGGGGGGC TGGCCACCAT CACACCCTGG ACTCCTGCAG GGGAGGACAC 2 640 

ACGGAGGTGG ACAACTGCAG ATACACTTAC TCGGAGTGGC ACAGTTTTAC TCAGCCCCGT 2 70 0 

CTTGGTGAAA AAGTGTATCT GTGTAATCAA GATGAAAATC ACAAGCATGC CCAAGACTAT 2 760 

GTCCTGACAT ATAACTATGA AGGAAGAGGA TCGGTGGCTG GGT CTGTAGG TTGTTGCAGT 2 82 0 

GAACGACAAG AAGAAGATGG GCTTGAATTT TTGGATAATT TGGAGCCCAA ATTTAGGACA 2 8 80 

CTAGCAGAAG CATGCATGAA GAGATGAGTG TGTTCTAATA AGTCT CTGAA AGCCAGTGGC 2 940 

TTTATGACTT TTAAAAAAAA TTACAAACCA AGAATTTTTT AAAGCAGAAG ATGCTATTTG 3000 

TGGGGGTTTT TCTCTCATTA TTTGGATGGA ATCTCTTTGG TCAAATGCAC ATTTACAGAG 3 060 

AGACACTATA AACAAGTACA CAAATTTTTC AATTTTTACA TATTTTTAAA TTACTTATCT 3120 

TCTATCCAAG GAGGTCTACA GAGAAATTAA AGTCTGCCTT ATTTGTTACA TTTGGGTATA 3180 

AT G ACAACAG CCAATTTATA GTGCAATAAA ATGTAATTAA TTCAAGTCCT TATTATAGAC 3240 

TATTTGAAGC ACAACCTAAT GGAAAATTGT AGAGACCTTG CTTTAACATT ATCTCCAGTT 3 3 00 

AATTAAGTGT TCATGTGGTG CTTGGAAACT GTTGTTTTCC TGAACATCTA AAGTGTGTAG 3 3 60 

ACTGCATTCT TGCTATTATT TTATTCTTGT AATGTGACCT TTTCACTGTG CAAAGGGAGA 342 0 
TTTCTAGCCA GGCATTGACT ATTACAATTT CATT 

Seq ID NO: 617 Protein sequence 
Protein Accession #: NP_077740.1 

1 11 21 31 41 51 

1 I I 1 I 1 

MEAARPSGSW NGALCRLLLL TLAILIFASD ACKNVTLHVP SKLDAEKLVG RVNLKECFTA 60 

ANLIHSSDPD FQILEDGSVY TTNTILLSSE KRSFTILLSN TENQEKKKIF VFLEHQTKVL 120 



424 
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KKRHTKEKVL RRAKRRWAPI PCSMLENSLG PFPLFLQQVQ SDTAQNYTIY YSIRGPGVDQ 180 

EPRNLFYVER DTGNLYCTRP VDREQYESFE I I AFATTPDG YTPEDPLPLI IKIEDENDNY 240 

PIFTEETYTF T I FENCRVGT TVGQVCATDK DEPDTMHTRL KYSIIGQVPP SPTLFSMHPT 300 

TGVITTTSSQ LDRELIDKYQ LKIKVQDMDG QYFGLQTTST CIINIDDVND HIiPTFTRTSY 360 

5 VTSVEENTVD VEILRVTVED KDLVNTANWR ANYT I LKGNE NGNFKIVTDA KTNEGVLCW 42 0 

KPLNYEEKQQ MILQIGWNE APFSREASPR SAMSTATVTV NVEDQDEGPE CNPPIQTVRM 480 

KENAEVGTTS NGYKAYD PET RSSSGIRYKK LTDPTGWVTI DENTGSIKVF RSDDREAETI 540 

KNGIYNITVL ASDQGGRTCT GTLGIILQDV NDNSPFIPKK TVIICKPTMS SAEIVAVDPD 600 

EPIHGPPFDF SLESSTSEVQ RMWRLKAIND TAARLSYQND PPFGSYWPI TVRDRLGMSS 660 

10 VTSLDVTLCD CITENDCTHR VDPRIGGGGV QLGKWAILAI LLGIALLFCI LFTLVCGASG 72 0 

TSKQPKVIPD DLAQQNLIVS NTEAPGDDKV YSANGFTTQT VGASAQGVCG TVGSGIKNGG 780 

QETIEMVKGG HQTSESCRGA GHHHTLDSCR GGHTEVDNCR YTYSEWHSFT QPRLGEKVYL 840 

CNQDENHKHA QDYVLTYNYE GRGSVAGSVG CCSERQEEDG LEFLDNLEPK FRTLAEACMK 90 0 



Seq ID NO-- 618 DNA sequence 

Nucleic Acid Accession ft: NM_004949.1 

Coding sequence: 202.. 2745 

20 1 11 21 31 41 51 

r i i i i i 

CGCCAAAGGA AAAGCCCCTT GGATGAGAGG CAGGCGCTTC AGAGAAGCTA AGAAAAGCAC 60 

CTCTCCGCGC GCCCCACCTC CTCCGCCTCG CGCTCCTCCT GAGCAGCGGG CCCAGACTGC 120 

GCTCCGGCCG CGGCCCTCGC CCCGCGGAGC CCTCCTACCC CGGCCCGACG CTCGGCCCGC 180 

25 GACCTGCCCC GAGCCCTCTC CATGGAGGCA GCCCGCCCCT CCGGCTCCTG GAACGGAGCC 240 

CTCTGCCGGC TGCTCCTGCT GACCCTCGCG ATCTTAATAT TTGCCAGTGA TGCCTGCAAA 300 

AATGTGACAT TACATGTTCC CTCCAAACTA GATGCCGAGA AACTTGTTGG TAGAGTTAAC . 360 

CTGAAAGAGT GCTTTACAGC TGCAAATCTA ATTCATTCAA GTGATCCTGA CTTCCAAATT 42 0 

TTGGAGGATG GTTCAGTCTA TACAACAAAT ACTATTCTAT TGTCCTCGGA GAAGAGAAGT 480 

30 TTTACCATAT TACTTTCCAA CACTGAGAAC CAAGAAAAGA AGAAAATATT TGTCTTTTTG 54 0 

GAGCATCAAA CAAAGGT CCT AAAGAAAAGA CATACTAAAG AAAAAGTTCT AAGGCGCGCC 600 

AAGAGAAGAT GGGCTCCAAT TCCTTGTTCG ATGCTAGAAA ACTCCTTGGG TCCTTTTCCA 660 

CTTTTCCTTC AACAGGTTCA ATCTGACACG GCCCAAAACT ATACCATATA CTATTCCATA 720 

AGAGGT CCTG GAGTTGACCA AGAACCTCGG AATTTATTTT ATGTGGAGAG AGACACTGGA 780 

35 AACTTGTATT GTACTCGTCC TGTAGATCGT GAGCAGTATG AATCTTTTGA GATAATTGCC 840 

TTTGCAACAA CTCCAGATGG GTATACTCCA GAACTTCCAC TGCCCCTAAT AATCAAAATA 900 

GAGGATGAAA ATGATAACTA CCCAATTTTT ACAGAAGAAA CTTATACTTT TACAATTTTT 960 

GAAAATTGCA GAGTGGGCAC TACTGTGGGA CAAGTGTGTG CTACTGACAA AGATGAGCCT 102 0 

GACACGATGC ACACACGCCT GAAGTACTCC AT CATTGGGC AGGTGCCACC ATCACCCACC 1080 

40 CTATTTTCTA TGCATCCAAC TACAGGCGTG ATCACCACAA CATCATCTCA GCTAGACAGA 1140 

GAGTTAATTG ACAAGTACCA GTTGAAAATA AAAGTACAAG ACATGGATGG TCAGTATTTT 12 00 

GGT CTACAGA CAACTTCAAC TTGTATCATT AACATTGATG ATGTAAATGA CCACTTGCCA 12 60 

ACATTTACTC GTACTTCTTA TGTGACATCA GTGGAAGAAA ATACAGTTGA TGTGGAAATC 132 0 

TTACGAGTTA CTGTTGAGGA TAAGGACTTA GTGAATACTG CTAACTGGAG AGCTAATTAT 1380 

45 ACCATTTTAA AGGGCAATGA AAATGGCAAT TTTAAAATTG TAACAGATGC CAAAACCAAT 1440 

GAAGGAGTTC TTTGTGTAGT TAAGCCTTTG AATTATGAAG AAAAGCAACA GATGATCTTG 1500 

CAAATTGGTG TAGTTAATGA AGCTCCATTT TCCAGAGAGG CTAGTCCAAG ATCAGCCATG 1560 

AGCACAGCAA CAGTTACTGT TAATGTAGAA GATCAGGATG AGGGCCCTGA GTGTAACCCT 1620 

CCAATACAGA CTGTTCGCAT GAAAGAAAAT GCAGAAGTGG GAACAACAAG CAATGGATAT 1680 

50 AAAGCATATG ACCCAGAAAC AAGAAGTAGC AGTGGCATAA GGTATAAGAA ATTAACTGAT 1740 

CCAACAGGGT GGGTCACCAT TGATGAAAAT AC AGG AT C AA TCAAAGTTTT CAGAAGCCTG 180 0 

GATAGAGAGG CAGAGACCAT CAAAAATGGC ATATATAATA TTACAGTCCT TGCATCAGAC 1860 

CAAGGAGGGA GAACATGTAC GGGGACACTG GGCATTATAC TTCAAGACGT GAATGATAAC 1920 

AGCCCATTCA TACCTAAAAA GACAGTGATC ATCTGCAAAC CCACCATGTC ATCTGCGGAG 1980 

55 ATTGTTGCGG TTGATCCTGA TGAGCCTATC CATGGCCCAC CCTTTGACTT TAGTCTGGAG 2 040 

AGTTCTACTT CAGAAGTACA GAGAATGTGG AGACTGAAAG CAATTAATGA TACAGCAGCA 2100 

CGTCTTTCCT AT CAG AATG A TCCTCCATTT GGCTCATATG TAGTACCTAT AACAGTGAGA 2160 

GATAGACTTG GCATGTCTAG TGTCACTTCA TTGGATGTTA CACTGTGTGA CTGCATTACC 222 0 

* GAAAATGACT GCACACATCG TGTAGATC C A AGGATTGGCG GTGGAGGAGT ACAACTTGGA 2280 

60 AAGTGGGCCA TCCTTGCAAT ATTGTTGGGC ATAGCATTGC TCTTTTGCAT CCTGTTTACG 2 340 

CTGGTCTGTG GGGCTTCTGG GACGTCTAAA CAACCAAAAG TAATTCCTGA TGATTTAGCC 240 0 

CAG CAG AAC C TAATTGTATC AAACACAGAA GCTCCTGGAG ATGACAAAGT GTATTCTGCG 2460 

AATGGCTTCA CAACCCAAAC TGTGGGCGCT TCTGCTCAGG GAGTTTGTGG CACCGTGGGA 2 520 

TCAGG AATCA AAAACGGAGG TCAGGAGACC ATCGAAATGG TGAAAGGAGG ACACCAGACC 2580 

65 TCGGAATCCT GCCGGGGGGC TGGCCACCAT CACACCCTGG ACTCCTGCAG GGGAGGACAC 2640 

ACGGAGGTGG ACAACTGCAG ATACACTTAC TCGGAGTGGC ACAGTTTTAC TCAGCCCCGT 2700 

CTTGGTGAAG AAT CCATTAG AGGACACACT CTGATTAAAA ATTAAACAAT GAAAGAAAGT 2 760 

GTATCTGTGT AATCAAGATG AAAATCACAA GCATGCCCAA GACTATGTCC TGACATATAA 2820 

CTATGAAGGA AGAGGATCGG TGGCTGGGTC TGTAGGTTGT TGCAGTGAAC GACAAGAAGA 2880 

70 AGATGGGCTT GAATTTTTGG ATAATTTGGA GCCCAAATTT AGGACACTAG CAGAAGCATG 2 940 

CATGAAGAGA TGAGTGTGTT CTAATAAGTC TCTGAAAGCC AGTGGCTTTA TGACTTTTAA 3000 

AAAAAATTAC AAACCAAGAA TTTTTTAAAG CAGAAGATGC TATTTGTGGG GGTTTTTCTC 3 060 

TCATTATTTG GATGGAATCT CTTTGGTCAA ATGCACATTT ACAGAGAGAC ACTATAAACA 3120 

AGTACACAAA TTTTTCAATT TTTACATATT TTTAAATTAC TTATCTTCTA TCCAAGGAGG 3180 

75 T CT AC AG AG A AATTAAAGTC TGCCTTATTT GTTACATTTG GGT AT AATG A CAACAGCCAA 3240 

TTTATAGTGC AATAAAATGT AATTAATTCA AGT CCTTATT ATAGACTATT TGAAGCACAA 3300 

CCTAATGGAA AATTGTAGAG ACCTTGCTTT AACATTATCT CCAGTTAATT AAGTGTTCAT 33 60 

GTGGTGCTTG GAAACTGTTG TTTTCCTGAA CAT CT AAAGT GTGTAGACTG CATTCTTGCT 3420 

ATTATTTTAT TCTTGTAATG TGACCTTTTC ACTGTGCAAA GGGAGATTTC TAGCCAGGCA 3480 

80 TTGACTATTA CAATTTCATT 

Seq ID NO: 619 Protein sequence 
Protein Accession #: NP_004940.1 

85 1 11 21 31 41 51 

I I I I I I 

MEAARPSGSW NGALCRLLLL TLAILIFASD ACKNVTLHVP SKDDAEKLVG RVNLKECFTA 60 
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ANLIHSSDPD FQILEDGSVY 
KKRHTKEKVL RRAKRRWAPI 
EPRNLFYVER DTGNLYCTRP 
PIFTEETYTF TIFENCRVGT 
TGVITTTSSQ LDRELIDKYQ 
VTSVEENTVD VEILRVTVED 
KPLNYEEKQQ MILQIGWNE 
KENAEVGTTS NGYKAYDPET 
KNGIYNITVL ASDQGGRTCT 
EPIHGPPFDF SLESSTSEVQ 
VTSLDVTLCD C I TEND CTHR 
TSKQPKVIPD DLAQQNLIVS 
QETIEMVKGG HQTSESCRGA 
GHTLIKN 



TTNTILLSSE KRSFTILLSN 
PCSMLENSLG PFPLFLQQVQ 
VDREQYESFE I I AFATTPDG 
TVGQVCATDK DEPDTMHTRL 
LKIKVQDMDG QYFGLQTTST 
KDLVNTANWR ANYTILKGNE 
APFSREASPR SAMSTATVTV 
RSSSGIRYKK LTDPTGWVTI 
GTLGIILQDV NDNSPFIPKK 
RMWRLKAIND TAARLSYQND 
VDPRIGGGGV QLGKWAILAI 
NTEAPGDDKV YSANGFTTQT 
GHHHTLDSCR GGHTEVDNCR 



TENQEKKKIF VFLEHQTKVL 
SDTAQNYTIY YSIRGPGVDQ 
YTPELPLPLI IKIEDENDNY 
KYSIIGQVPP SPTLFSMHPT 
CIINIDDVND HLPTFTRTSY 
NGNFKIVTDA KTNEGVLCW 
NVEDQDEGPE CNPPIQTVRM 
DENTGSIKVF RSLDREAETI 
TVIICKPTMS SAEIVAVDPD 
PPFGSYWPI TVRDRLGMS S 
LLGIAL.LFCI LFTLVCGASG 
VGASAQGVCG TVGSGIKNGG 
YTYSEWHSFT QPRLGEESIR 



Seq ID NO: 620 DNA sequence 

Nucleic Acid Accession #: NM_032545.1 

Coding sequence: 4 6.. 71 8 



1 11 21 31 41 51 

] 1 I 1 I I 

AAACTGATCT TCAATGCACT AAGAGAAGGA GACTCTCAAA CCAAAAATGA CCTGGAGGCA 
CCATGTCAGG CTTCTGTTTA CGGTCAGTTT GGCATTACAG ATCATCAATT TGGGAAACAG 
CTATCAAAGA GAGAAACATA ACGGCGGTAG AG AGGAAGT C ACCAAGGTTG CCACTCAGAA 
GCACCGACAG TCACCGCTCA ACTGGACCTC CAGT CATTTC GGAGAGGTGA CTGGGAGCGC 
CGAGGGCTGG GGGCCGGAGG AGCCGCTCCC CTACTCCCGG GCTTTCGGAG AGGGTGCGTC 
CGCGCGGCCG CGCTGCTGCA GGAACGGCGG TACCTGCGTG CTGGGCAGCT TCTGCGTGTG 
CCCGGCCCAC TTCACCGGCC GCTACTGCGA GCATGACCAG AGGCGCAGTG AATGCGGCGC 
CCTGGAGCAC GGAGCCTGGA CCCTCCGCGC CTGCCACCTC TGCAGGTGCA TCTTCGGGGC 
CCTGCACTGC CTCCCCCTCC AGACGCCTGA CCGCTGTGAC CCGAAAGACT TCCTGGCCTC 
CCACGCTCAC GGGCCGAGCG CCGGGGGCGC GCCCAGCCTG CTACTCTTGC TGCCCTGCGC 
ACTCCTGCAC CGCCTCCTGC GCCCGGATGC GCCCGCGCAC CCTCGGTCCC TGGTCCCTTC 
CGTCCTCCAG CGGGAGCGGC GCCCCTGCGG AAGGCCGGGA CTTGGGCATC GCCTTTAATT 
TTCTATGTTG TAAATAATAG ATGTGTTTAG TTTACCGTAA GCTGAAGCAC TGGGTGAATA 
TTTTTATTGG GTAATAAATA TTTTCATGAA AGCGCCAAAA AAAAAAAAAA AAAAAAAAAA 
AAAAAA 



Seq ID NO: 621 Protein sequence 
Protein Accession #: NP_115934.1 



1 11 

I I 
MTWRHHVRLL FTVSLALQII 
VTGSAEGWGP EEPLPYSRAF 
SECGALEHGA WTLRACHLCR 
LLPCALLHRL LRPDAPAHPR 



21 31 

I I 
NLGNSYQREK HNGGREEVTK 
GEGASARPRC CRNGGTCVLG 
CIFGAIiHCLP LQTPDRCDPK 
SLVPSVLQRE RRPCGRPGLG 



41 51 

I I 

VATQKHRQSP LNWTSSHFGE 

SFCVCPAHFT GRYCEHDQRR 

DFLASHAHGP SAGGAPSLLL 
HRL 



Seq ID NO: 622 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 
Coding sequence: 1..390 

1 11 21 31 41 51 

I 1 I I I I 

ATGAGGTTCA GTGTCTCAGG CATGAGGACC GACTACCCCA GGAGTGTGCT GGCTCCTGCT 
TATGTGT CAG TCTGTCTCCT CCTCTTGTGT CCAAGGGAAG TCATCGCTCC CGCTGGCTCA 
GAACCATGGC TGTGCCAGCC GGCACCCAGG TGTGGAGACA AGATCTACAA CCCCTTGGAG 
CAGTGCTGTT ACAATGACGC CATCGTGTCC CTGAGCGAGA CCCGCCAATG TGGTCCCCCC 
TGCACCTTCT GGCCCTGCTT TGAGCTCTGC TGTCTTGATT CCTTTGGCCT CACAAACGAT 
TTTGTTGTGA AGCTGAAGGT TCAGGGTGTG AATTCCCAGT GCCACTCATC TCCCATCTCC 
AGTAAATGTG AAAGAGGCCG GATATGTTAG 

Seq ID NO: 623 Protein sequence 
Protein Accession #: FGENESH predicted 



1 11 21 31 41 51 

1 I 1 I I I 

MRFSVSGMRT DYPRSVLAPA YVSVCLLLLC PREVIAPAGS EPWLCQPAPR CGDKIYNPLE 

QCCYNDAIVS LSETRQCGPP CTFWPCFELC CLDSFGLTND FWKLKVQGV NSQCHSSPIS 
SKCERGRIC 



Seq ID NO: 624 DNA sequence 
Nucleic Acid Accession #: M18728.1 
Coding sequence: 51.. 1085 



1 11 

1 I 
GGAGCTCAAG CTCCTCTACA 
CCTCAGCCCC TCCCTGCAGA 
TTCTAACCTT CTGGAACCCA 
ATGTCGCAGA GGGGAAGGAG 
GTTACAGCTG GTACAAAGGC 
TAGGAACTCA ACAAGCTACC 
ATGCATCCCT GCTG AT CCAG 
TCATAAAGTC AGATCTTGTG 
TGCCCAAGCC CTCCATCTCC 
CCTTCACCTG TGAACCTGAG 
GCCTCCCGGT CAGTCCCAGG 



21 31 

I i 
AAGAGGTGGA CAGAGAAGAC 
TTGCATGTCC CCTGGAAGGA 
CCCACCACTG CCAAGCTCAC 
GTTCTTCTAC TCGCCCACAA 
GAAAGAGTGG ATGGCAACAG 
CCAGGGCCCG CATACAGTGG 
AACGTCACCC AGAATGACAC 
AATGAAGAAG CAACCGGACA 
AGCAACAACT CCAACCCCGT 
GTTCAGAACA CAACCTACCT 
CTGCAGCTGT CCAATGGCAA 



41 51 

I I 
AGCAGAGACC ATGGGACCCC 
GGTCCTGCTC ACAGCCTCAC 
TATTGAATCC ACGCCATTCA 
CCTGCCCCAG AAT CGTATTG 
TCTAATTGTA GG AT AT GT AA 
TCGAGAGACA ATATACCCCA 
AGGATTCTAT ACCCTACAAG 
GTTCCATGTA TACCCGGAGC 
GGAGGACAAG GATGCTGTGG 
GTGGTGGGTA AATGGTCAGA 
CATGACCCTC ACT CT ACTCA 
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GCGTCAAAAG GAACGATGCA GGATCCTATG AATGTGAAAT ACAGAACCCA GCGAGTGCCA 720 

ACCGCAGTGA CCCAGTCACC CTGAATGTCC TCTATGGCCC AGATGTCCCC ACCATTTCCC 780 

CCTCAAAGGC CAATTACCGT CCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA 840 

ACCCACCTGC ACAGTACTCT TGGTTTATCA ATGGGACGTT CCAGCAATCC ACACAAGAGC 900 

TCTTTATCCC CAACATCACT GTGAATAATA GCGGATCCTA TATGTGCCAA GCCCATAACT 960 

CAGCCACTGG CCTCAATAGG ACCACAGT CA CGATGATCAC AGTCTCTGGA AGTGCTCCTG 1020 

TCCTCTCAGC TGTGGCCACC GTCGGCATCA CGATTGGAGT GCTGGCCAGG GTGGCTCTGA 1080 

TATAGCAGCC CTGGTGTATT TTCGATATTT CAGGAAGACT GGCAGATTGG ACCAGACCCT .1140 

GAATTCTTCT AGCTCCTCCA ATCCCATTTT ATCCCATGGA ACCACTAAAA ACAAGGTCTG 1200 

CTCTGCTCCT GAAGCCCTAT ATGCTGGAGA TGGACAACTC AATGAAAATT TAAAGGGAAA 12 60 

ACCCTCAGGC CTGAGGTGTG TGCCACTCAG AGACTTCACC TAACTAGAGA CAGTCAAACT 1320 

GCAAACCATG GTGAGAAATT GACGACTTCA CACTATGGAC AGCTTTTCCC AAGATGT CAA 13 80 

AACAAGACTC CTCATCATGA TAAGGCTCTT ACCCCCTTTT AATTTGTCCT TGCTTATGCC 1440 

TGCCTCTTTC GCTTGGCAGG ATGATGCTGT CATTAGTATT TCACAAGAAG TAGCTTCAGA 1500 

GGGTAACTTA ACAGAGTGTC AGAT CTATCT TGTCAATCCC AACGTTTTAC ATAAAATAAG 1560 

AGATCCTTTA GTGCACCCAG TGACTGACAT TAGCAGCATC TTTAACACAG CCGTGTGTTC 162 0 

AAATGTACAG TGGTCCTTTT CAGAGTTGGA CTTCTAGACT CACCTGTTCT CACTCCCTGT 1680 

TTTAATTCAA CCCAGCCATG CAATGCCAAA TAATAGAATT GCTCCCTACC AGCTGAACAG 1740 

GGAGGAGTCT GTGCAGTTTC TGACACTTGT TGTTGAACAT GGCTAAATAC AATGGGTATC 1800 

GCTGAGACTA AGTTGTAGAA ATTAACAAAT GTGCTGCTTG GTTAAAATGG CTACACTCAT 1860 

CTGACTCATT CTTTATTCTA TTTTAGTTGG TTTGTATCTT GCCTAAGGTG CGTAGTCCAA 1920 

CTCTTGGTAT TACCCTCCTA ATAGTCATAC TAGTAGTCAT ACTCCCTGGT GTAGTGTATT 1980 

CTCTAAAAGC TTTAAATGTC TGCATGCAGC CAGCCAT CAA ATAGTGAATG GTCTCTCTTT 2 04 0 

GGCTGGAATT ACAAAACTCA GAGAAATGTG TCAT CAGGAG AAC AT CAT AA CCCATGAAGG 2100 

ATAAAAGCCC CAAATGGTGG TAACTGATAA TAG C ACT AAT G CTTTAAG AT TTGGTCACAC 2160 

TCTCACCTAG GTGAGCGCAT TG AG CCAGTG GTGCTAAATG CTACATACTC CAACTGAAAT 222 0 

GTTAAGGAAG AAGAT AGAT C CAATTAAAAA AAATTAAAAC CAATTTAAAA AAAAAAAAGA 2280 

ACACAGGAGA TTCCAGTCTA CTTGAGTTAG CATAATACAG AAGTCCCCTC TACTTTAACT 2340 

TTTACAAAAA AGTAACCTGA ACT AAT CTG A TGTTAACCAA TGTATTTATT TCTGTGGTTC 2400 

TGTTTCCTTG TTCCAATTTG ACAAAACCCA CTGTTCTTGT ATTGTATTGC CCAGGGGGAG 2460 

CTATCACTGT ACTTGTAGAG TGGTGCTGCT TTAATTCATA AATCACAAAT AAAAGCCAAT 2520 
TAGCTCTATA ACT 

Seq ID NO: 625 Protein sequence 
Protein Accession #: AAA59907.1 

1 11 21 31 41 51 

I I 1 I ! I 

MGPPSAPPCR LHVPWKEVLL TASLiLTFWNP PTTAKLTIES TPFNVAEGKE VLLLAHNLPQ 60 

NRIGYSWYKG ERVDGNSLIV GYVIGTQQAT PGPAYSGRET IYPNASLLIQ NVTQNDTGFY 120 

TLQVIKSDLV NEEATGQFHV YPELPKPSIS SNNSNPVEDK DAVAFTCEPE VQNTTYLWWV 180 

NGQSLPVSPR LQLSNGNMTL TLLSVKRNDA GSYECEIQNP ASANRSDPVT LNVLYGPDVP 240 

TISPSKANYR PGENLNLSCH AASNPPAQYS WFINGTFQQS TQELFIPNIT VNNSGSYMCQ 300 
AHNSATGLNR TTVTMITVSG SAPVLSAVAT VGITIGVLAR VALI 

Seq ID NO: 626 DNA sequence 
Nucleic Acid Accession #: M18728.1 
Coding sequence: 1355.. 1657 

1 11 21 31 41 51 

1 I I I I I 

GGAGCTCAAG CTCCTCTACA AAGAGGTGGA CAGAGAAGAC AGCAGAGACC ATGGGACCCC 60 

CCTCAGCCCC TCCCTGCAGA TTGCATGTCC CCTGGAAGGA GGTCCTGCTC ACAGCCTCAC 12 0 

TTCTAACCTT CTGGAACCCA CCCACCACTG CCAAGCTCAC TATTGAATCC ACGCCATTCA 180 

ATGTCGCAGA GGGGAAGGAG GTTCTTCTAC TCGCCCACAA CCTGCCCCAG AATCGT AT TG 240 

GTTACAGCTG GTACAAAGGC GAAAGAGTGG ATGGCAACAG TCTAATTGTA GGATATGTAA 300 

TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGACA ATATACCCCA 360 

ATGCATCCCT GCTGATCCAG AACGTCACCC AGAATGACAC AGGATTCTAT ACCCTACAAG 42 0 

TCATAAAGTC AGAT CTTGTG AATGAAGAAG CAACCGGACA GTTCCATGTA TACCCGGAGC 480 

TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG 540 

CCTTCACCTG TGAACCTGAG GTTCAGAACA CAACCTACCT GTGGTGGGTA AATGGTCAGA 600 

GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CATGACCCTC ACTCTACTCA 660 

GCGTCAAAAG GAACGATGCA GGATCCTATG AATGTGAAAT ACAGAACCCA GCGAGTGCCA 720 

ACCGCAGTGA CCCAGTCACC CTGAATGTCC TCTATGGCCC AGATGTCCCC ACCATTTCCC 780 

CCTCAAAGGC CAATTACCGT CCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA 840 

ACCCACCTGC ACAGTACTCT TGGTTTATCA ATGGGACGTT CCAGCAATCC ACACAAGAGC 900 

TCTTTATCCC CAACATCACT GTGAATAATA GCGGATCCTA TATGTGCCAA GCCCATAACT 960 

CAGCCACTGG CCTCAATAGG ACCACAGT CA CGATGATCAC AGTCTCTGGA AGTGCTCCTG 1020 

TCCTCTCAGC TGTGGCCACC GTCGGCATCA CGATTGGAGT GCTGGCCAGG GTGGCTCTGA 1080 

TATAGCAGCC CTGGTGTATT TTCGATATTT CAGGAAGACT GGCAGATTGG ACCAGACCCT 1140 

GAATTCTTCT AGCTCCTCCA ATCCCATTTT ATCCCATGGA ACCACTAAAA ACAAGGTCTG 1200 

CTCTGCTCCT GAAGCCCTAT ATGCTGGAGA TGGACAACTC AATGAAAATT TAAAGGGAAA 12 60 

ACCCTCAGGC CTGAGGTGTG TGCCACTCAG AGACTTCACC TAACTAGAGA CAGTCAAACT 1320 

GCAAACCATG GTGAGAAATT GACGACTTCA CACTATGGAC AGCTTTTCCC AAGATGTCAA 1380 

AACAAGACTC CTCATCATGA TAAGGCTCTT ACCCCCTTTT AATTTGTCCT TGCTTATGCC 1440 

TGCCTCTTTC GCTTGGCAGG ATGATGCTGT CATTAGTATT TCACAAGAAG TAGCTTCAGA 1500 

GGGTAACTTA ACAGAGTGTC AGAT CTATCT TGTCAATCCC AACGTTTTAC ATAAAATAAG 1560 

AGATCCTTTA GTGCACCCAG TGACTGACAT TAGCAGCATC TTTAACACAG CCGTGTGTTC 162 0 

AAATGTACAG TGGTCCTTTT CAGAGTTGGA CTTCTAGACT CACCTGTTCT CACTCCCTGT 1680 

TTTAATTCAA CCCAGCCATG CAATGCCAAA TAATAGAATT GCTCCCTACC AGCTGAACAG 1740 

GGAGGAGTCT GTGCAGTTTC TGACACTTGT TGTTGAACAT GGCTAAATAC AATGGGTATC 1800 

GCTGAGACTA AGTTGTAGAA ATTAACAAAT GTGCTGCTTG GTTAAAATGG CTACACTCAT 1860 

CTGACTCATT CTTTATTCTA TTTTAGTTGG TTTGTATCTT GCCTAAGGTG CGTAGTCCAA 192 0 

CTCTTGGTAT TACCCTCCTA ATAGTCATAC TAGTAGTCAT ACTCCCTGGT GTAGTGTATT 1980 

CTCTAAAAGC TTTAAATGTC TGCATGCAGC CAGCCAT CAA ATAGTGAATG GTCTCTCTTT 204 0 

GGCTGGAATT ACAAAACTCA GAGAAATGTG TCAT CAGGAG AACAT CAT AA CCCATGAAGG 2100 

ATAAAAGCCC CAAATGGTGG TAACTGATAA TAGCACTAAT GCTTTAAGAT TTGGTCACAC 2160 
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TCTCACCTAG GTGAGCGCAT TGAGCCAGTG GTGCTAAATG CTACATACTC CAACTGAAAT 2220 
GTTAAGGAAG AAGATAGATC CAATTAAAAA AAATTAAAAC CAATTTAAAA AAAAAAAAGA 2280 
ACACAGGAGA TTCCAGTCTA CTTGAGTTAG CATAATACAG AAGTCCCCTC TACTTTAACT 2340 
TTTACAAAAA AGTAACCTGA ACTAATCTGA TGTTAACCAA TGTATTTATT TCTGTGGTTC 2400 
TGTTTCCTTG TTCCAATTTG ACAAAACCCA CTGTTCTTGT ATTGTATTGC CCAGGGGGAG 2460 
CTATCACTGT ACTTGTAGAG TGGTGCTGCT TTAATTCATA AATCACAAAT AAAAGCCAAT 2520 
TAGCTCTATA ACT 

Seq ID NO: 627 Protein sequence 
Protein Accession #: AAAS9908.1 

1 11 21 31 41 51 

I I I I I I 

MDSFSQDVKT RLLIMIRLLP PFNLSLLMPA SPAWQDDAVI SISQEVASEG NLTECQIYLV 60 
NPNVLHKIRD PLVHPVTDIS SIFNTAVCSN VQWSFSELDF 

Seq ID NO: 628 DNA sequence 
Nucleic Acid Accession #: M18728.1 
Coding sequence: 2370.. 2501 



PCT/US02/12476 



GGAGCTCAAG 
CCTCAGCCCC 
TTCTAACCTT 
ATGT CGCAG A 
GTTACAGCTG 
TAGGAACTCA 
ATGCATCCCT 
T C AT AAAGT C 
TGCCCAAGCC 
CCTTCACCTG 
GCCTCCCGGT 
GCGTCAAAAG 
ACCGCAGTGA 
CCTCAAAGGC 
ACCCACCTGC 
TCTTTATCCC 
CAGCCACTGG 
TCCTCTCAGC 
TATAGCAGCC 
GAATTCTTCT 
CTCTGCTCCT 
ACCCTCAGGC 
GCAAACCATG 
AACAAGACTC 
TGCCTCTTTC 
GGGTAACTTA 
AGATCCTTTA 
AAATGTACAG 
TTTAATTCAA 
GGAGGAGTCT 
GCTGAGACTA 
CTGACTCATT 
CTCTTGGTAT 
CTCTAAAAGC 
GGCTGGAATT 
ATAAAAGCCC 
TCTCACCTAG 
GTTAAGGAAG 
ACACAGGAGA 
TTTACAAAAA 
TGTTTCCTTG 
CTATCACTGT 
TAGCTCTATA 



11 
I 

CTCCTCTACA 
TCCCTGCAGA 
CTGGAACCCA 
GGGGAAGGAG 
GTACAAAGGC 
AC AAGCT AC C 
GCTGATCCAG 
AGATCTTGTG 
CTCCATCTCC 
TGAACCTGAG 
CAGTCCCAGG 
GAACGATGCA 
CCCAGTCACC 
CAATTACCGT 
ACAGTACTCT 
CAACATCACT 
CCTCAATAGG 
TGTGGCCACC 
CTGGTGTATT 
AGCTCCTCCA 
GAAGCCCTAT 
CTGAGGTGTG 
GTGAGAAATT 
CTCATCATGA 
GCTTGGCAGG 
ACAGAGTGTC 
GTGCACCCAG 
TGGTCCTTTT 
CCCAGCCATG 
GTGCAGTTTC 
AGTTGTAGAA 
CTTTATTCTA 
TACCCTCCTA 
TTTAAATGTC 
ACAAAACTCA 
CAAATGGTGG 
GTGAGCGCAT 
AAGATAGATC 
TTCCAGTCTA 
AGTAACCTGA 
TTCCAATTTG 
ACTTGTAGAG 
ACT 



21 
I 

AAGAGGTGGA 
TTGCATGTCC 
CCCACCACTG 
GTTCTTCTAC 
GAAAGAGTGG 
CCAGGGCCCG 
AACGTCACCC 
AATGAAGAAG 
AGCAACAACT 
GTT CAGAAC A 
CTGCAGCTGT 
GGATCCTATG 
CTGAATGTCC 
CCAGGGGAAA 
TGGTTTATCA 
GTGAATAATA 
ACCACAGTCA 
GTCGGCATCA 
TTCGATATTT 
ATCCCATTTT 
ATGCTGGAGA 
TGCCACTCAG 
GACGACTTCA 
TAAGGCTCTT 
ATGATGCTGT 
AGATCTATCT 
TGACTGACAT 
CAGAGTTGGA 
CAATGCCAAA 
TGACACTTGT 
ATTAACAAAT 
TTTTAGTTGG 
AT AGT CAT AC 
TGCATGCAGC 
GAGAAATGTG 
TAACTGATAA 
TGAGCCAGTG 
CAATTAAAAA 
CTTGAGTTAG 
ACTAATCTGA 
ACAAAACCCA 
TGGTGCTGCT 



31 
I 

CAGAGAAGAC 
CCTGGAAGGA 
CCAAGCTCAC 
TCGCCCACAA 
ATGGCAACAG 
CATACAGTGG 
AGAATGACAC 
CAACCGGACA 
CCAACCCCGT 
CAACCTACCT 
CCAATGGCAA 
AATGTGAAAT 
TCTATGGCCC 
ATCTGAACCT 
ATGGGACGTT 
GCGGATCCTA 
CGATGATCAC 
CGATTGGAGT 
CAGGAAGACT 
ATCCCATGGA 
TGGACAACTC 
AGACTTCACC 
CACTATGGAC 
ACCCCCTTTT 
CATTAGTATT 
TGTCAATCCC 
TAGCAGCATC 
CTTCTAGACT 
TAATAGAATT 
TGTTGAACAT 
GTGCTGCTTG 
TTTGTATCTT 
TAGTAGTCAT 
CAGCCAT CAA 
TCATCAGGAG 
TAGCACTAAT 
GTGCTAAATG 
AAATTAAAAC 
CATAATACAG 
TGTTAACCAA 
CTGTTCTTGT 
TTAATTCATA 



41 
I 

AGCAGAGACC 
GGTCCTGCTC 
TATTGAATCC 
CCTGCCCCAG 
TCTAATTGTA 
TCGAGAGACA 
AGGATTCTAT 
GTTCCATGTA 
GGAGGACAAG 
GTGGTGGGTA 
CATGACCCTC 
ACAGAACCCA 
AGATGTCCCC 
CTCCTGCCAC 
CCAGCAATCC 
TATGTGCCAA 
AGTCTCTGGA 
GCTGGCCAGG 
GGCAGATTGG 
ACCACTAAAA 
AATGAAAATT 
TAACTAGAGA 
AGCTTTTCCC 
AATTTGTCCT 
TCACAAGAAG 
AACGTTTTAC 
TTTAACACAG 
CACCTGTTCT 
GCTCCCTACC 
GGCTAAATAC 
GTTAAAATGG 
GCCTAAGGTG 
ACTCCCTGGT 
ATAGTGAATG 
AACATCATAA 
GCTTTAAGAT 
CTACATACTC 
CAATTTAAAA 
AAGTCCCCTC 
TGTATTTATT 
ATTGTATTGC 
AATCACAAAT 



51 
I 

ATGGGACCCC 
ACAGCCTCAC 
ACGCCATTCA 
AATCGTATTG 
GGATATGTAA 
ATATACCCCA 
ACCCTACAAG 
TACCCGGAGC 
GATGCTGTGG 
AATGGTCAGA 
ACTCTACTCA 
GCGAGTGCCA 
ACCATTTCCC 
GCAGCCTCTA 
ACACAAGAGC 
GCCCATAACT 
AGTGCTCCTG 
GTGGCTCTGA 
ACCAGACCCT 
ACAAGGTCTG 
TAAAGGGAAA 
CAGTCAAACT 
AAGATGTCAA 
TGCTTATGCC 
TAGCTTCAGA 
ATAAAATAAG 
CCGTGTGTTC 
CACTCCCTGT 
AGCTGAACAG 
AATGGGTATC 
CTACACTCAT 
CGTAGTCCAA 
GTAGTGTATT 
GTCTCTCTTT 
CCCATGAAGG 
TTGGTCACAC 
CAACTGAAAT 
AAAAAAAAGA 
TACTTTAACT 
TCTGTGGTTC 
CCAGGGGGAG 
AAAAGCCAAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 



Seq ID NO: 629 Protein sequence 
Protein Accession #: AAA59909.1 

1 11 21 31 41 

1 ! i 1 I 

MLTNVFISW LFPCSNLTKP TVLVLYCPGG AITVLVEWCC FNS 



51 



75 
80 
85 



Seq ID NO: 630 DNA sequence 

Nucleic Acid Accession #: NM_01663 9.1 

Coding sequence: 40.. 42 9 



GCGGCGGGCG 
CGGTTGCTGC 
GAGCAAGCGC 
AAGTGCATGG 
GCTGCAGCAC 
CTGACCTTCG 
GAGAAGTTCA 



11 
I 

CAGACAGCGG 
GGCTCCTCGT 
CAGGCACCGC 
ACTGCGCGTC 
CTCCTGCCCC 
TGCTGGGGCT 
CCACCCCCAT 



21 

I 

CGGGCGCAGG 
GCTGGGGCTC 
CCCCTGCTCC 
TTGCAGGGCG 
CTTCCGGCTG 
GCTTTCTGGC 
AGAGGAGACC 



31 

I 

ACGTGCACTA 
TGGCTGGCGT 
CGCGGCAGCT 
CGACCGCACA 
CTTTGGCCCA 
TTTTTGGTCT 
GGCGGAGAGG 



41 
i 

TGGCTCGGGG 
TGCTGCGCTC 
CCTGGAGCGC 
GCGACTTCTG 
TCCTTGGGGG 
GGAGACGATG 
GCTGCCCAGC 



51 
I 

CTCGCTGCGC 
CGTGGCCGGG 
GGACCTGGAC 
CCTGGGCTGC 
CGCTCTGAGC 
CCGCAGGAGA 
TGTGGCGCTG 



60 
120 
180 
240 
300 
360 
420 



428 



WO 02/086443 PCT/US02/12476 

ATCCAGTGAC AATGTGCCCC CTGCCAGCCG GGGCTCGCCC ACTCATCATT CATTCATCCA 480 

TTCTAGAGCC AGTCTCTGCC TCCCAGACGC GGCGGGAGCC AAGCTCCTCC AACCACAAGG 540 

GGGGTGGGGG GCGGTGAATC ACCTCTGAGG CCTGGGCCCA GGGTTCAGGG GAACCTTCCA 600 

AGGTGTCTGG TTGCCCTGCC TCTGGCTCCA GAACAGAAAG GGAGCCTCAC GCTGGCTCAC 660 

5 ACAAAACAGC TGACACTGAC TAAGGAACTG CAGCATTTGC ACAGGGGAGG GGGGTGCCCT 720 

CCTTCCTTAG GACCTGGGGG CCAGGCTGAC TTGGGGGGCA GACTTGACAC TAGGCCCCAC 780 

TCACTCAGAT GTCCTGAAAT TCCACCACGG GGGTCACCCT GGGGGGTTAG GGACCTATTT 840 

TTAACACTAG GGGCTGGCCC ACTAGGAGGG CTGGCCCTAA GATACAGACC CCCCCAACTC 900 

CCCAAAGCGG GGAGGAGATA TTTATTTTGG GGAGAGTTTG GAGGGGAGGG AGAATTTATT 960 
10 AATAAAAGAA TCTTTAACTT TAAAAAAAAA AAAAAAAA 

Seq ID NO: 631 Protein sequence 
Protein Accession #: NP_057723.1 

15 1 11 21 31 41 51 

I I 1 I I I 

MARGSLRRLL RLLVLGLWLA LLRSVAGEQA PGTAPCSRGS SWSADLDKCM DCASCRARPH 60 

SDFCLGCAAA PPAPFRLLWP ILGGALSLTF VLGLLSGFLV WRRCRRREKF TTPIEETGGE 120 
2^ GCPAVALIQ 

Seq ID NO j 632 DNA sequence 

Nucleic Acid Accession #: NM_003816.1 

Coding sequence: 7 9.. 2 53 8 

25 1 11 21 31 41 51 

I I 1 I I I 

CGGCAGGGTT GGAAAATGAT GGAAGAGGCG GAGGTGGAGG CGACCGAGTG CTGAGAGGAA 60 

CCTGCGGAAT CGGCCGAGAT GGGGT CTGGC GCGCGCTTTC CCTCGGGGAC CCTTCGTGTC 120 

CGGTGGTTGC TGTTGCTTGG CCTGGTGGGC CCAGTCCTCG GTGCGGCGCG GCCAGGCTTT 180 

30 CAACAGACCT CACATCTTTC TTCTTATGAA ATTATAACTC CTTGGAGATT AACTAGAGAA 240 

AGAAGAGAAG CCCCTAGGCC CTATTCAAAA CAAGTATCTT ATGTTATTCA GGCTGAAGGA 300 

AAAGAG C ATA TTATTCACTT GGAAAGGAAC AAAGACCTTT TGCCTGAAGA TTTTGTGGTT 360 

TATACTTACA ACAAGGAAGG GACTTTAATC ACTGACCATC CCAATATACA GAATCATTGT 420 

CATTATCGGG GCTATGTGGA GGGAGTTCAT AATTCATCCA TTGCTCTTAG CGACTGTTTT 480 

35 GGACTCAGAG GATTGCTGCA TTTAGAGAAT GCGAGTTATG GGATTGAACC CCTGCAGAAC 540 

AGCTCTCATT TTGAGCACAT CATTTATCGA ATGGATGATG TCTACAAAGA GCCTCTGAAA 600 

TGTGGAGTTT CCAACAAGGA TATAGAGAAA GAAACTGCAA AGGATGAAGA GGAAGAGCCT 660 

CCCAGCATGA CTCAGCTACT TCGAAGAAGA AGAGCTGTCT TGCCACAGAC CCGGTATGTG 72 0 

GAGCTGTTCA TTGTCGTAGA CAAGGAAAGG TATGACATGA TGGGAAGAAA TCAGACTGCT 780 

40 GTGAGAGAAG AGATGATTCT CCTGGCAAAC TACTTGGATA GTATGTATAT TATGTTAAAT 840 

ATTCGAATTG TGCTAGT TGG ACTGGAGATT TGGACCAATG GAAACCTGAT CAACATAGTT 90 0 

GGGGGTGCTG GTGATGTGCT GGGGAACTTC GTGCAGTGGC GGGAAAAGTT TCTTAT CAC A 960 

CGT CGGAGAC ATGACAGTGC ACAGCTAGTT CTAAAGAAAG GTTTTGGTGG AACTGCAGGA 1020 

ATGGCATTTG TGGGAACAGT GTGTTCAAGG AGCCACGCAG GCGGGATTAA TGTGTTTGGA 1080 

45 CAAATCACTG TGGAGACATT TGCTTCCATT GTTGCTCATG AATTGGGTCA TAATCTTGGA 1140 

ATGAAT CACG ATGATGGGAG AGATTGTTCC TGTGGAGCAA AGAGCTGCAT CATGAATTCA 1200 

GGAG CAT CGG GTTCCAGAAA CTTTAGCAGT TGCAGTGCAG AGGACTTTGA GAAGTTAACT 1260 

TTAAATAAAG GAGGAAACTG CCTTCTTAAT ATTCCAAAGC CTGATGAAGC CTATAGTGCT 1320 

CCCTCCTGTG GTAATAAGTT GGTGGACGCT GGGGAAGAGT GTGACTGTGG TACTCCAAAG 13 80 

50 GAATGTGAAT TGGACCCTTG CTGCGAAGGA AGTACCTGTA AGCTTAAATC ATTTGCTGAG 1440 

TGTGCATATG GTGACTGTTG TAAAGACTGT CGGTTCCTTC CAGGAGGTAC TTTATGCCGA 1500 

GGAAAAACCA GTGAGTGTGA TGTTCCAGAG TACTGCAATG GTTCTTCTCA GTTCTGTCAG 1560 

CCAGATGTTT TTATTCAGAA TGGATATCCT TGCCAGAATA ACAAAGCCTA TTGCTACAAC 1620 

GGCATGTGCC AGTATTATGA TGCTCAATGT CAAGT CATCT TTGGCTCAAA AGCCAAGGCT 1680 

55 GCCCCCAAAG ATTGTTTCAT TGAAGTGAAT TCTAAAGGTG ACAGAT TTGG CAATTGTGGT 1740 

TTCTCTGGCA ATGAATACAA GAAGTGTGCC ACTGGGAATG CTTTGTGTGG AAAGCTTCAG 1800 

TGTGAGAATG TACAAGAGAT ACCTGTATTT GGAATTGTGC CTGCTATTAT TCAAACGCCT 1860 

AGTCGAGGCA CCAAATGTTG GGGTGTGGAT TTCCAGCTAG GATCAGATGT TCCAGATCCT 1920 

GGGATGGTTA ACGAAGGCAC AAAATGTGGT GCTGGAAAGA TCTGTAGAAA CTTCCAGTGT 1980 

60 GTAGATGCTT CTGTTCTGAA TTATGACTGT GATGTTCAGA AAAAGTGTCA TGGACATGGG 2 040 

GTATGTAATA GCAATAAGAA TTGTCACTGT GAAAATGGCT GGGCTCCCCC AAATTGTGAG 2100 

ACTAAAGGAT ACGGAGGAAG TGTGGACAGT GGACCTACAT ACAATGAAAT GAATACTGCA 2160 

TTGAGGGACG GACTTCTGGT CTTCTTCTTC CTAATTGTTC CCCTTATTGT CTGTGCTATT 2220 

TTTATCTTCA TCAAGAGGGA TCAACTGTGG AGAAGCTACT TCAGAAAGAA GAG AT CACAA 2280 

65 ACATATGAGT CAGATGG CAA AAATCAAGCA AACCCTTCTA GACAGCCGGG GAGTGTTCCT 2340 

CGACATGTTT CTCCAGTGAC ACCTCCCAGA GAAGTTCCTA TATATGCAAA CAGATTTGCA 2400 

GTACCAACCT ATGCAGCCAA GCAACCTCAG CAGTTCCCAT CAAGGCCACC TCCACCACAA 2460 

CCGAAAGTAT CATCT CAGGG AAACTTAATT CCTGCCCGTC CTGCTCCTGC ACCTCCTTTA 2520 

TATAGTTCCC T CACT TG ATT TTTTTAACCT TCTTTTTGCA AATGTCTTCA GGGAACTGAG 2580 

70 CTAATACTTT TTTTTTTTCT TGATGTTTTC TTGAAAAGCC TTTCTGTTGC AACTATGAAT 2 640 

GAAAACAAAA CACCACAAAA CAGACTTCAC TAACACAGAA AAACAGAAAC TGAGTGTGAG 2700 

AGTTGTGAAA TACAAGGAAA TGCAGTAAAG CCAGGGAATT TACAATAACA TTTCCGTTTC 2760 

CAT CAT T G AA TAAGTCTTAT TCAGTCATCG GTGAGGTTAA TGC ACT AAT C ATGGATTTTT 2 820 

TGAACATGTT ATTGCAGTGA TTCTCAAATT AACTGTATTG GTGTAAGATT TTTGTCATTA 2880 

75 AGTGTTTAAG TGTTATTCTG AATTTTCTAC CTTAGTTATC ATTAATGTAG TTCCTCATTG 2940 

AACATGTGAT AATCTAATAC CTGTGAAAAC TGACTAATCA GCTGCCAATA ATATCTAATA 3000 

TTTTTCATCA TGCACGAATT AATAATCATC ATACTCTAGA ATCTTGTCTG TCACTCACTA 3060 

CATGAATAAG CAAATATTGT CTTCAAAAGA ATGCACAAGA ACCACAATTA AGATGTCATA 3120 

TTATTTTGAA AGTACAAAAT ATACTAAAAG AGTGTGTGTG TATTCACGCA GTTACTCGCT 3180 

80 TCCATTTTTA TGACCTTTCA ACTATAGGTA ATAACTCTTA GAGAAATTAA TTTAATATTA 3240 

GAATTTCTAT TATGAATCAT GTGAAAGCAT GACATTCGTT CACAATAGCA CTATTTTAAA 3300 

TAAATTATAA GCTTTAAGGT ACGAAGTATT T AAT AG AT C T AATCAAATAT GTTGATTCAT 3360 

GGCTATAATA AAGCAGGAGC AATTATAAAA TCTTCAATCA ATTGAACTTT TACAAAACCA 3420 

CTTGAGAATT TCATGAGCAC TTTAAAATCT GAACTTTCAA AGCTTGCTAT TAAATCATTT 3480 

85 AGAATGTTTA CATTTACTAA GGTGTGCTGG GTCATGTAAA ATATTAGACA CTAATATTTT 3540 

CATAGAAATT AGGCTGGAGA AAGAAGGAAG AAATGGTTTT CTTAAATACC TACAAAAAAG 3600 

TTACTGTGGT ATCTATGAGT TATCATCTTA GCTGTGTTAA AAATGAATTT TTACTATGGC 3660 



429 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

AGATATGGTA TGGATCGTAA AATTTTAAGC ACTAAAAATT TTTTCATAAC CTTTCATAAT 
AAAGTTTAAT AATAGGTTTA TTAACTGAAT TTCATTAGTT TTTTAAAAGT GTTTTTGGTT 
TGTGTATATA TACATATACA AATACAACAT TTACAATAAA TAAAATACTT GAAATTCTCA 
AAAAAAAAAA AAAAAAAAAA AAAAA 

Seg ID NO: 633 Protein sequence 
Protein Accession #= NP 003E07.1 



PCT/US02/12476 



3720 
3780 
3840 



MGSGARFPSG 
PYSKQVSYVI 
EGVHNSSIAL 
DIEKETAKDE 
LLANYLDSMY 
AQLVLKKGFG 
RDCSCGAKSC 
LVD AGEECD C 
DVPEYCNGSS 
IEVNSKGDRF 
WGVDFQLGSD 
NCHCENGWAP 
DQLWRSYFRK 
KQPQQFPSRP 



11 
I 

TLRVRWLLLL 
QAEGKEHIIH 
SDCFGLRGLL 
EEEPPSMTQL 
IMLNIRIVIiV 
GTAGMAFVGT 
IMNSGASGSR 
GTPKECELDP 
QFCQPDVFIQ 
GNCGFSGNEY 
VPDPGMVNEG 
PNCETKGYGG 
KRSQTYESDG 
PPPQPKVSSQ 



21 
1 

GLVGPVLGAA 
LERNKDLLPE 
HLENASYGIE 
LRRRRAVLPQ 
GLEIWTNGNli 
VCSRSHAGGI 
NFSSCSAEDF 
CCEGSTCKLK 
NGYPCQNNKA 
KKCATGNALC 
TKCGAGKICR 
SVDSGPTYNE 
KNQANPSRQP 
GNLIPARPAP 



31 
I 

RPGFQQTSHL 
DFWYTYNKE 
PLQNSSHFEH 
TRYVELFIW 
INIVGGAGDV 
NVFGQITVET 
EKLTLNKGGN 
SFAECAYGDC 
YCYNGMCQYY 
GKLQCENVQE 
NFQCVDASVL 
MNTALRDGLL 
GSVPRHVSPV 
APPLYSSLT 



41 

1 

SSYEIITPWR 
GTLITDHPNI 
I I YRMDDVYK 
DKERYDMMGR 
LGNFVQWREK 
FASIVAHELG 
CLLNIPKPDE 
CKDCRFLPGG 
DAQCQVIFGS 
IPVFGIVPAI 
NYDCDVQKKC 
VFFFLIVPLI 
TPPREVPIYA 



51 
I 

LTRERREAPR 
QNHCHYRGYV 
EPLKCGVSNK 
NQTAVREEMI 
FLITRRRHDS 
HNLGMNHDDG 
AYSAPSCGNK 
TLCRGKTSEC 
KAKAAPKDCF 
IQTPSRGTKC 
HGHGVCNSNK 
VCAIFIFIKR 
NRFAVPTYAA 



Seq ID NO: 634 DNA sequence 

Nucleic Acid Accession #: NM_002091.1 

Coding sequence: 56.. 503 



AGTCTCTGCT 
CGGCAGTGAG 
AGCGGTCCCG 
CCACTGGGCG 
TGAGAGAGGG 
GAATTTGCTG 
GGCCTTGGGC 
AGGTTCAAAA 
CCCCCAGCTG 
TAAGAGACTG 
AAATATTTGA 
CTTCTGGTTT 
TTTTTATATC 
TAAAAGCTTA 



11 
I 

CTTCCCAGCC 
CTCCCGCTGG 
CTGCCTGCGG 
GTGGGGCACT 
AGCCTGAAGC 
GGTCTCATAG 
AAT CAGCAGC 
GGCAAAGTTG 
AACCAGCAAT 
AGTTCTGCAA 
CTATTCTGTA 
AAACTTGTTT 
TAGGCTACCT 
AACACAT 



21 

i 

TCTCCGGCGC 
TCCTGCTGGC 
GCGGAGGGAC 
TAATGGGGAA 
AGCAGCTGAG 
AAGCAAAGGA 
CTTCGTGGGA 
GTAGACTCTC 
GATAATGATG 
GCATCAGTTC 
TCTTTCATCC 
GCTGTGAACA 
GTTGGTTAGA 



31 
I 

GCTCCAAGGG 
GCTGGTCCTC 
CGTGCTGACC 
AAAGAGCACA 
AGAGTACATC 
GAACAGAAAC 
TTCAGAGGAT 
TGCTCCAGGT 
GCCTCTCTCA 
TACGGATCAT 
TTGACTAAAT 
ATTGTCGAAA 
TTCAAGGCCC 



41 
1 

CTTCCCGTCG 
TGCCTAGCGC 
AAGATGTACC 
GGGGAGTCTT 
AGGTGGGAAG 
CACCAGCCAC 
AGCAGCAACT 
TCTCAACGTG 
AAAGAGAAAA 
CAACAAGATT 
TCGTGATTTT 
AGAGTCTTCC 
CGAGCTGTTA 



Seq ID NO: 635 Protein sequence 
Protein Accession #: NP 002082.1 



11 



21 



31 

! 



41 



51 
I 

GGACCATGCG 
CCCGGGGGCG 
CGCGCGGCAA 
CTTCTGTTTC 
AAGCTGCAAG 
CTCAACCCAA 
TCAAAGATGT 
AAGGAAGGAA 
ACAAAACCCC 
TCCTTGTGCA 
CAAGCAGCAT 
AATTAATGCT 
CCATTCACAA 



51 



MRGSELPLVL LALVLCLAPR GRAVPLPAGG GTVLTKMYPR GNHWAVGHLM GKKSTGESSS 
VSERGSLKQQ LREYIRWEEA ARNLLGLIEA KENRNHQPPQ PKALGNQQPS WDSEDSSNFK 
DVGSKGKVGR LSAPGSQREG RNPQLNQQ 

Seq ID NO: 636 DNA sequence 

Nucleic Acid Accession #: NM_016522.1 

Coding sequence: 265.. 1299 



GCGGAAGCAG 
CTGGCAAAAG 
TTTTCTCCTC 
CCGCACCCCA 
TCGGGGAAGT 
TGCCTCGTGG 
AGCGGAGATG 
GCCACCCTCA 
ACCATCCTCT 
AACACCCAAA 
TACACCTGCT 
CAAGTATCTC 
ATTAGCCTCA 
TCTCCCAAAG 
CGGGAACAGT 
CGGAGAGTAA 
GTCCCCGTGG 
TTCCAGTGGT 
AACAGACCTT 
TACACTTGCG 
CCAGGCGCCG 
CTGCCTCTTC 
CGGGAAAGGC 
CCAAT CAGAT 
GGGAGGGGAA 
CCTTGCAGAT 



11 
I 

CGAGGAGGGA 
CCGAGGCTGG 
CCCGCGCCTC 
CCCACTTCCT 
TGTGGCTGTC 
TCGTGTCTCT 
CCACCTTCCC 
GGTGCACTAT 
ATGCTGGGAA 
CGCAGTACAG 
CGGTG CAG AC 
CCAAAATTGT 
CCTGCATAGC 
CGGTTGGCTT 
CAGGGGACTA 
AGGTCACCGT 
GACAAAAGGG 
ACAAGGATGA 
TCCTCTCAAA 
TGGCCTCCAA 
TCAGCGAGGT 
TGGTCTTGCA 
TGCCGCCACC 
ATATACAAAT 
CAAAGAATAC 
ATTTAGGTAC 



21 
I 

GCCCCCTTTG 
ATTTGGGGGA 
CCGGTCGCCG 
GTGCTCGCCC 
GAGAATGGGG 
CAGGCTGCTG 
CAAAGCTATG 
TGACAACCGG 
TGACAAGTGG 
CATCGAGATC 
AGACAACCAC 
AGAGATTTCT 
AACTGGTAGA 
TGTGAGTGAA 
CGAGTGCAGT 
GAACTATCCA 
GACACTGCAG 
CAAAAGACTG 
ACTCATCTTC 
CAAGCTGGGC 
GAGCAACGGC 
CCTGCTTCTC 
ACCACCACCA 
GAAATTAGAA 
TTTGGGGGGA 
AATGGAGTTT 



31 

I 

GCCGTCCTCC 
GGAATATTAG 
CGGGTTCACC 
GGGGGGCGTG 
GTCTGTGGGT 
TTCCTTGTAC 
GACAACGTGA 
GTCACCCGGG 
TGCCTGGATC 
CAGAACGTGG 
CCAAAGACCT 
TCAGATATCT 
CCAGAGCCTA 
GACGAATACT 
GCCTCCAATG 
CCATACATTT 
TGTGAAGCCT 
ATTGAAGGAA 
TTCAATGTCT 
CACACCAATG 
ACGTCGAGGA 
AAATTTTGAT 
ACACAACAGC 
GAAACACAGC 
AAAGAGTTTT 
TCTTTTCCCA 



41 
I 

GTGGAACCGG 
ACTCGGAGGA 
GCTCAGTCCC 
TGCCGTGCGG 
ACCTGTTCCT 
CCACAGGAGT 
CGGTCCGGCA 
TGGCCTGGCT 
CTCGCGTGGT 
ATGTGTATGA 
CTAGGGTCCA 
CCATTAATGA 
CGGTTACTTG 
TGGAAATTCA 
ACGTGGCCGC 
CAGAAGCCAA 
CAGCAGTCCC 
AGAAAGGGGT 
CTGAACATGA 
CCAGCATCAT 
GGGCAGGCTG 
GTGAGTGCCA 
AATGGCAACA 
CTCATGGGAC 
AAAAAAGAAA 
AACGGGAAGA 



51 
I 

TTTTCCGAGG 
GTCTGCGCGC 
CGCGCTCGCT 
CTGCCGGAGT 
GCCCTGGAAG 
GCCCGTGCGC 
GGGGGAGAGC 
AAACCGCAGC 
CCTTCTGAGC 
CGAGGGCCCT 
CCTCATTGTG 
AGGGAACAAT 
GAGACACATC 
GGGCATCACC 
GCCCGTGGTA 
GGGTACAGGT 
CTCAGCAGAA 
GAAAGTGGAA 
CTATGGGAAC 
GCTATTTGGT 
CGTCTGGCTG 
CTTCCCCACC 
CCGACAGCAA 
AGAAATTTGA 
TTGAAAATTG 
ACACAGCACA 



60 
12 0 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



430 



WO 02/086443 

CCCGGCTTGG ACCCACTGCA AGCTGCATCG TGCAACCTCT TTGGTGCCAG TGTGGGCAAG 1620 

GGCTCAGCCT CTCTGCCCAC AGACTGCCCC CACGTGGAAC ATT CTGGAGC TGGCCATCCC 1680 

AAATTCAATC AGTCCATAGA GACGAACAGA ATGAGACCTT CCGGCCCAAG CGTGGCGCTT 1740 

CCGGCCCAAG CGTGGCGCTG CGGGCACTTT GGTAGACTGT GCCACCACGG CGTGTGTTGT 1800 
GAAACGTGAA ATAAAAAGAG CAAAAAAAAA AAAAAAAAA 
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Seq ID NO: 637 Protein sequence 
Protein Accession #: NP_057606.1 



10 



15 



MGVCGYLFLP 
NRVTRVAWLN 
NHPKTSRVHL 
SEDEYLEIQG 
LQCEASAVPS 
LGHTNASIML 



11 
I 

WKCLWVSLR 
RSTILYAGND 
IVQVSPKIVE 
ITREQSGDYE 
AEFQWYKDDK 
FGPGAVSEVS 



21 
I 

LLFLVPTGVP 
KWCLDPRWL 
ISSDISINEG 
CSASNDVAAP 
RLIEGKKGVK 
NGTSRRAGCV 



31 

I 

VRSGDATFPK 
LSNTQTQYSI 
NNI SI/TCI AT 
WRRVKVTVN 
VENRPFLSKL 
WLLPLLVLHL 



41 
I 

AMDNVTVRQG 
EIQNVDVYDE 
GRPEPTVTWR 
YPPYISEAKG 
IFFNVSEHDY 
LLKF 



51 
I 

ESATLRCTID 
GPYTCSVQTD 
HISPKAVGFV 
TGVPVGQKGT 
GNYT CVASNK 



60 
120 
180 
240 
300 



Seq ID NO: 638 DNA sequence 
20 Nucleic Acid Accession #: NM_0122 61.1 
Coding sequence: 203.. 1045 



25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



GATTTGCTCT 
ACAGAATACG 
CACTCCAGCG 
CCTCATTCGG 
ACTTCGAGTT 
GGAAAATCTC 
TGGGACGACG 
GGCCAGCAAC 
TGAGGTGAAG 
CGCATATGCA 
GGCGACTTGG 
CAAAGACGCA 
CACCCCCGCT 
TGATCCGCAG 
TATCTCAGAT 
GGAAGAAACC 
CGCGATTTAC 
ATCCCAGTAT 
CCAACTGGAT 
CATAGCTACA 
AACCCACGGA 
ATGCTGGGGA 
TGACTCTCCA 
TTGAAAACAT 
TGCTCCCTTG 
TCATGCTCCC 
GTTTAGTGAT 
AAAACGACTA 
GGGGGACCTG 
TTCTCTGGC 



11 

I 

GCCAGCAGCT 
CGCTCCCTCC 
GCGACTTTGA 
GGCACTGCGA 
CTCCTGATGT 
TCAGGCCTTT 
TGTCTCATGG 
TACGTAGATC 
GGCCGCTGTG 
CTCAAAATGC 
AGGCTGAGCA 
GTCAGTGCTG 
GGGAAGTCCT 
AAGACGGTCA 
TTTGTCTTCA 
TTGCCCCTGA 
CACGTCCACC 
AAG CACATGG 
CAGGTAGAAC 
AT CAAACAGG 
AGGGGGAGAC 
GGAGGGGAGG 
AAGAGCAATA 
GCTTCTTTGA 
GACACAGCTG 
TGCAGCAAGA 
TGTCTTGGGA 
ATGTAACTAT 
AAGAATCAAT 



21 
I 

GTCGGTGCCG 
CTCCCCCTTC 
GGGATTCCCT 
GTATGG AT CT 
TGTTCCATAC 
CCACTAACCC 
CAGAGTTTGC 
TGATCACAGA 
GCCACAGCCA 
TCTTTGTAAA 
AAGTGCAGTT 
GGAAGCACAC 
ATGAGTGTCA 
CCATGATCCT 
GTGAAGAGCA 
TTTTGGGGCT 
ACAAAATGAC 
GCTAGAGGCC 
AACAAAAGCA 
CCTGGGTATC 
TCTTTCGGAT 
AGGGTCTCAG 
AATGCCACTT 
GGAGGAAACC 
GCTTATCCTA 
CCCCTGAAAG 
ATGTTTCACT 
GCAGAGTTGT 
CTGTGTGAGT 



31 

1 

CGCTCGACAC 
TCTGTCCCCC 
CTCTGGCGGC 
CCAAGGAAGA 
AATGGCTCAA 
TGAAAAAGAT 
AGCCAAATTT 
ACAGGCCGAT 
GTCGGAGCTG 
GGAAAGCCAC 
TGTCTACGAC 
AGCCAACTCG 
AGCTCAACAA 
GTCTGCGGTC 
TAAATGCCCA 
CATCTTGGGC 
TGCCAACCAG 
GTTAGGCAGG 
CTTTTCCATC 
TGAGGCTTGC 
TTGTAGGGTG 
ACAGCTTTCG 
GGAGCTGTAT 
CCTTTAGGTT 
TACAGTTGTC 
TGATTCATGC 
GCTACCCGCA 
TTGGACTTCT 
CTGTTTTTCA 



41 

1 

CGAGTCCTAG 
GCCTCTCGCT 
CTCTGCAGCA 
GGGGTCCCCA 
AT CATGGCAG 
ATATTTGTGG 
ATTGTACCTT 
ATCGCATTGA 
CAAGTGTTCT 
AACATGTCCA 
TCCTCGGAGA 
CACCACCTCT 
ACCATTTCAC 
CACATCCAAC 
GTGGATGAGC 
CTCGTCATCA 
GTGCAGATCC 
CACCCCCTAT 
TTGTACACGA 
TTGGCTTGTG 
AAATGGCAAT 
TGCTCATGGT 
CTGGCCCCAA 
CAGAAGAATA 
AATGCACACA 
TTCTGGCTGG 
TCCAGCGACT 
TCCTGTGCCA 
AAATGAAATA 



51 
I 

CTAGGCGCTC 
CACCCCGGCC 
GCACAGCCGG 
GCATCGACAG 
AACAAGAAGT 
TGCGGGAAAA 
ATGATGTGTG 
CCCGGGGAGC 
GGGTGGATCG 
AGGGACCTGA 
AAACCCACTT 
CTGCCTTGGT 
TGGCCTCTAG 
CTTTTGACAT 
GGGAGCAACT 
TGGTAACACT 
CTCGGGACAG 
TCCTGCTCCC 
GATACACCAA 
TCCATGCTTA 
TATTCTCTCC 
GGCTTGGCTT 
AGTTTAGGGA 
TGGGGTGCTT 
GAATACAACC 
CATTCTGCAT 
G GAG CACCAG 
GGTCCAAGTC 
AAACACACTA 



Seq ID NO: 639 Protein sequence 
Protein Accession #: NP_036393.1 

1 11 21 31 41 51 

1 I I 1 I 1 

MDLQGRGVPS IDRLRVLLML FHTMAQIMAE QEVENLSGLS TNPEKDIFW RENGTTCLMA 
EFAAKFIVPY DVWASNYVDL ITEQADIALT RGAEVKGRCG HSQSELQVFW VDRAYALKML 
FVKESHNMSK GPEATWRLSK VQFVYDSSEK THFKDAVSAG KHTANSHHLS ALVTPAGKSY 
ECQAQQTISL ASSDPQKTVT MILSAVHIQP FDIISDFVFS EEHKCPVDER EQLEETLPLI 
LGLILGLVIM VTLAI YHVHH KMTANQVQIP RDRSQYKHMG 

Seq ID NO: 640 DNA sequence 

Nucleic Acid Accession #: NM_002 993.1 

Coding sequence: 64.. 408 



GGCACGAGCC 
ACTATGAGCC 
GCGCTGCTCG 
GTCTCTGCTG 
CCCAAAACGA 
GTGGTAGCCT 
AAGAAAGTCA 
ACCATGCATC 
CAGTAAGAAT 
GAAGAGTGTG 
CTAATATAGT 
CAATTGACCA 
TGAAGATAAC 
ATTTCGTATG 
ACTCACTCTT 



11 

I 

AGTCTCCGCG 
TCCCGTCCAG 
CGCTGCTGCT 
TGCTGACAGA 
TTGGTAAACT 
CCCTGAAGAA 
TCCAGAAAAT 
ATAAAATTGC 
AAGAAGGAAG 
GGGGAAAGCC 
ATTTCCACTA 
TATTGTGAGC 
TATTGTATTT 
GAAATAATGT 
CTCATAAAAT 



21 
I 

CCTCCACCCA 
CCGCGCGGCC 
CCTGCTGACG 
GCTGCGTTGC 
GCAGGTGTTC 
CGGGAAGCAA 
TTTGGACAGT 
CCAGTCTTCA 
GGTTGGTTTT 
TACGCTT CTC 
TTTACTGTTA 
AAAGAATCAC 
CTAT CATACA 
TTTATTAGTG 
AGGAAATATT 



31 
I 

GCTCAGGAAC 
CGTGTCCCGG 
CCGCCGGGGC 
ACTTGTTTAC 
CCCGCAGGCC 
GTTTGTCTGG 
GGAAACAAGA 
GCGGAGCAGT 
TTTCCATTTT 
CCTGAAGTTT 
TTTTACCTGA 
TGGTTATTAG 
TTCCTTAAAG 
TGCTGTTGAG 
TTAGTTCTGT 



41 
I 

CCGCGAACCC 
GTCCTTCGGG 
CCCTCGCCAG 
GCGTTACGCT 
CGCAGTGCTC 
ACCCGGAAGC 
AAAACTGAGT 
TTTCTGGAGA 
CTACATGGAT 
ACAGCTCAGC 
TAAGTTATTG 
TCTTTCAATG 
TCTTACCGAA 
GGAGGTATCC 
TTTCTTGGGG 



51 
I 

TCTCTTGACC 
CTCCTTGTGC 
CGCTGGTCCT 
GAGAGTAAAC 
CAAGGTGGAA 
CCCTTTTCTA 
AACAAAAAAG 
TCCCTGGACC 
TCCCTACTTT 
TAATGAAGTA 
AACCCTTTGG 
AATATTGAAT 
AAGGCTGTGG 
TGTTGTTCTT 
AATATGTTAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



60 
120 
180 
240 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



431 



10 



WO 02/086443 

TCTTTACCCT AGGATGCTAT TTAAGTTGTA CTGTATTAGA ACACTGGGTG TGTCATACCG 960 

TTATCTGTGC AGAATATATT TCCTTATTCA GAATTTCTAA AAATTTAAGT TCTGTAAGGG 1020 

CTAATATATT CTCTTCCTAT GGTTTTAGAT GTTTGATGTC TTCTTAGTAT GGCATAATGT 1080 

CATGATTTAC TCATTAAACT TTGATTTTGT ATGCTATTTT TTCACTATAG GATGACTATA 1140 

ATTCTGGTCA CTAAATATAC ACTTTAGATA GATGAAGAAG CCCAAAAACA GATAAATTCC 1200 

TGATTGCTAA TTTACATAGA AATGTATTCT CTTGGTTTTT TAAATAAAAG CAAAATTAAC 1260 

AATGATCTGT GCTCTGCAAA GTTTTGAAAA TATATTTGAA CAATTTGAAT ATAAATTCAT 1320 

CATTTAGTCC XCAAAATATA TACAGCATTG CTAAGATTTT CAGATATCTA TTGTGGATCT 13 80 

TTTAAAGGTT TTGACCATTT TGTTATGAGG AATTATACAT GTATCACATT CACTATATTA 1440 

AAATTGCACT TTTATTTTTT CCTGTGTGTC ATGTTGGTTT TTGGTACTTG TATTGTCATT 1500 
TGGAGAAACA ATAAAAGATT TCTAAACCAA AAAAAAAAAA AAAAAAA 



PCT/US02/12476 



15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



Seq ID NO: 641 Protein sequence 
Protein Accession #: NP__002984.1 

1 11 21 31 41 51 

I I I I I I 

MSLPSSRAAR VPGPSGSLCA LLALLLLLTP PGPLASAGPV SAVLTELRCT CLRVTLRVNP 
KTIGKLQVFP AGPQCSKVEV VASLKNGKQV CLDPEAPFLK KVIQKILDSG NKKN 

Seq ID NO: 642 DNA sequence 

Nucleic Acid Accession #: NM_013271.1 

Coding sequence: 2 7.. 80 9 



TCCGGAGCCA 
CCGGGGGCGT 
TCTGCGCGCG 
AGACTGGCGC 
TGCAGGAGCT 
GGGCCGAGGC 
TCTGGGGCGC 
CTGCAGCGCA 
CCCAGCTTGT 
ACGACGGCCC 
CCGAGCTGTT 
TGGCAGCCCC 
CTGAGGGCGT 
TGCCTGCACG 
CAGAAGTGCC 
TTACCCCGGC 
GATCTGAGC 



11 

I 

GGCTCGCTGG 
CGGCCTTTTG 
GCCGGTAAAG 
TCCTCGCCGC 
GGCGCGGGCG 
GCAGGAGGCT 
CCCCCGCAAC 
GCTCGCTCGC 
CCCCGCGCCC 
CGCGGGCCCG 
GAGGTACTTG 
GCGCCGCCTC 
GCTGGGGGCG 
CCGCCTCTTG 
CCCGCCATCC 
CAGCCAGCCC 



21 

I 

GGCAGCATGG 
GTGCTGCTGC 
GAACCCCGCG 
TTCCGGCGGT 
CTGGCGCATC 
GAGGATCAGC 
TCTGATCCGG 
GCTCTGCTCC 
GTCCCCGCCG 
GATGCTGAGG 
CTGGGACGGA 
CGCCGTGCCG 
CTGCTGCGTG 
CCACCCTGAG 
CGCCACCAGG 
TCTCACCCGA 



31 
I 

CGGGGTCGCC 
TGCTCGGCCT 
GCCTAAGCGC 
CAGTGCCCCG 
TGCTGGAGGC 
AGGCGCGCGT 
CTCTGGGCCT 
GCGCCCGCCT 
CGGCGCTCCG 
AGGCAGGCGA 
TTCTTGCGGG 
CCGACCACGA 
TGAAACGCCT 
CACTGCCCGG 
ACTTCTCCCC 
GGATCCCTAC 



41 
I 

GCTGCTCTGG 
GTTTCGGCCG 
AGCGTCTCCG 
AGGTGAGGCG 
CGAACGTCAG 
CCTGGCGCAG 
GGACGACGAC 
TGACCCTGCC 
ACCCCGGCCC 
CGAGACACCC 
AAGCGCGGAC 
TGTGGGCTCT 
AGAGACCCCG 
ATCCCGTGCA 
GCCAGCACGT 
CCCCTGGCCC 



51 
I 

GGGCCGCGGG 
CCCCCCGCGC 
CCCTTGGCTG 
GCGGGGGCGG 
GAGCGGGCGC 
CTGCTGCGCG 
CCCGACGCGC 
GCCCTAGCAG 
CCGGTCTACG 
GACGTGGACC 
TCCGAGGGGG 
GAGCTGCCCC 
GCGCCCCAGG 
CCCTGGGACC 
CCAGAGCAAC 
ACAATAACAT 



Seq ID NO: 643 Protein sequence 
Protein Accession #: NP_037403.1 



11 



21 



31 41 51 

11,111 

MAGSPLLWGP RAGGVGLLVL LLLGLFRPPP ALCARPVKEP RGLSAASPPL AETGAPRRFR 
RSVPRGEAAG AVQELARALA HLLEAERQER ARAEAQEAED QQARVLAQLL RVWGAPRNSD 
PALGLDDDPD APAAQLARAL IiRARLD P AAL AAQLVPAPVP AAALRPRPPV YDDGPAGPDA 
EEAGDETPDV DPELLRYLLG RILAGSADSE GVAAPRRLRR AADHDVGSEL PPEGVLGALL 
RVKRLETPAP QVPARRLLPP 

Seq ID NO: 644 DNA sequence 
Nucleic Acid Accession #: NM_002214 
Coding sequence: 681.. 2990 



CCCAGAGCCG 
CTGCCGACTT 
GTTGGCCTCC 
TCCCCTCGAC 
TAGGGTGGTT 
CTAAGCTGAT 
TGTCCCGGAG 
TGGCCGTCGA 
GGCCGTAGGG 
CCGAGCCGCG 
GGCCCCGAGG 
GGGGCGGGCT 
TCTGCCTGCA 
CACTTGTTCT 
CCTGTGCCAG 
TTTCAGGTGG 
GCTCAGTTGA 
TTAATACCCA 
ATTTTATGCT 
ATGTCTCAGC 
CTAGAAAAAT 
AAACAGTTTC 
ACAATTTAGA 
TCACTGAGTT 
AAGGAGGTTT 
AAGAGGCTAA 



11 

! 

CCTCCCCCTG 
GTCTTTGCCC 
CTGCCCACCT 
CTCGCCGGCG 
TCCCCCCCAG 
TTATGCAGCA 
CAGGCTGCGG 
AGGAGGTGCT 
GCCCTGAGAT 
GGGTCCGCCT 
TCGCCCGGGA 
GTTTTGCATT 
AAACGACCGG 
TGGACTGGGC 
GTGCCTTGCG 
ATCAAGAAGT 
TTCAATAGAA 
GGTGACACCA 
GAAAGTTCAT 
ATCAATGCAC 
GGCATTTTTC 
ACCATACATT 
CTGCATGCCT 
TGAGAAAGCA 
TGACGCCATG 
AAGATTGCTG 



21 
I 

TTGCTGGCAT 
GCTGCTCCGC 
GTGGAAGCAA 
TACCCTCCCA 
CTTCGGGCTT 
GAAGCCCCAC 
AGCCCTTGCA 
TCTCGCGGAG 
GCCGAGCGGT 
GCTAGG CCTG 
GGCCGAGCCC 
ATGTGCGGCT 
CGAGGTCCCG 
CAAGGTGAAG 
CTGGGTCCAG 
GAACGTTGTG 
TACCCATCTG 
GGAGAAGTGT 
CCTCTGAAGA 
AATAATATAG 
TCCCGTGACT 
AGCATCCACC 
CCCCATGGAT 
GTTCATAGAC 
CTTCAGGCAG 
CTGGTGATGA 



31 
I 

CCCGAGCTTC 
AGACGGGGCT 
CTGCGCTGAT 
CAGATCCAGC 
TGTTTGGGTT 
CGGCTGGAGA 
GAGCCCTCTC 
ACCGCGGGAC 
GCCCGGGCCC 
CGGAAAACGT 
GCGTCCGGAA 
CGGCCCTGGC 
CCTCGTTCCT 
ACAATAGATG 
AATGTGGATG 
ATATTGTTTC 
TGCATGTTAT 
CTATCCAGCT 
AATATCCTGT 
AAAAATTAAA 
TTCGTCTTGG 
CCGAAAGGAT 
ACATCCATGT 
AGAAGAT CTC 
CTGTCTGTGA 
CAGATCAGAC 



41 

I 

CTCCCTTGCC 
GCAAAGCTGC 
TGATGCGCCA 
ATCACCCAGT 
TGATTGTGTT 
GAAACAAAAG 
TCCAGTCGCC 
CCGCCGTGCC 
GCTTACCTGC 
CCTAGCGACA 
GGCAGCCAGG 
TTTTTTTACC 
CTGGGCAGCC 
TGCATCTTCA 
GTGTGTTCAA 
CAATTTAATA 
AATACCCACT 
GCGTCCAGGA 
GGATCTTTAT 
TTCCGTTGGA 
ATTTGGCTCA 
TCATAATCAA 
GCTGTCTTTG 
TGGAAACATA 
AAGTCATATC 
GTCTCATCTC 



51 
I 

AGCCAGGACG 
AACTAATGGT 
CAGACTTTTT 
GAATGTACAT 
TGGCTCTTCG 
CTCTTTTCTT 
GCCGGGCCCT 
GAGCCGGGAG 
ACCGCTTGCT 
CTCGCCCGCG 
CGGCGGGCGC 
GCTGCATTTG 
TGGGTGTTTT 
AATGCAGCAT 
GAGGATTTCA 
AGCAAAGGCT 
GAAAATGAAA 
GCCGAAGCTA 
TATCTTGTTG 
AACGATTTAT 
TACGTTGATA 
TGCAGTGACT 
ACAGAGAACA 
GATACACCAG 
GGATGGCGAA 
GCTCTTGATA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



60 
120 
180 
240 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



432 



WO 02/086443 

GCAAATTGGC AGGCATAGTG GTGCCCAATG ACGGAAACTG T CAT CTG AAA AACAACGTCT 162 0 

ACGTCAAATC GACAACCATG GAACACCCCT CACTAGGCCA ACTTTCAGAG AAATTAATAG 1680 

ACAACAACAT TAATGT CATC TTTGCAGTTC AAGGAAAACA ATTTCATTGG TATAAGGATC 1740 

TTCTACCCCT CTTGCCAGGC ACCATTGCTG GTGAAATAGA ATCAAAGGCT GCAAACCTCA - 1800 

ATAATTTGGT AGTGGAAGCC TATCAGAAGC TCATTTCAGA AGTGAAAGTT CAGGTGGAAA 18 60 

ACCAGGTACA AGGCATCTAT TTTAACATTA CCGCCATCTG TCCAGATGGG TCCAGAAAGC 192 0 

CAGGCATGGA AGGATGCAGA AACGTGACGA GCAATGATGA AGTTCTTTTC AATGTAACAG 1980 

TTACAATGAA AAAATGTGAT GTCACAGGAG GAAAAAACTA TGCAATAATC AAACCTATTG 2040 

GTTTTAATGA AACCGCTAAA ATTCATATAC ACAGAAACTG CAGCTGTCAG TGTGAGGACA 2100 

ACAGAGGACC TAAAGGAAAG TGTGTAGATG AAACTTTTCT AGATTCCAAG TGTTTCCAGT 2160 

GTGATGAGAA TAAATGTCAT TTTGATGAAG ATCAGTTTTC TTCTGAGAGT TGCAAGTCAC 2220 

ACAAGGATCA GCCTGTTTGC AGTGGTCGAG GAGTTTGTGT TTGTGGGAAA TGTTCATGTC 2280 

ACAAAATTAA GCTTGGAAAA GTGTATGGAA AATACTGTGA AAAGGATGAC TTTTCTTGTC 23 40 

CATATCACCA TGGAAATCTG TGTGCTGGGC ATGGAGAGTG TGAAGCAGGC AGATGCCAAT 2400 

GCTTCAGTGG CTGGGAAGGT GATCGATGCC AGTGCCCTTC AGCAGCAGCC CAGCACTGTG 2460 

TCAATTCAAA GGGCCAAGTG TGCAGTGGAA GAGGCACGTG TGTGTGTGGA AGGTGTGAGT 2520 

GCACCGATCC CAGGAGCATC GGCCGCTTCT GTGAACACTG CCCCACCTGT TATACAGCCT 2 580 

GCAAGGAAAA CTGGAATTGT ATGCAATGCC TTCACCCTCA CAATTTGTCT CAGGCTATAC 2640 

TTGATCAGTG CAAAACCTCA TGTGCTCTCA TGGAACAACA GCATTATGTC GACCAAACTT 2700 

CAGAATGTTT CTCCAGCCCA AGCTACTTGA GAATATTTTT CATCATTTTC ATAGTTACAT 2 760 

TCTTGATTGG GTTGCTTAAA GTCCTGATCA TTAGACAGGT GATACTACAA TGGAATAGTA 2 820 

ATAAAATTAA GTCCTGATCA GATTACAGAG TGTCAGCCTC AAAAAAGGAT AAGTTGATTC 2880 

TGCAAAGTGT TTGCACAAGA GCAGTCACCT ACCGACGTGA GAAGCCTGAA GAAATAAAAA 2 940 

TGGATATCAG CAAATTAAAT GCTCATGAAA CTTTCAGGTG CAACTTCTAA AAAAAGATTT 3 000 

TTAAACACTT AATGGGAAAC TGGAATTGTT AATAATTGCT CCTAAAGATT ATAATTTTAA 3060 

AAGT CACAGG AGGAGACAAA TTGCTCACGG TCATGCCAGT TGCTGGTTGT ACACTCGAAC 312 0 

GAAGACTGAC AAGT AT CCTC AT CATG ATGT GACTCACATA GCTGCTGACT TTTT CAGAGA 3180 

AAAATGTGT C TTACTACTGT TTGAGACTAG TGTCGTTGTA GCACTTTACT GTAATATATA 3240 

ACTTATTTAG AT C AGCAT AG AATGTAGATC CTCTGAAGAG CACTGATTAC ACTTTACAGG 33 00 

TACCTGTTAT CCCTACGCTT CCCAGAGAGA ACAATGCTGT GAGAGAGTTT AGCAT TGTGT 3360 

CACTACAAGG GTACAGTAAT CCCTGCACTG GACATGTGAG GAAAAAAATA ATCTGGCAAG 3420 

TATATTCTAA GGTTGCCAAA CACTTCAACA GTTGGTGGTT GAATAGACAA GAACAGCTAG 3480 

ATGAATAAAT GATTCGTGTT TCACTCTTTC AAGAGGTGAA CAGATACAAC CTTAATCTTA 3540 

AAAGATTATT GCTTTTTAAA GTGTGTAGTT TTATGCATGT GTGTTTATGG TTTGCTTATT 3600 

TTTGCAAGAT GGATACTAAT TCCAGCATTC TCTCCTCTTT GCCTTTATGT TTTGTTTTCT 3660 

TTTTTACAGG ATAAGTTTAT GTATGTCACA GATGACTGGA TTAATTAAGT GCTAAGTTAC 3720 

TACTGCCATA AAAAACTAAT AATACAATGT CACTTTATCA GAATACTAGT TTTAAAAGCT 3780 
GAATGTTAA 

Seq ID NO: 645 Protein sequence 
Protein Accession #: NP_002205 

1 11 21 31 41 51 

I I I I I I 

MCGSALAFFT AAFVCLQNDR RGP AS FLWAA WVFSLVLGLG QGEDNRCASS NAASCARCLA 60 

LGPECGWCVQ EDFISGGSRS ERCDIVSNLI SKGCSVDSIE YPSVHVIIPT ENEINTQVTP 120 

GEVSIQLRPG AEANFMLKVH PLKKYPVDLY YLVDVSASMH NNIEKLNSVG NDLSRKMAFF 180 

SRD FRLGFGS YVDKTVSPYI SIHPERIHNQ CSDYNLDCMP PHGY I HVLSL TENITEFEKA 240 

VHRQKISGNI DTPEGGFDAM LQAAVCESHI GWRKEAKRLL LVMTDQTSHL ALDSKLAGIV 300 

VPNDGNCHLK NNVYVKSTTM EHPSLGQLSE KLIDNNINVI FAVQGKQFHW YKDLLPLIiPG 3 60 

TIAGEIESKA ANLNNLWEA YQKLISEVKV QVENQVQGIY FNITAICPDG SRKPGMEGCR 420 

NVTSNDEVLF NVTVTMKKCD VTGGKNYAI I KPIGFNETAK IHIHRNCSCQ CEDNRGPKGK 480 

CVDETFLDSK CFQCDENKCH FDEDQFSSES CKSHKDQPVC SGRGVCVCGK CSCHKIKLGK 540 

VYGKYCEKDD FSCPYHHGNL CAGHGECEAG RCQCFSGWEG DRCQCPSAAA QHCVNSKGQV 600 

CSGRGTCVCG RCECTDPRSI GRFCEHCPTC YTACKENWNC MQCLHPHNLS QAILDQCKTS 660 

CALMEQQHYV DQTSECFSSP SYLRIFFIIF IVTFLIGLLK VLIIRQVILQ WNSNKIKSSS 720 
DYRVSASKKD KLILQSVCTR AVTYRREKPE EIKMDISKLN AHETFRCNF 

Seq ID NO: 646 DNA sequence 

Nucleic Acid Accession #: NM_003318.1 

Coding sequence: 1..2574 

1 11 21 31 41 51 

I I I I I I 

ATGGAATCCG AGGATTTAAG TGGCAGAGAA TTGACAATTG ATTCCATAAT GAACAAAGTG 60 

AGAGACATTA AAAATAAGTT TAAAAATGAA GACCTTACTG ATGAACTAAG CTTGAATAAA 120 

ATTTCTGCTG ATACTACAGA TAACTCGGGA ACTGTTAACC AAATTATGAT GATGGCAAAC 180 

AACCCAGAGG ACTGGTTGAG TTTGTTGCTC AAACTAGAGA AAAACAGTGT TCCGCTAAGT 240 

GATGCTCTTT TAAATAAATT GATTGGTCGT TACAGTCAAG CAATTGAAGC GCTTCCCCCA 300 

GATAAATATG G CCAAAATG A GAGTTTTGCT AGAATTCAAG TGAGATTTGC TGAATTAAAA 360 

GCTATTCAAG AGCCAGATGA TGCACGTGAC TACTTTCAAA TGGCCAGAGC AAACTGCAAG 420 

AAATTTGCTT TTGTTCATAT ATCTTTTGCA CAATTTGAAC TGTCACAAGG TAATGTCAAA 480 

AAAAGTAAAC AACTTCTTCA AAAAGCTGTA GAACGTGGAG CAGTACCACT AGAAATGCTG 540 

GAAATTGCCC TGCGGAATTT AAACCTCCAA AAAAAGCAGC TGCTTTCAGA GGAGGAAAAG 600 

AAGAATTTAT CAGCATCTAC GGTATTAACT GCCCAAGAAT CATTTTCCGG TTCACTTGGG 660 

CATTTACAGA ATAGGAACAA CAGTTGTGAT TCCAGAGGAC AGACTACTAA AGCCAGGTTT 720 

TTATATGGAG AGAACATGCC ACCACAAGAT GCAGAAATAG GTTACCGGAA TTCATTGAGA 780 

CAAACTAACA AAACTAAACA GTCATGCCCA TTTGGAAGAG TCCCAGTTAA CCT TCTAAAT 840 

AGCCCAGATT GTGATGTGAA GACAGATGAT TCAGTTGTAC CTTGTTTTAT GAAAAGACAA 900 

ACCTCTAGAT CAGAATGCCG AGATTTGGTT GTGCCTGGAT CTAAACCAAG TGGAAATGAT 960 

TCCTGTGAAT TAAGAAATTT AAAGTCTGTT CAAAATAGT C ATTTCAAGGA ACCTCTGGTG 1020 

T CAG ATG AAA AGAGTT CTG A ACTTATTATT ACTGATTCAA TAACCCTGAA GAATAAAACG 1080 

GAATCAAGTC TTCTAGCTAA ATTAGAAGAA ACTAAAGAGT ATCAAGAACC AGAGGTTCCA 1140 

GAGAGTAACC AGAAACAGTG GCAATCTAAG AGAAAGTCAG AGTGTATTAA CCAGAATCCT 12 00 

GCTGCATCTT CAAATCACTG GCAGATTCCG GAGTTAGCCC GAAAAGTTAA TACAGAGCAG 1260 

AAACATACCA CTTTTGAGCA ACCTGTCTTT TCAGTTTCAA AACAGTCACC ACCAATATCA 132 0 

ACATCTAAAT GGTTTGACCC AAAATCTATT TGTAAGACAC CAAGCAGCAA TACCTTGGAT 1380 
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GATTACATGA GCTGTTTTAG AACTCCAGTT GTAAAGAATG ACTTTCCACC TGCTTGTCAG 1440 

TTGTCAACAC CTTATGGCCA ACCTGCCTGT TTCCAGCAGC AACAGCATCA AATACTTGCC 1500 

ACTCCACTTC AAAATTTACA GGTTTTAGCA TCTTCTTCAG CAAATGAATG CATTTCGGTT 1560 

AAAGGAAGAA TTTATTCCAT TTTAAAGCAG ATAGGAAGTG GAGGTTCAAG CAAGGTATTT 1620 

CAGGTGTTAA ATGAAAAGAA ACAGATATAT GCTATAAAAT ATGTGAACTT AGAAGAAGCA 1680 

GATAACCAAA CTCTTGATAG TTACCGGAAC GAAATAGCTT ATTTGAATAA ACTACAACAA 1740 

CACAGTGATA AGATCATCCG ACTTTATGAT TATGAAATCA CGGACCAGTA CATCTACATG 1800 

GTAATGGAGT GTGGAAATAT TGATCTTAAT AGTTGGCTTA AAAAGAAAAA ATCCATTGAT 1860 

CCATGGGAAC GCAAGAGTTA CTGGAAAAAT ATGTTAGAGG CAGTTCACAC AATCCATCAA 192 0 

CATGGCATTG TTCACAGTGA TCTTAAACCA GCTAACTTTC TGATAGTTGA TGGAATGCTA 1980 

AAGCTAATTG ATTTTGGGAT TGCAAACCAA ATGCAACCAG ATACAACAAG TGTTGTTAAA 2040 

GATTCTCAGG TTGGCACAGT TAATTATATG CCACCAGAAG CAATCAAAGA TATGTCTTCC 2100 

TCCAGAGAGA ATGGGAAATC TAAGTCAAAG ATAAGCCCCA AAAGTGATGT TTGGTCCTTA 2160 

GGATGTATTT TGTACTATAT GACTTACGGG AAAACACCAT TTCAGCAGAT AATTAATCAG 2220 

ATTTCTAAAT TACATGCCAT AATTGAT CCT AAT CATGAAA TTGAATTTCC CGATATTCCA 2280 

GAG AAAGAT C TTCAAGATGT GTTAAAGTGT TGTTTAAAAA GGGACCCAAA ACAGAGGATA 2340 

TCCATTCCTG AGCTCCTGGC TCATCCCTAT GTTCAAATTC AAACTCATCC AGT TAACCAA 2400 

ATGGCCAAGG GAACCACTGA AGAAATGAAA TATGTTCTGG GCCAACTTGT TGGT CTGAAT 2460 

TCTCCTAACT CCATTTTGAA AGCTGCTAAA ACTTTATATG AACACTATAG TGGTGGTGAA 2520 
AGTCATAATT CTTCATCCTC CAAGACTTTT GAAAAAAAAA GGGGAAAAAA ATGA 

Seq ID NO: 64 7 Protein sequence 
Protein Accession #: NP_003309.1 

1 11 21 31 41 51 

1 I I 1 I 1 

MESEDLSGRE LTIDSIMNKV RDIKNKFKNE DLTDELSLNK ISADTTDNSG TVNQIMMMAN 60 

NPEDWLSLLL KLEKNSVPLS DALLNKLIGR YSQAIEALPP DKYGQNESFA RIQVRFAELK 120 

AIQEPDDARD YFQMARANCK KFAFVHISFA QFELSQGNVK KSKQLLQKAV ERGAVPLEML 180 

EIALRNLNLQ KKQLLSEEEK KNLSASTVLT AQESFSGSLG HLQNRNNSCD SRGQTTKARF 240 

LYGENMPPQD AEIGYRNSLR QTNKTKQSCP FGRVPVNLLN SPDCDVKTDD SWPCFMKRQ 3 00 

TSRSECRDLV VPGSKPSGND SCELRNLKSV QNSHFKEPLV SDEKSSELII TDSITbKNKT 360 

ESSLLAKLEE TKEYQEPEVP ESNQKQWQSK RKSECINQNP AASSNHWQIP ELARKVNTEQ 42 0 

KHTTFEQPVF SVSKQSPPIS TSKWFDPKSI CKTPSSNTLD DYMSCFRTPV VKNDFPPACQ 4 80 

LSTPYGQPAC FQQQQHQILA TPLQNLQVLA SSSANECISV KGRIYSILKQ IGSGGSSKVF 540 

QVLNEKKQIY AIKYVNLEEA DNQTLDSYRN EIAYLNKLQQ HSDKIIRLYD YEITDQYIYM 600 

VMECGNIDLN SWLKKKKSID PWERKSYWKN MLEAVHTIHQ HGIVHSDLKP ANFLIVDGML 660 

KLIDFGIANQ MQPDTTSWK DSQVGTVNYM PPEAIKDMSS SRENGKSKSK ISPKSDVWSL 72 0 

GCILYYMTYG KTPFQQIINQ ISKLHAIIDP NHEIEFPDIP EKDLQDVLKC CLKRDPKQRI 780 

SIPELLAHPY VQIQTHPVNQ MAKGTTEEMK YVLGQLVGLN SPNSILKAAK TLYEHYSGGE 840 
SHNSSSSKTF EKKRGKK 

Seq ID NO: 648 DNA sequence 
Nucleic Acid Accession #: NM_015507 
Coding sequence: 241.. 1902 

1 11 21 31 41 51 

[ I I ! I I 

CCGCAGAGGA GCCTCGGCCA GGCTAGCCAG GGCGCCCCCA GCCCCTCCCC AGGCCGCGAG 60 

CGCCCCTGCC GCGGTGCCTG GCCTCCCCTC CCAGACTGCA GGGACAGCAC CCGGTAACTG 12 0 

CGAGTGGAGC GGAGGACCCG AGCGGCTGAG GAGAGAGGAG GCGGCGGCTT AGCTGCTACG 180 

GGGTCCGGCC GGCGCCCTCC CGAGGGGGGC TCAGGAGGAG GAAGGAGGAC CCGTGCGAGA 240 

ATGCCTCTGC CCTGGAGCCT TGCGCTCCCG CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 300 

GGGAACGCGG CCAGTGCAAG GCATCACGGG TTGTTAGCAT CGGCACGTCA GCCTGGGGTC 360 

TGTCACTATG GAACTAAACT GGCCTGCTGC TACGGCTGGA GAAGAAACAG CAAGGGAGTC 42 0 

TGTGAAGCTA CATGCGAACC TGGATGTAAG TTTGGTGAGT GCGTGGGACC AAACAAATGC 480 

AGATGCTTTC CAGGATACAC CGGGAAAACC TGCAGTCAAG ATGTGAATGA GTGTGGAATG 540 

AAACCCCGGC CATGCCAACA CAGATGTGTG AATACACACG GAAGCTACAA GTGCTTTTGC 600 

CTCAGTGGCC ACATGCTCAT GCCAGATGCT ACGTGTGTGA ACTCTAGGAC ATGTGCCATG 660 

ATAAACTGTC AGTACAGCTG TGAAGACACA GAAGAAGGGC CACAGTGCCT GTGTCCATCC 72 0 

TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGT CT AG ATATTGATGA ATGTGCCTCT 780 

GGTAAAGTCA TCTGTCCCTA CAATCGAAGA TGTGTGAACA CATTTGGAAG CTACTACTGC 840 

AAATGT CACA TTGGTTTCGA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 900 

AATGAATGTA CTATGGATAG CCATACGTGC AGCCACCATG CCAATTGCTT CAATACCCAA 960 

GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA ATGGACTTCG GTGTTCTGCT 1020 

ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AGAGCACCTG GTACCATCAA AGACAGAATC 1080 

AAGAAGTTGC TTGCTCACAA AAACAGCATG AAAAAGAAGG CAAAAATTAA AAATGTTACC 1140 

CCAGAACCCA CCAGGACTCC TACCCCTAAG GTGAACTTGC AGCCCTTCAA CTATGAAGAG 1200 

ATAGTTTCCA GAGGCGGGAA CTCT CATGGA GGTAAAAAAG GGAATGAAGA GAAAATGAAA 1260 

GAGGGGCTTG AGGATGAGAA AAGAGAAGAG AAAGCCCTGA AGAATGACAT AGAGGAGCGA 132 0 

AGCCTGCGAG GAGATGTGTT TTTCCCTAAG GTGAATGAAG CAGGTGAATT CGGCCTGATT 1380 

CTGGTCCAAA GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTTAAA TATCTCGGTT 1440 

GACTGCAGCT TCAATCATGG GATCTGTGAC TGGAAACAGG ATAGAGAAGA TGATTTTGAC 150 0 

TGGAATCCTG CTGATCGAGA TAATGCTATT GGCTTCTATA TGGCAGTTCC GGCCTTGGCA 1560 

GGTCACAAGA AAGACATTGG CCGATTGAAA CTTCTCCTAC CTGACCTGCA ACCCCAAAGC 1620 

AACTTCTGTT TGCTCTTTGA TTACCGGCTG GCCGGAGACA AAGTCGGGAA ACTTCGAGTG 1680 

TTTGTGAAAA ACAGTAACAA TGCCCTGGCA TGGGAGAAGA CCACGAGTGA GGATGAAAAG 1740 

TGGAAGACAG GGAAAATTCA GTTGTATCAA GGAACTGATG CTACCAAAAG CATCATTTTT 1800 

GAAGCAGAAC GTGGCAAGGG CAAAACCGGC GAAAT CGCAG TGGATGGCGT CTTGCTTGTT 1860 

TCAGGCTTAT GTCCAGATAG CCTTTTATCT GTGGATGACT GAATGTTACT AT CTTTATAT 192 0 

TTGACTTTGT ATGTCAGTTC CCTGGTTTTT TTGATATTGC ATCATAGGAC CTCTGGCATT 1980 

TTAGAATTAC TAGCTGAAAA ATTGTAATGT ACCAACAGAA ATATTATTGT AAGATGCCTT 2040 

TCTTGTATAA GATATGCCAA TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 2100 

T T T CTGAAT C TTTCCACATT ATATTATAAA ATATGGAAAT GTCAGTTTAT CTCCCCTCCT 2160 

CAGT AT AT CT GATTTGTATA AGTAAGTTGA TGAGCTTCTC TCTACAACAT TTCTAGAAAA 2220 

TAGAAAAAAA AGCACAGAGA AATGTTTAAC TGTTTGACTC TTATGATACT TCTTGGAAAC 2280 

TATGACATCA AAGATAGACT TTTGCCTAAG TGGCTTAGCT GGGTCTTTCA TAGCCAAACT 2340 
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TGTATATTTA AATTCTTTGT AATAATAATA TCCAAATCAT CAAAAAAAAA AAAAAAAA 



PCT/US02/12476 



Seq ID NO: 64 9 Protein sequence 
Protein Accession #: NP_056322 



10 
15 
20 
25 
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35 
40 
45 
50 
55 
60 
65 
70 
75 
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85 



MPLPWSLALP 
CEATCEPGCK 
LSGHMLMPDA 
GKVICPYNRR 
GSFKCKCKQG 
PEPTRTPTPK 
SLRGDVPFPK 
WNPADRDNAI 
FVKNSNNALA 
SGLCPDSLLS 



11 

I 

LLLSWVAGGF 
FGECVGPNKC 
TCVNSRTCAM 
CVNTFGSYYC 
YKGNGLRCSA 
VNLQPFNYEE 
VNEAGEFGIjI 
GFYMAVPALA 
WEKTTSEDEK 
VDD 



21 

\ 

GNAASARHHG 
RCFPGYTGKT 
INCQYSCEDT 
KCHIGFELQY 
IPENSVKEVL 
IVSRGGNSHG 
LVQRKALTSK 
GHKKDIGRLK 
WKTGKIQIi YQ 



31 
1 

LLASARQPGV 
CSQDVNECGM 
EEGPQCLCPS 
ISGRYDCIDI 
RAPGTIKDRI 
GKKGNEEKMK 
LEHKDLNISV 
LLLPDLQPQS 
GTDATKSIIF 



41 
I 

CHYGTKLACC 
KPRPCQHRCV 
SGLRLAPNGR 
NECTMDSHTC 
KKLLAHKNSM 
EGLEDEKREE 
DCSFNHGICD 
NFCLLFDYRL 
EAERGKGKTG 



51 
I 

YGWRRNSKGV 
NTHGSYKCFC 
DCLDIDECAS 
SHHANCFNTQ 
KKKAKIKNVT 
KAL KND I EER 
WKQDREDDFD 
AGDKVGKLRV 
EIAVDGVLLV 



Seq ID NO: 650 DNA sequence 

Nucleic Acid Accession #: NM_0O3506.1 

Coding sequence: 259.. 2379 



GCAGCTCCAG 
TTAGACGGGG 
CTCATTTTCA 
ATCTTTGGAT 
ATCAGGAATT 
CTCCTAAGAG 
ATGGCCTACA 
GCGGTGGAAA 
ACTTTCCTCT 
TGTCGTAAAC 
ATCCGATGGC 
GTAACTTTTG 
AGAGACATTG 
TTTCTGGGAA 
CTAGAGTTTG 
TTCACATTCC 
ATATATTACT 
GGCGATAGCA 
CTAGGCTCTC 
GCTGGCACTG 
TGGAGTTGTG 
CCAGGTTTCC 
GGAGTTTGCT 
CTGTGCCTTT 
CATGTTCGAC 
ATTCGAATTG 
TACGTCTATG 
CGTCAGTACC 
TTATTTATGA 
GGAAGCAAAA 
CCAATCAGTG 
TCTAAAGTTA 
TCCAAATCCA 
ATTACTAGCC 
ACATCAATGA 
TGTGGTGAAC 
GGGAAGGGCC 
AAGAGTGATA 
TCAGAACCAA 
AGAAAAGAGC 
CAGAAGCAAA 
TACGTTCTTC 
TCAAGAATAA 
AAATGTGCAG 
CCTTTTCTAT 
TTTACCTTTT 
GTATCTTTTT 
ATTCAAGTAT 
ATTTCTAAGA 
GTCTTATAAT 
TGTGATTTTT 
GGTGCTTACT 
ATATTTAAAA 
GGCCAAGTGC 
AACCACTTAC 
TACATTTTGT 



11 
I 

TCCCGGACGC 
ACGGGAAGGG 
GGAAAGCCTG 
GGGGATCTTC 
TGAAGAAAAT 
GGCACAGTCT 
ACATGACGTT 
TGGAGCATTT 
GCAAAGCATT 
TTTGTGAGAA 
CTGAGGAGCT 
ATCCACACAC 
GATTTTGGTG 
TTGACCAGTG 
CAAAAAGTTT 
TTACTTTTTT 
CTGTCTGTTA 
CAGCCTGCAA 
AAAATAAGGC 
TGTGGTGGGT 
AAGCCATCGA 
TGACTGTTAT 
TTGTTGGCCT 
GTGTGTTTGT 
AAGTCATACA 
GAGTCTTCAG 
AGCAAGTGAA 
ATAT CCCATG 
TAAAATACCT 
AGACATGCAC 
AAAGTCGAAG 
AACACAAAAA 
TGGGAACCAG 
ATGATTACCT 
GAGAGGTGAA 
CTGCCTCGCC 
AGGCAGGCAG 
TTACTGACAC 
GCAGCCTCAA 
AGGGAGGTGG 
TTTGTGTTAC 
TTTTGCACTT 
TATGACTCAT 
GTTAATAATA 
TTATGAAGAT 
TGATATAAAA 
ATACATATTT 
TTTTATCATG 
AAATTGTAAA 
AGGAATTTAA 
ATAGTCTCGT 
CAAAGAGTGT 
TAAAATGTCC 
AATTGACTTC 
AGTTGCTTAT 
ATTATACAGT 



21 

1 

AACCCCGGAG 
ACAGCGGCCT 
AAAATGAGTA 
TGAGGATGCA 
GGAGATGTTT 
CTTCACCTGT 
TTTCCCTAAT 
TCTTCCTCTC 
TGTACCAACC 
AGTATATTCT 
TGAATGTGAC 
AGAATTTCTT 
TCCAAGGCAT 
TGCGCCTCCA 
TATTGGAACA 
AATTGATGTT 
CAGCATTGTA 
TAAGGCAGAT 
TTGCACCGTT 
GATTCTTACC 
G CAAAAAGC A 
GCTTCTTGCT 
TTATGACCTG 
TGGGCTCTCT 
ACATGATGGC 
CGGCTTGTAT 
CAGGATTACC 
TCCTTATCAG 
GATGACATTA 
AGAATGGGCT 
AGTACTACAG 
GAAGCACTAT 
CACAGGAGCT 
AGGACAAGAA 
AGCGGACGGA 
AGCAGCATCC 
TGTATCTGAA 
TGGCCTGGCA 
AGGTT CCACA 
TTGT CATTCA 
ACTGGAAGTG 
AAAGTTGCAT 
TTCACACAAA 
TTTTTTTAAT 
TCTACTCTTG 
TCAAGATATT 
GAAAATAAGC 
CTATTGTGAT 
ATAGTCTTCT 
CTTTAAAAAC 
TTTAGGAATT 
CCACTATTGA 
TAAAGGGTTA 
CCTTTTTTAA 
ATTTTTTGTT 
ACCTTTCTCA 



31 
I 

CCGTCTCAGG 
TCGACCGCCC 
AAATAGTGAA 
AAGAGTGATT 
ACATTTTTGT 
GAACCAATTA 
CTGATGGGTC 
GCAAATCTGG 
TGCATAGAAC 
GATTGCAAAA 
AGATTACAAT 
GGTCCTCAGA 
CTTAAGACTT 
TGCCCCAACA 
GTTTCAATAT 
AGAAGATTCA 
TCTCTTATGT 
GAGAAGCTAG 
TTGTTCATGC 
ATTACTTGGT 
GTGTGGTTTC 
CTGAACAAAG 
GATGCTTCTC 
CTTCTTTTAG 
CGGAACCAAG 
CTTGTGCCAT 
TGGGAGATAA 
GCAAAAGCAA 
ATTGTTGGCA 
GGGTTTTTTA 
GAATCATGTG 
AAACCAAGTT 
ACAGCAAATC 
ACTTTGACAG 
GCTAGCACCC 
AT CT CCAGAC 
AGTGCGCGGA 
CAGAGCAACA 
TCTCTGCTTG 
GATACTTGAA 
ACCTATGCAC 
TGCCTACTGT 
GGTTAATGAC 
AGTGTGGGAG 
GTAAGAGTAT 
TCTTTGCTGA 
TTATATGTAT 
ATTTTAGCAC 
TTTATACTGT 
CCACTTATTG 
TCACAGATCT 
TTGTATTATG 
GTAGACAAAA 
TGTTTCATGA 
TTAACTTTTG 
GACATTTTGT 



41 

1 

TCCCTGGGGG 
CCCGAGTAAT 
ATGAGGAATT 
CATCCAAGCC 
TGACGTGTAT 
CTGTTCCCAG 
ATTATGACCA 
AATGTTCACC 
AAATTCATGT 
AATTAATTGA 
ACTGTGATGA 
AGAAAACAGA 
CTGGGGGACA 
TGTATTTTAA 
TTTGTCTTTG 
GATACCCAGA 
ACTTCATTGG 
AACTTGGTGA 
TTTTGTATTT 
TCTTAGCTGC 
ATGCTGTTGC 
TTGAAGGAGA 
GCTACTTTGT 
CTGGCATTAT 
AAAAACTAAA 
TAGTGACACT 
CTTGGGTCTC 
AAGCTCGACC 
TCTCTGCTGT 
AACGAAATCG 
AGTTTTTCTT 
CACACAAGCT 
ATGGCACTTC 
AAATCCAAA C 
CCAGGTTAAG 
TCTCTGGGGA 
GTGAAGGAAG 
ATTTGCAGGT 
TTCACCCAGT 
GAACATTTTC 
TGTTTTGTAA 
TATACTGGAA 
AACAATATAC 
GACAGAGTTA 
TTTAAGATGT 
AGTATTTAAA 
TTGAACTTTT 
TTTGGTAGCT 
AAAAAAAGAT 
ATACCTTACC 
AAATTATGTA 
CTGCTCACTG 
TGTTAGTCTT 
CCACCCATTG 
TTTCTTAACA 
AG 



51 
I 

GAACGGTGGG 
TGACCCAGGA 
TGAACATTTT 
ATGTGGTAAA 
TTTTCTACCC 
ATGTATGAAA 
GAGTATTGCC 
AAACATTGAA 
GGTTCCACCT 
CACTTTTGGG 
GACTGTTCCT 
ACAAGTCCAA 
AGGATATAAG 
AAGTGATGAG 
TGCAACTCTG 
GAGACCAATT 
ATTTTTGCTG 
CACTGTTGT C 
TTTCACAATG 
AGGAAGAAAA 
ATGGGGAACA 
CAACATTAGT 
ACTCTTGCCA 
TTCCTTAAAT 
GAAATTTATG 
TCTCGGATGT 
TGATCATTGT 
AGAATTGGCT 
CTTCTGGGTT 
CAAGAGAGAT 
AAAGCACAAT 
GAAGGTCATT 
TGCAGTAGCA 
CTCACCAGAA 
AGAACAGGAC 
ACAGGT CG AC 
GATTAGTCCA 
CCCCAGTTCT 
TTCAGGAGTG 
TCTCGTTACT 
GAATCACTGT 
AAAATAGAGT 
CTGAAAACAG 
GAGGAATCTT 
ACTATGCTAT 
TCTTATCCTT 
TTGAAATCCT 
TTTACACTGA 
ATACCAAAAA 
ATCTAAAATG 
ACTGAAATAA 
ATCCTTCTGC 
TTGTATATTA 
ATTGTATTAT 
TTTAGAATAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 



60 
120 
18 0 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 



Seq ID NO: 651 Protein sequence 
Protein Accession #: NP_0034 97.1 

1 11 21 
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WO 02/086443 

MEMFTFLIiTC IFI.PLLRGHS LFTCEPITVP RCMKMAYNMT FFPNJjMGHYD QS IAAVEMEH 60 

FLPLANLECS PNIETFLCKA FVPTCIEQIH WPPCRKLCE KVYSDCKKLI DTFGIRWPEE 120 

LECDRLQYCD ETVPVTFDPH TEPLGPQKKT EQVQRDIGPW CPRHIjKTSGG QGYKFLGIDQ 180 

CAPPCPNMYF KSDELEFAKS FIGTVSIFCL CATLFTFLTF LIDVRRFRYP ERPI1YYSVC 240 

YSIVSLMYFI GFLLGDSTAC NKADEKLELG DTWLGSQNK ACTVIiFMLLY FFTMAGTVWW 300 

VILTITWFLA AGRKWSCEAI EQKAVWFHAV AWGTPGFLTV MLLAIjNKVEG DNISGVCFVG 360 

MDLDASRYF VLLPhCLCVF VGLSLLLAGI ISLNHVRQVI QHDGRKQEKL KKFMIRIGVF 420 

SGLYLVPLVT LLGCYVYEQV NRITWEITWV SDHCRQYHIP CPYQAKAKAR PELAIiFMIKY 480 

LMTLIVGISA VFWVGSKKTC TEWAGFFKRN RKRDPISESR RVLQESCEFF LKHNSKVKHK 540 

KKHYKPSSHK LKVI SKSMGT STGATANHGT SAVAITSHDY LGQETLTEIQ TSPETSMREV 600 

KADGAST PRL REQDCGEPAS PAAS1SRLSG EQVDGKGQAG SVSESARSEG RISPKSDITD 660 
TGLAQSNNLQ VPSSSEPSSL KGSTSLLVHP VSGVRKEQGG GCHSDT 



PCT/US02/12476 



Seq ID NO: 652 DNA sequence 
15 Nucleic Acid Accession #: NM_014791 . 1 
Coding sequence: 171.. 2126 



TTGGCGGGCG 
GCCGCGTCTC 
TCCGCCCCTC 
ATGATGAACT 
AGGT CAAACT 
AAAACACACT 
TGAGACATCA 
TGGTTCTTGA 
TGTCAGAAGA 
ACAGCCAGGG 
ATAAATTAAA 
ATCTACAGAC 
CATATCTTGG 
GTGGATTTCT 
GAAAATATGA 
TGCAGGTGGA 
TGCAAGATTA 
ATGATTGCGT 
TAATTTCACT 
AGGCTCGGGG 
CTACCCCATT 
ATAAAAATTA 
GTGCTGCTAC 
AATCTAAATC 
AAAATGTATA 
CAAAGACTCC 
ACACTACACC 
TAAATTCAAC 
GCTCAGTGGA 
CCAAAGTGTT 
GCAAAAGGAA 
CTACAACTAG 
AGAAGCATGT 
TTGGGAAAGT 
TGGGTAT C AG 
ACATCCTATC 
GGTGTGATAC 
CTACCAACTT 
GATATTATTT 
TAATCATGTG 
AATGTAAGCT 
TTGTGAATAT 



11 

I 

GAAGCGGCCA 
TCAGGACAGC 
AGGTTCTTTT 
TCTCAAATAT 
TGCCTGCCAT 
AGGGAGTGAT 
GCATATATGT 
GTACTGCCCT 
GGAGACCCGG 
CTATGCTCAC 
GCTGATTGAC 
ATGCTGTGGG 
ATCAGAGGCA 
ACCATTTGAT 
TGTTCCCAAG 
CCCAAAGAAA 
CAACTATCCT 
AACAGAACTT 
GTGGCAGTAT 
AAAACCAGTT 
CACAGACATC 
TGTGGCGGGA 
TCCCCGAACA 
ATTAACTCCA 
TACTCCTAAG 
AGTTAATAAG 
CTCAAAAGCT 
AGGAACAGAC 
ATTGGATCTC 
TGGGAGCCTT 
GGGTTCTGCC 
ATTAGTGAAT 
TGACTTTGTA 
GACAATGCAA 
GAGGCAGCGG 
TAGCTGCAAG 
AGCCTACATA 
GTTTCTAAAG 
TGTGTATGAA 
GTTTTGTATA 
CTTAACTATG 



21 
I 

CAACCCGGCG 
AGGCCCCTGT 
TCTAATTCCA 
TATGAATTAC 
ATCCTTACTG 
TTGCCCCGGA 
CAACTCTACC 
GGAGGAGAGC 
GTTGTCTTCC 
AGGGACCTCA 
TTTGGTCTCT 
AGTCTGGCTT 
GATGTTTGGA 
GATGATAATG 
TGGCTCTCTC 
CGGATTTCTA 
GTTGAGTGGC 
TCTGTACATC 
GATCACCTCA 
CGTTTAAGGC 
AAGTCAAATA 
TTAATAGACT 
TCACAGTTTA 
GCCTTATGCA 
TCTGCTGTAA 
AACCAGCATA 
AGAAACCAGT 
AAGTTAATGA 
AACCAAGCAC 
GAAAGGGGGT 
AGAGACGGGC 
CCAGAT CAAC 
CAAAAGGGTT 
TTTGAATTAG 
CTTAAGGGCG 
GTATAATTGA 
AAGACTGTTA 
AGCTATCTTA 
TCTAAATCAA 
TTAATAATTG 
TCTCTTTGTA 



31 
I 

ATCGAAAAGA 
CCTTCTGTCG 
AATAAACTTG 
ATGAAACTAT 
GAGAGATGGT 
TCAAAACGGA 
ATGTGCTAGA 
TGTTTGACTA 
GTCAGATAGT 
AGCCAGAAAA 
GTGCAAAACC 
ATGCAGCACC 
GCATGGGCAT 
TAATGGCTTT 
CCAGTAGCAT 
TGAAAAATCT 
AAAGCAAGAA 
ACAGAAACAA 
CGGCTACCTA 
TTTCTTCTTT 
ATTGGAGTCT 
ATGATTGGTG 
CCAAGTACTG 
GAACACCTGC 
AGAATGAAGA 
AGAGAGAAAT 
GCCTGAAAGA 
CAGGTGTCAT 
ATATGGAGGA 
TGGATAAGGT 
CCAGAAGACT 
TGTTGAATGA 
ATACACTGAA 
AAGTGTGCCA 
ATGCCTGGGT 
TGGATTCTTC 
TGATCGCTTT 
AGACCAATAT 
GCCCATCTGT 
TTGACTTTCT 
ATGTGTAATT 



41 
I 

TTCTTAGGAA 
GGCGCCGCTC 
CAAGAGGACT 
TGGGACAGGT 
AGCTATAAAA 
GATTGAGGCC 
GACAGCCAAC 
TATAATTTCC 
ATCTGCTGTT 
TTTGCTGTTT 
CAAGGGTAAC 
TGAGTTAATA 
ACTGTTATAT 
ATACAAGAAG 
TCTGCTTCTT 
ATTGAACCAT 
TCCTTTTATT 
CAGGCAAACA 
TCTTCTGCTT 
CTCCTGTGGA 
GGAAGATGTG 
TGAAGATGAT 
GACAGAATCA 
AAATAAATTA 
GTACTTTATG 
ACTCACTACG 
AACTCCAATT 
TAGCCCTGAG 
GACTCCAAAA 
TATCACTGTG 
AAAGCTTCAC 
AATAATGTCT 
GTGTCAAACA 
GCTTCAAAAA 
TTACAAAAGA 
CATCCTGCCG 
GATTTTAAAG 
CTCTTTGTTT 
CATTATGTTA 
TAGATTCACT 
TCTTTCTGAA 



51 

i 

CGCCGTACCA 
AGCCGTGCCC 
ATGAAAGATT 
GGCTTTGCAA 
ATCATGGATA 
TTGAAGAACC 
AAAATATTCA 
CAGGATCGCC 
GCTTATGTGC 
GATGAATATC 
AAGGATTACC 
CAAGGCAAAT 
GTTCTTATGT 
ATTATGAGAG 
CAACAAATGC 
CCCTGGATCA 
CACCTCGATG 
ATGGAGGATT 
CTAGCCAAGA 
CAAGCCAGTG 
ACCGCAAGTG 
T TAT CAAC AG 
AATGGGGTGG 
AAGAACAAAG 
TTTCCTGAGC 
CCAAATCGTT 
AAAATACCAG 
AGGCGGTGCC 
AGAAAGGGAG 
CTCACCAGGA 
TATAATGTGA 
ATTCTTCCAA 
CAGTCAGATT 
CCCGATGTGG 
TTAGTGGAAG 
GATGAGTGTG 
TTCATTGGAA 
TTAAACAAAA 
CTGTCTTTTT 
TCCATATGTG 
ATAAAACCAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



Seq ID NO: 653 Protein sequence 
Protein Accession #: NP_055606.1 



MKDYDELLKY 
IiKNLRHQHIC 
AYVHSQGYAH 
QGKSYLGSEA 
QQMLQVDPKK 
MEDLISLWQY 
TASDKNYVAG 
KNKENVYTPK 
KIPVNSTGTD 
LTRSKRKGSA 
QSDFGKVTMQ 



11 
I 

YELHETIGTG 
QLYHVLETAN 
RDLKPENLLF 
DVWSMGILLY 
RISMKNLLNH 
DHLTATYLLL 
LIDYDWCEDD 
SAVKNEEYFM 
KLMTGVISPE 
RDGPRRLKLH 
FELEVCQLQK 



21 

i 

GFAKVKLACH 
KIFMVLEYCP 
DEYHKLKLID 
VLMCGFLPFD 
PWIMQDYNYP 
LAKKARGKPV 
LSTGAATPRT 
FPEPKTPVNK 
RRCRSVELDD 
YNVTTTRLVN 
PDWGIRRQR 



31 
i 

ILTGEMVAIK 
GGELFDYIIS 
FGLCAKPKGN 
DDNVMALYKK 
VEWQSKNPFI 
RLRLSSFSCG 
SQFTKYWTES 
NQHKREILTT 
NQAHMEETPK 
PDQLLNEIMS 
LKGDAWVYKR 



41 
I 

IMDKNTLGSD 
QDRLSEEETR 
KDYHLQTCCG 
IMRGKYDVPK 
HLDDDCVTEL 
QASATPFTDI 
NGVESKSLTP 
PNRYTTPSKA 
RKGAKVFGSL 
ILPKKHVDFV 
LVEDILSSCK 



51 
I 

LPRIKTEIEA 
WFRQIVSAV 
SLAYAAPELI 
WLSPSSILLL 
SVHHRNNRQT 
KSNNWSIiEDV 
ALCRTPANKL 
RNQCLKETPI 
ERGLDKV I TV 
QKGYTLKCQT 
V 



Seq ID NO: 654 DNA sequence 
Nucleic Acid Accession #: NM_000582 
Coding sequence: 8 8.. 990 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



11 



21 



31 



41 



51 



GCAGAGCACA GCATCGTCGG GACCAGACTC GTCTCAGGCC AGTTGCAGCC TTCTCAGCCA 
AACGCCGACC AAGGAAAACT CACTACCATG AGAATTGCAG TGATTTGCTT TTGCCTCCTA 



60 
120 



436 



WO 02/086443 

GGCATCACCT GTGCCATACC AGTTAAACAG 
CTTTACAACA AATACCCAGA TGCTGTGGCC 
CAGAAT CTCC TAGCCCCACA GACCCTTCCA 
GATGATATGG ATGATGAAGA TGATGATGAC 
AACGACTCTG ATGATGTAGA TGACACTGAT 
TCTGATGAAT CTGATGAACT GGTCACTGAT 
TTCACTCCAG TTGTCCCCAC AGTAGACACA 
GGACTGAGGT CAAAATCTAA GAAGTTTCGC 
GACGAGGACA TCACCTCACA CATGG AAAG C 
CCCGTTGCCC AGGACCTGAA CGCGCCTTCT 
GAAACGAGTC AGCTGGATGA CCAGAGTGCT 
TATAAGCGGA AAGCCAATGA TGAGAGCAAT 
CTTTCCAAAG TCAGCCGTGA ATTCCACAGC 
GTTGTAGACC CCAAAAGTAA GGAAGAAGAT 
TTAGATAGTG CATCTTCTGA GGTCAATTAA 
ATTTAGTCAA AAGAAAAAAT GCTTTATAGC 
CTCAGTTTAT TGGTTGAATG TGTATCTATT 
ATTAGTTTAG TTTGTGGCTT CATGGAAACT 
CT ATGTT CAT TCTATAGAAG AAATGCAAAC 
T CATGAAT AG AAATTTATGT AGAAGCAAAC 
ATAACATTTT ATGTCACTAT AATCTTTTGT 
TATCTTTTTG TGGTGTGAAT AAATCTTTTA 
AATTGCTTAT TTGTTTTCCC ACGGTTGTCC 
GCCTAAAAAA AAAAAAAAAA AAAA 



GCTGATTCTG GAAGTTCTGA GGAAAAGCAG 
ACATGGCTAA ACCCTGACCC ATCTCAGAAG 
AGTAAGTCCA ACGAAAGCCA TGACCACATG 
CATGTGGACA GCCAGGACTC CATTGACTCG 
GATTCTCACC AGT CTGATGA GTCTCACCAT 
TTTCCCACGG ACCTGCCAGC AACCGAAGTT 
TATGATGGCC GAGGTGATAG TGTGGTTTAT 
AGACCTGACA TCCAGTACCC TGATGCTACA 
GAGGAGTTGA ATGGTGCATA CAAGGCCATC 
GATTGGGACA GCCGTGGGAA GGACAGTTAT 
GAAACCCACA GCCACAAGCA GTCCAGATTA 
GAGCATTCCG ATGTGATTGA TAGTCAGGAA 
CATGAATTTC ACAGCCATGA AGATATGCTG 
AAACACCTGA AATTTCGTAT TTCTCATGAA 
AAGGAGAAAA AATACAATTT CTCACTTTGC 
AAAATGAAAG AGAACATGAA ATGCTTCTTT 
TGAGTCTGGA AATAACTAAT GTGTTTGATA 
CCCTGTAAAC TAAAAGCTTC AGGGTTATGT 
TATCACTGTA TTTTAATATT TGTTATTCTC 
AAAATACTTT TACCCACTTA AAAAGAGAAT 
TTTTTAAGTT AGTGTATATT TTGTTGTGAT 
TCTTGAATGT AATAAGAATT TGGTGGTGTC 
AGCAATTAAT AAAACATAAC CTTTTTTACT 



Seq ID NO: 655 Protein sequence 
Protein Accession #: NP_000573 

1 11 21 31 41 51 

I I I I I I 

MRIAVICFCL LGITCAIPVK QADSGSSEEK QLYNKYPDAV ATWLNPDPSQ KQNLLAPQTL 

PSKSNESHDH MDDMDDEDDD DHVDSQDSID SNDSDDVDDT DDSHQSDESH HSDESDELVT 

DFPTDLPATE VFTPWPTVD TYDGRGDSW YGhRSKSKKF RRPDIQYPDA TDEDITSHME 

SEELNGAYKA IPVAQDLNAP SDWDSRGKDS YETSQLDDQS AETHSHKQSR LYKRKANDES 

NEHSDVIDSQ ELSKVSREFH SHEFHSHEDM LWDPKSKEE DKHLKFRISH ELDSASSEVN 



Seq ID NO: 656 DNA sequence 

Nucleic Acid Accession #: NM_003108.1 

Coding sequence: 76., 1401 



1 11 21 31 41 51 

I ! I I I i 

GGGGTGGGAG GGGGAGGGGG ACCTCCGCAC GAGACCCAGC GGCCCGGGTT GGAGCGTCCA 
GCCCTGCAAC GGATCATGGT GCAGCAGGCG GAGAGCTTGG AAGCGGAGAG CAACCTGCCC 
CGGGAGGCGC TGGACACGGA GGAGGGCGAA TTCATGGCTT GCAGCCCGGT GGCCCTGGAC 
GAGAGCGACC CAGACTGGTG CAAGACGGCG TCGGGCCACA TCAAGCGGCC GATGAACGCG 
TTCATGGTAT GGT CCAAG AT CGAACGCAGG AAGATCATGG AGCAGTCTCC G G AC AT G C AC 
AACGCCGAGA T CT CCAAGAG GCTGGGCAAG CGCTGGAAAA TGCTGAAGGA CAGCGAGAAG 
ATCCCGTTCA TCCGGGAGGC GGAGCGGCTG CGGCTCAAGC ACATGGCCGA CTACCCCGAC 
TACAAGTACC GGCCCCGGAA AAAGCCCAAA ATGGACCCCT CGGCCAAGCC CAGCGCCAGC 
CAG AG CCCAG AGAAGAGCGC GGCCGGCGGC GGCGGCGGGA GCGCGGGCGG AGGCGCGGGC 
GGTGCCAAGA CCTCCAAGGG CTCCAGCAAG AAATGCGGCA AGCTCAAGGC CCCCGCGGCC 
GCGGGCGCCA AGGCGGGCGC GGGCAAGGCG GCCCAGTCCG GGGACTACGG GGGCGCGGGC 
GACGACTACG TGCTGGGCAG CCTGCGCGTG AGCGGCTCGG GCGGCGGCGG CGCGGGCAAG 
ACGGTCAAGT GCGTGTTTCT GGATGAGGAC GACGACGACG ACGACGACGA CGACGAGCTG 
CAGCTGCAGA TCAAACAGGA GCCGGACGAG GAGGACGAGG AACCACCGCA CCAGCAGCTC 
CTGCAGCCGC CGGGG CAGCA GCCGT CGCAG CTGCTGAGAC GCTACAACGT CGCCAAAGTG 
CCCGCCAGCC CTACGCTGAG CAGCTCGGCG GAGTCCCCCG AGGGAGCGAG CCTCTACGAC 
GAGGTGCGGG CCGGCGCGAC CTCGGGCGCC GGGGGCGGCA GCCGCCTCTA CTACAGCTTC 
AAGAACATCA CCAAGCAGCA CCCGCCGCCG CTCGCGCAGC CCGCGCTGTC GCCCGCGTCC 
TCGCGCTCGG TGTCCACCTC CTCGTCCAGC AGCAGCGGCA GCAGCAGCGG CAGCAGCGGC 
GAGGACGCCG ACGACCTGAT GTTCGACCTG AGCTTGAATT TCTCTCAAAG CGCGCACAGC 
GCCAGCGAGC AGCAGCTGGG GGGCGGCGCG GCGGCCGGGA ACCTGTCCCT GTCGCTGGTG 
GATAAGGATT TGGATTCGTT CAGCGAGGGC AGCCTGGGCT CCCACTTCGA GTTCCCCGAC 
TACTGCACGC CGGAGCTGAG CGAGATGATC GCGGGGGACT GGCTGGAGGC GAACTTCTCC 
GACCTGGTGT TCACATATTG AAAGGCGCCC GCTGCTCGCT CTTTCTCTCG GAGGGTGCAG 
AGCTGGGTTC CTTGGGAGGA AGTTGTAGTG GTGATGATGA TGATGATGAT AATGATGATG 
ATGATGGTGG TGTTGATGGT GGCGGTGGTA GGGTGGAGGG GAGAGAAGAA GATGCTGATG 
AT AT TGATAA GATGT CGTG A CGCAAAGAAA TTGGAAAACA TGATGAAAAT TTTGGTGGAG 
TTAAAGTGAA ATGAGTAGTT TTTAAACATT TTTCCTGTCC TTTTTTTGTC CCCCCTCCCT 
TCCTTTATCG TGTCTCAAGG TAGTTGCATA CCTAGTCTGG AGTTGTGATT ATTTTCCCAA 
AAAATGTGTT TTTGTAATTA CTATTTCTTT TTCCTGAAAT TCGTGATTGC AACAAAGGCA 
GAGGGGGCGG CGCGGCGGAG GGGAGGTAGG ACCCGCTCCG GAAGGCGCTG TTTGAAGCTT 
GTCGGTCTTT GAAGTCTGGA AGACGTCTGC AGAGGACCCT TTTGG CAGCA CAACTGTTAC 
TCTAGGGAGT TGGTGGAGAT ATTTTTTTTT CTTAAGAGAA CTTAAAGAAC TGGTGATTTT 
TTTTTAACAA AAAAAGGG 

Seq ID NO: 657 Protein sequence 
Protein Accession #: NP_003099.1 

l 11 21 31 41 SI 

I I I I 1 I 

MVQQAESLEA ESNLPREALD TEEGEFMACS PVALDESDPD WCKTASGH I K RPMNAFMVWS 

KIERRKIMEQ SPDMHNAEIS KRLGKRWKML KDSEKIPFIR EAERLRLKHM ADYPDYKYRP 

RKKPKMDPSA KPSASQSPEK SAAGGGGGSA GGGAGGAKTS KG S SKKCGKL KAPAAAGAKA 

GAGKAAQSGD YGGAGDDYVL GSLRVSGSGG GGAGKTVKCV FLDEDDDDDD DDDELQLQIK 



437 
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QEPDEEDEBP PHQQMjQPPG QQPSQLLRRY NVAKVPASPT LSSSAESPEG ASLYDEVRAG 300 
ATSGAGGGSR LYYSFKNITK QHPPPDAQPA LSPASSRSVS TSSSSSSGSS SGSSGEDADD 360 
LMFDLSLNFS QSAHSASEQQ LGGGAAAGNL SLSLVDKDLD SFSEGSLGSH FEFPDYCTPE 420 
LSEMIAGDKL EANFSDLVFT Y 

Seq ID NO: 658 DNA sequence 
Nucleic Acid Accession #: NM_001719 
Coding sequence: 123.. 1418 



PCT/US02/12476 



GGGCGCAGCG 
CTGCCACCTG 
CGATGCACGT 
CCCTGTTCCT 
GCTTCATCCA 
CCATTTTGGG 
CCATGTTCAT 
GCCAGGGCTT 
GCCTGCAAGA 
TGGAACATGA 
TTTCCAAGAT 
ACATCCGGGA 
AGCACTTGGG 
AGGAGGGCTG 
GGCACAACCT 
AGTTGGCGGG 
TCTTCAAGGC 
GCCAGAACCG 
AG AAC AG CAG 
GAGACCTGGG 
AGGGGGAGTG 
AGACGCTGGT 
AGCTCAATGC 
ACAGAAACAT 
TTGGGGCCAA 
CTGCCTTTTG 
AAACATGAGC 
TCCTACAAGC 
GCCGGGCCAG 
TTATGAGCGC 
GGGCACATTG 
CAATAAAACG 



IX 

] 

GGGCCCGTCT 
GGGCGGTGCG 
GCGCTCACTG 
GCTGCGCTCC 
CCGGCGCCTC 
CTTGCCCCAC 
GCTGGACCTG 
CTCCTACCCC 
TAGCCATTTC 
CAAGGAATTC 
CCCAGAAGGG 
ACGCTTCGAC 
CAGGGAATCG 
GCTGGTGTTT 
GGGCCTGCAG 
CCTGATTGGG 
CACGGAGGTC 
CTCCAAGACG 
CAGCGACCAG 
CTGGCAGGAC 
TGCCTTCCCT 
CCACTTCATC 
CATCTCCGTC 
GGTGGTCCGG 
GTTTTTCTGG 
TGAGACCTTC 
AGCATATGGC 
TGTGCAGGCA 
GTCATTGGCT 
CTACCAGCCA 
GTGTCTGTGC 
AATGAATG 



21 

! 

GCAGCAAGTG 
GGCCCGGAGC 
CGAGCTGCGG 
GCCCTGGCCG 
CGCAGCCAGG 
CGCCCGCGCC 
TACAACGCCA 
TACAAGGCCG 
CTCACCGACG 
TTCCACCCAC 
GAAGCTGT C A 
AATGAGACGT 
GATCTCTTCC 
GACATCACAG 
CTGTCGGTGG 
CGGCACGGGC 
CACTTCCGCA 
CCCAAGAACC 
AGGCAGGCCT 
TGGAT CAT CG 
CTGAACTCCT 
AACCCGGAAA 
CTCTACTTCG 
GCCTGTGGCT 
ATCCTCCATT 
CCCTCCCTAT 
TTTTGATCAG 
AAACCTAGCA 
GGGAAGTCTC 
GGCCACCCAG 
GAAAGGAAAA 



31 

I 

ACCGACGGCC 
CCGGAGCCCG 
CGCCGCACAG 
ACTTCAGCCT 
AGCGGCGGGA 
CGCACCTCCA 
TGGCGGTGGA 
TCTTCAGTAC 
CCGACATGGT 
GCTACCACCA 
CGGCAGCCGA 
TCCGGATCAG 
TGCTCGACAG 
CCACCAGCAA 
AGACGCTGGA 
CCCAGAACAA 
GCATCCGGTC 
AGGAAGCCCT 
GTAAGAAGCA 
CGCCTGAAGG 
ACATGAACGC 
CGGTGCCCAA 
ATGACAGCTC 
GCCACTAGCT 
GCTCGCCTTG 
CCGCAACTTT 
TTTTTCAGTG 
GGAAAAAAAA 
AGCCATGCAC 
CCGTGGGAGG 
TTGACCCGGA 



41 
i 

GGGACGGCCG 
GGTAGCGCGT 
CTTCGTGGCG 
GGACAACGAG 
GATGCAGCGC 
GGGCAAGCAC 
GGAGGGCGGC 
CCAGGGCCCC 
CATGAGCTTC 
TCGAGAGTTC 
ATTCCGGATC 
CGTTTATCAG 
CCGTACCCTC 
CCACTGGGTG 
TGGGCAGAGC 
GCAGCCCTTC 
CACGGGGAGC 
GCGGATGGCC 
CGAGCTGTAT 
CTACGCCGCC 
CACCAACCAC 
GCCCTGCTGT 
CAACGTCATC 
CCTCCGAGAA 
GCCAGGAACC 
AAAGGTGTGA 
GCAGCATCCA 
ACAACGCATA 
GGACTCGTTT 
AAGGGGGCGT 
AGTTCCTGTA 



51 
1 

CCTGCCCCCT 
AGAGCCGGCG 
CTCTGGGCAC 
GTGCACTCGA 
GAGATCCTCT 
AACT CGGCAC 
GGGCCCGGCG 
CCTCTGGCCA 
GTCAACCTCG 
CGGTTTGATC 
TACAAGGACT 
GTGCTCCAGG 
TGGGCCTCGG 
GTCAATCCGC 
ATCAACCCCA 
ATGGTGGCTT 
AAACAGCGCA 
AACGTGGCAG 
GTCAGCTTCC 
TACTACTGTG 
GCCATCGTGC 
GCGCCCACGC 
CTGAAGAAAT 
TTCAGACCCT 
AGCAGACCAA 
GAGTATTAGG 
ATGAACAAGA 
AAGAAAAATG 
CC AGAGGTAA 
GGCAAGGGGT 
ATAAATGTCA 



Seq ID NO: 65 9 Protein sequence 
Protein Accession #: NP_001710 



MHVRSLRAAA 
ILGLPHRPRP 
LQDSHFLTDA 
IRERFDNETF 
HNLGLQLSVE 
QNRSKTPKNQ 
GECAFPLNSY 
RNMWRACGC 



11 

I 

PHSFVALWAP 
HLQGKHNSAP 
DMVMSFVNLV 
RISVYQVLQE 
TLDGQSINPK 
EALRMANVAE 
MNATNHAIVQ 
H 



21 

\ 

LFLLRSALAD 
MFMLDLYNAM 
EHDKEFFHPR 
HLGRESDLFL 
LAGLIGRHGP 
NSSSDQRQAC 
TLVHFINPET 



31 
I 

FSLDNEVHSS 
AVEEGGGPGG 
YHHREFRFDL 
LDSRTLWASE 
QNKQPFMVAF 
KKHELYVSFR 
VPKPCCAPTQ 



Seq ID NO: 660 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 211.. 1895 



GG AT CTGAGG 
GGGGCTTGGG 
GAGGAATTAT 
TGATTTTTTT 
GTGCTTTTTC 
CACAGGTTCC 
CTTGTGCTGA 
GAAGGTAATT 
AAAATAT CGG 
TTCCGACACT 
GCCAATTATT 
TTCTTTGAAC 
GCTGTGGCTA 
CACATGCACT 
GTAGT CCATG 
CAAAATTCCA 
GTTGTGATGT 
TACCTGCATA 
ATCTTGATAG 
ACTCTGGCTG 
GCACCGATCT 
CTAGCTACCA 
AAACTGGCCA 



11 
I 

GGCGCCCAGT 
AGGCAGCCTG 
CTGATAAAAT 
CCCTCGAAAA 
TTTTCTCTTC 
TTGAACAGCT 
AAGCGAAAGT 
GTTTCCCTGA 
CTGTTCCATG 
GTAACCCCAA 
CAGACTGCCT 
GCCTCTATGT 
TTCTCATCAT 
TATTTGTGTC 
CTCACATAGG 
TTGAGGCAAC 
TTATTTACTT 
ATCTCATCTT 
GCTGGGGGTT 
ATGCGAGGTG 
TAGCAGCTAT 
AAATCTGGGA 
AATCGACACT 



21 
I 

CACTTCCTCC 
CTCTCCAGTC 
TCCTGGGTTA 
TGACCTTTTT 
TTTTTCTACG 
GGATTCTGAT 
ACAATGTGAA 
ATGGGATGGA 
CCCTCCTTAT 
TGGAACATGG 
TCGCTTTCTG 
AATGTATACC 
TGGTTACTTC 
TTTCATGCTG 
AGTAAAGGAG 
TTCTGTGGAC 
CCTGGCTACA 
TGTGGCTTTC 
TCCAGCAGCA 
CTGGGAACTT 
TGGGCTGAAT 
GACCAATGCA 
GGTCCTGGTC 



31 
1 

ACGTTCT CGT 
CCTATCCACC 
ATATTTTTAA 
ATGCTTCGAA 
ATAAATGAAA 
GGCACCATTA 
CTCAACATCA 
CTCATTTGTT 
ATTTATGACT 
GATTTTATGC 
CAGCCAGATA 
GTTGGCTACT 
AGACGATTGC 
AGAGCTACAA 
CTGGAGTCCC 
AAATCACAAT 
AATTATTATT 
TTTTCGGACA 
TTTGTTGCAG 
AGTGCTGGAG 
TTTATTCTGT 
GTTGGGCATG 
CTAGTCTTTG 



41 
I 

FIHRRLRSQE 
QGFSYPYKAV 
SKIPEGEAVT 
EGWLVFDITA 
FKATEVHFRS 
DLGWQDWIIA 
LNAISVLYFD 



41 
I 

GCTGGGCGGG 
CACAGGTTTT 
AAACGGAGAG 
GCAGTTTGTC 
GCATTTCTTC 
CTATAGAGGA 
CAGCTCAACT 
GGCCCAGAGG 
TCAACCATAA 
ACAGCTTAAA 
TCAGCATAGG 
CCATCTCTTT 
ATTGCACTAG 
GCATCTTTGT 
TAATAATGCA 
AT AT CGGGTG 
GGATCCTGGT 
CCAAATACCT 
CATGGGCTGT 
ACATCAAGTG 
TTCTGAATAC 
ACACAAGGAA 
GAGTGCATTA 



51 

i 

RREMQREILS 
FSTQGPPLAS 
AAEFRIYKDY 
TSNHWWNPR 
IRSTGSKQRS 
PEGYAAYYCE 
DSSNVILKKY 



51 

i 

AGGAGCGGAT 
TTGGGTCGGA 
TTTTTAAAAA 
AAC CAG CAT A 
AAGAAAAAGG 
GCAGATTGTC 
CCAGGAGGGA 
AACAGTGGGG 
AGGAGTTGCT 
TAAAACATGG 
AAAGCAAGAA 
TGGTTCCTTG 
GAACTATATC 
CAAAGACAGA 
GGATGACCCA 
CAAGATTGCT 
GGAAGGTCTC 
GTGGGGCTTC 
GGCACGAGCA 
GATTTATCAA 
GGTTAGAGTT 
GCAATACAGG 
CATCGTGTTC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
72 0 
780 
840 
900 
960 
1020 
1080 
1140 
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1260 
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1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



438 



WO 02/086443 

GTATGCCTGC CTCACTCCTT CACTGGGCTC GGGTGGGAGA TCCGCATGCA CTGTGAGCTC 1440 

TTCTTCAACT CCTTTCAGGG TTTCTTTGTG TCTATCATCT ACTGCTACTG CAATGGAGAG 1500 

GTT CAGGCAG AGGTGAAGAA GATGTGGAGT CGGTGGAATC TCTCCGTGGA CTGGAAAAGG 1560 

ACACCGCCAT GTGGCAGCCG CAGATGCGGC TCAGTGCTCA CCACCGTGAC GCACAGCACC 1620 

AGCAGCCAGT CACAGGTGGC GGCCAGCACA CGCATGGTGC TTATCTCTGG CAAAGCTGCC 1680 

AAGATCGCCA GCAGACAGCC TGACAGCCAC ATCACTTTAC CTGGCTATGT CTGGAGTAAC 1740 

TCAGAGCAGG ACTGCCTGCC AGACTCTTTC CACGAGGAGA CCAAGGAAGA TAGTGGGAGG 1800 

CAGGGAGATG ATATTCTAAT GGAGAAGCCT TCCAGGCCTA TGGAATCTAA CCCAGACACT 1860 
GAAGGATGCC AAGGAGAAAC TGAGGATGTT CTCTGA 

Seq ID NO: 661 Protein sequence 
Protein Accession Eos sequence 

1 11 21 31 41 51 

I I 1 I I I 

MhRSShSTSI VLFLFSSFST INESISSRKR HRFLEQLDSD GTITIEEQIV LVLKAKVQCE 60 

LNITAQLQEG EGNCFPEWDG LICWPRGTVG KISAVPCPPY IYDFNHKGVA FRHCNPNGTW 120 

DPMHSLNKTW ANYSDCLRFL QPDISIGKQE FPERLYVMYT VGYSISFGSL AVAILIIGYF 180 

RRLHCTRNYI HMHLFVSFML RATS I FVKDR WHAHIGVKE IiESIjIMQDDP QNSIEATSVD 240 

KSQYIGCKIA WMFIYFLAT NYYWIIjVEGL YLHNLIFVAF FSDTKYLWGF ILIGWGFPAA 300 

FVAAWAVARA TLADARCWEL SAGDIKWIYQ APILAAIGLN FILFLNTVRV LATKIWETNA 360 

VGHDTRKQYR KLAKSTLVLV LVFGVHYIVF VCLPHSFTGL GWEIRMHCEL FFNSFQGFFV 420 

SIIYCYCNGE VQAEVKKMWS RWNLSVDWKR TPPCGSRRCG SVLTTVTHST SSQSQVAAST 480 

RMVLI SGKAA KIASRQPDSH ITLPGYVWSN SEQDCLPHSF HEETKEDSGR QGDDILMEKP 540 
SRPMESNPDT EGCQGETEDV L 

Seq ID NO: 662 DNA sequence 
Nucleic Acid Accession #: NM_005048 
Coding sequence: 143.. 1795 

1 11 21 31 41 51 

I I 1 I I I 

GGCCGGTGGC CCGGGCCCGA CCACCCCAGC TGCGCGTCGT TACTGGCCAC AAGTTTGCT C 60 

TGGGCCAGCC AAGTTGGCAA CTTGGAAGCT TCTCCCGGGC TCTGGAGGAG GGTCCCTGCT 120 

TCTTCCTACA GCCGTTCCGG GCATGGCCGG GCTGGGGGCG TCGCTCCACG TCTGGGGTTG 180 

GCTAATGCTC GGCAGCTGCC TCCTGGCCAG AGCCCAGCTG GATTCTGATG GCACCATTAC 240 

TATAGAGGAG CAGATTGTCC TTGTGCTGAA .AGCGAAAGTA CAATGTGAAC TCAACATCAC 300 

AGCTCAACTC CAGGAGGGAG AAGGTAATTG TTTCCCTGAA TGGGATGGAC TCATTTGTTG 360 

GCCCAGAGGA ACAGTGGGGA AAATAT CGGC TGTTCCATGC CCTCCTTATA TTTATGACTT 42 0 

CAACCATAAA GGAGTTGCTT TCCGACACTG TAACCCCAAT GGAACATGGG ATTTTATGCA 480 

CAGCTTAAAT AAAACATGGG CCAATTATTC AGACTGCCTT CGCTTTCTGC AGCCAGATAT 540 

CAGCATAGGA AAGCAAGAAT TCTTTGAACG CCTCTATGTA ATGTATACCG TTGGCTACTC 60 0 

CATCTCTTTT GGTTCCTTGG CTGTGGCTAT TCTCATCATT GGTTACTTCA GACGATTGCA 660 

TTGCACTAGG AACTATATCC ACATGCACTT ATTTGTGTCT TTCATGCTGA GAGCTACAAG 720 

CATCTTTGTC AAAGACAGAG TAGT CCATGC TCACATAGGA GTAAAGGAGC TGGAGTCCCT 780 

AATAATGCAG GATGACCCAC AAAATT CC AT TGAGGCAACT TCTGTGGACA AATCACAATA 84 0 

TATCGGGTGC AAGATTGCTG TTGTGATGTT TATTTACTTC CTGGCTACAA ATTATTATTG 900 

GATCCTGGTG GAAGGTCTCT ACCTGCATAA TCTCATCTTT GTGGCTTTCT TTTCGGACAC 960 

CAAATACCTG TGGGGCTTCA TCTTGATAGG CTGGGGGTTT CCAGCAGCAT TTGTTGCAGC 102 0 

ATGGGCTGTG GCACGAGCAA CTCTGGCTGA TGCGAGGTGC TGGGAACTTA GTGCTGGAGA 10 80 

CATCAAGTGG ATTTATCAAG CACCGATCTT AGCAGCTATT GGGCTGAATT TTATTCTGTT 1140 

TCTGAATACG GTTAGAGTTC TAGCTACCAA AAT CTGGGAG ACCAATGCAG TTGGGCATGA 12 0 0 

CACAAGGAAG CAATACAGGA AACTGGCCAA AT CGACACTG GTCCTGGTCC TAGTCTTTGG 1260 

AGTGCATTAC ATCGTGTTCG TATGCCTGCC TCACTCCTTC ACTGGGCTCG GGTGGGAGAT 1320 

CCGCATGCAC TGTGAGCTCT TCTTCAACTC CTTTCAGGGT TTCTTTGTGT CTATCATCTA 13 80 

CTGCTACTGC AATGGAGAGG TTCAGGCAGA GGTGAAGAAG ATGTGGAGTC GGTGGAATCT 1440 

CTCCGTGGAC TGGAAAAGGA CACCGCCATG TGGCAGCCGC AGATGCGGCT CAGTGCTCAC 1500 

CACCGTGACG CACAGCACCA GCAGCCAGTC ACAGGTGGCG GCCAGCACAC GCATGGTGCT 1560 

TATCTCTGGC AAAGCTGCCA AGATCGCCAG CAGACAGCCT GACAGCCACA TCACTTTACC 162 0 

TGGCTATGTC TGGAGTAACT CAGAGCAGGA CTGCCTGCCA CACTCTTTCC ACGAGGAGAC 168 0 

CAAGGAAGAT AGTGGGAGGC AGGGAGATGA TATT CTAATG GAGAAGCCTT CCAGGCCTAT 1740 

GGAATCTAAC CCAGACACTG AAGGATGCCA AGGAGAAACT GAGGATGTTC TCTGAATGGA 1800 

CATTTGTGGC TGACTTTCAT GGGCTGGTCC AATGGCTGGT TGTGTGAGAG GGCTTGGCTG 18 60 

ATACTCCTAT GCTTGAGTTC AAAGGCTGAA AATTCAGTTA AGGTGTTACT TAATAATAGT 1920 

TTTTAGGCTC CATGAATTGG CTCCTGTAAA TACTAACGAC ATGAAAATGC AAGTGTCAAT 1980 

GGAGTAGTTT ATTACCTTCT ATTGGCATCA AGTTTTCCTC TAAATTAATG TATGGTATTT 2 040 

GCTCTGTGAT TGTTCATTTT TTTCTGCTAC TTTTGGGTAG AAAAAAGATT CAATTGCTTG 210 0 

GCTGTAGCTT TCTCTCATAT ATATCACCCT AAATATAATG AAGATCTTTT AGTGTGTATC 2160 

AT TTTCCTTT TAGAAACTAG TATTCTCTTA TTTCTTACTT TAATGTACTT CTATCACTGC 2220 

ATTTATTTTG CCTGTGCATA GGAGCAATTA GGATCTAAAA AAATAT ATGG GAAGATAAAA 228 0 

GAT CTAAGAA CAAGTACTTG CTGG AAAATT AGTTGGCTGG ACATTGATAA AATAATGCAT 2340 

TTATAACAAT TACATGTGTT TTTGGGAACA AGGAAAATTT CTCAAAAAAG AATATTTCAC 2400 

ACATCCCTTC TTTTGAATGG CCTCTTTGTG ACCAGCCAGA CCTCAGGTCT TCACTCTTTC 2460 

TTCTTTGTAA ACCATGTCAT GTGGAAAGAT TTCCTCAGTT AGTGAGCTTG TGTCTGCAAA 2520 

TTGATTTTGT TTGTAATGTA TTTTGATAGC AAATCATGCT GCATCTATAT CTTTTTCTTG 258 0 

TTTGAGCTGT TACTACATTG TACATGGCAT GTGGGATCAA TTAAAAATTT GTTTTAAAAA 2 640 
T 

Seq ID NO: 663 Protein sequence 
Protein Accession #: NP_005039 

1 11 21 31 41 51 

I 1 I I I 1 

MAGLGASLHV WGWLMLGSCL IARAQLDSDG TITIEEQIVL VLKAKVQCEL NITAQLQEGE 60 

GNCFPEWDGIi ICWPRGTVGK ISAVPCPPYI YD FNHKG VAF RHCNPNGTWD FMHS LNKTWA 120 

NYSDCLRFLQ PDISIGKQEF FERLYVMYTV GYSISFGSLA VAIL I IGYFR RLHCTRNYIH 180 

MHLFVSFMLR ATSIFVKDRV VHAHIGVKEL ESLIMQDDPQ NSIEATSVDK SQYIGCKIAV 240 
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VMFIYFLATN YYWILVEGLY 
LADARCWELS AGDIKWIYOA 
LAKSTLVLVL VFGVHYIVFV 
QAEVKKMWSR WNLSVDWKRT 
IASRQPDSHI TLPGYVWSNS 
GCQGETEDVL 



PCT/US02/12476 



LHNLIFVAFF SDTKYLWGFI LIGWGFPAAF VAAWAVARAT 
PILAAIGLNF ILFLNTVRVL ATKIWETNAV GHDTRKQYRK 
CLPHSFTGLG WEIRMHCELF FNSFQGFFVS IIYCYCNGEV 
PPCGSRRCGS VLTTVTHSTS SQSQVAASTR MVLISGKAAK 
EQDCLPHSFH EETKEDSGRQ GDDILMEKPS RPMESNPDTE 



300 
360 
420 
480 
540 



Seq ID NO: 664 DNA sequence 
Nucleic Acid Accession NM_012152 
Coding sequence: 4 3.. 1104 



CTTCTTTAAA 
GACAAGCACA 
GGAACAAAGC 
TCTAATTCTC 
TACCTGTTGG 
ATGTTTAACA 
GGGCTTCTGG 
AGGCACATGT 
CTGCTCATTT 
TGGAATTGCC 
TACCTTGTTT 
CTGCGGATCT 
TCCATCAGCC 
GCGTTTGTGG 
AGGCAGTGTG 
GTCGTGAACC 
ATGATCTGCT 
GTCCTCAGCA 
GTCTGCAATA 
GTCTTAGG 



11 

1 

TTTCTTTCTA 
TGGACTTTTT 
TTGTGATTGT 
TGGTCATCGC 
CTAATTTAGC 
CAGGCCCAGT 
ACAGTAGCTT 
C AAT CAT GAG 
TGCTTGTCTG 
TCTGCAACAT 
TCTGGACAGT 
ACGTGTACGT 
GCCGGAGGAC 
TATGCTGGAC 
GCGTGCAGCA 
CCATCATCTA 
GCTTCTCTCA 
GGAGTGACAC 
AAAGCACTTC 



21 
I 

GGATGTT CAC 
TTATAATAGG 
TTTGTGTGTT 
GGCAGTGATC 
TGCTGCCGAT 
TTCAAAAACT 
GACTGCTTCC 
GATGCGGGTC 
GGCCATCGCC 
CTCTGCCTGC 
GTCCAACCTC 
CAAGAGGAAA 
ACCCATGAAG 
CCCGGGCCTG 
TGTGAAAAGG 
CTCCTACAAG 
GGAGAACCCA 
AGGCAGCCAG 
CTAAACTCTG 



31 

s 

TTCTTCTCCA 
AGCAACACTG 
GGGACGTTTT 
AAAAACAGAA 
TTCTTCGCTG 
TTGACTGTCA 
CTCACCAACT 
CATAGCAACC 
ATTTTTATGG 
TCTTCCCTGG 
ATGGCCTTCC 
ACCAACGTCT 
CTAATGAAGA 
GTGGTTCTGC 
TGGTTCCTGC 
GACGAGGACA 
GAGAGGCGTC 
TACATAGAGG 
GATGCCTCTC 



41 
I 

CAATGAATGA 
ATACTGTCGA 
TCTGCCTGTT 
AATTTCATTT 
GAATTGCCTA 
ACCGCTGGTT 
TGCTGGTTAT 
TGACCAAAAA 
GGGCGGTCCC 
CCCCCATTTA 
TCATCATGGT 
TGTCTCCGCA 
CGGTGATGAC 
TCCTCGACGG 
TGCTGGCGCT 
TGTATGGCAC 
CCTCTCGCAT 
ATAGTATTAG 
GGCCCACCCA 



51 
I 

GTGTCACTAT 
TGACTGGACA 
TATTTTTTTT 
CCCCTTCTAC 
TGTATTCCTG 
TCTCCGTCAG 
CGCCGTGGAG 
GAGGGTGACA 
CACACTGGGC 
CAGCAGGAGT 
TGTGGTGTAC 
TACAAGTGGG 
TGTCTTAGGG 
CCTGAACTGC 
GCTCAACTCC 
CATGAAGAAG 
CCCCTCCACA 
CCAAGGTGCA 
GGTGATGACT 



Seq ID NO: 665 Protein sequence 
Protein Accession #: NP_036284 



MNECHYDKHM 
FHFPFYYLLA 
LVIAVERHMS 
PIYSRSYLVF 
VMTVLGAFW 
YGTMKKMICC 



11 

I ' 

DFFYNRSNTD 
NLAAADFFAG 
IMRMRVHSNL 
WTVSNLMAFL 
CWTPGLWLL 
FSQENPERRP 



21 
I 

TVDDWTGTKL 
IAYVFLMFNT 
TKKRVTLLIL 
IMWVYLRIY 
LDGLNCRQCG 
SRIPSTVLSR 



31 
I 

VIVLCVGTFF 
GPVSKTLTVN 
LVWAIAIFMG 
VYVKRKTNVL 
VQHVKRWFLL 
SDTGSQYIED 



41 

1 

CLFIFFSNSL 
RWFLRQGLLD 
AVPTLGWNCL 
SPHTSGSISR 
LALLNSWNP 
SISQGAVCNK 



Seq ID NO: 666 DNA sequence 
Nucleic Acid Accession #: NM_002821 
Coding sequence: 150.. 3362 



AACTCCCGCC 
GCGCTCCGGT 
CCTCAGCTCC 
CCCGCCGGTT 
CCATTGTCTT 
TTCGCTGTGA 
CTGTCCAGGA 
ACCGGCTGCA 
AAGCCCGCAG 
TGAAGCATCC 
ACATTGATGG 
ATGGTCAGAG 
GTCCTGAGCA 
GCAGCCAGAA 
CCCAGGACGT 
AGCCACCCCC 
GCCCCCCACA 
TCCGGCCACG 
TCATCCTGGA 
GGGTGTTTAC 
AGCCCAGCGT 
AGAAGGGCCA 
GCCACGCGGC 
TGCCCTCCTG 
TGGATTGCCT 
T CATCT CAGA 
TGGAGGTGTA 
AGGCGCAAGC 
AGCAGTGCAT 
AG CCCACTAT 
ACGCTGGGAC 
TTGCCTCCAA 
TTATCACCTT 
TGCAGTGCGA 
TCCTGGACCC 



11 

I 

TCGGGACGCC 
GCGTCCGCCT 
TTTTCCTGAG 
GCCTCTGCTC 
CAT CAAGCAG 
GGTTGAGGCT 
CACGGAGCGG 
GGACTCTGGC 
TGCCAACGCC 
AGCCTCGGAA 
GCACCCTCGG 
CAACCACACA 
TAGTGGGCTG 
CTTCACCTTG 
GGTAGTAGCG 
GAGCCTGCAG 
CCTCCGCAGA 
CAATGCAGGG 
AGCCACACTT 
AGCTGGCAGC 
GTGGTGGGAG 
CGAGCTGGTG 
CAACCTGGCT 
GCTGAAGAAG 
GACCCAGGCC 
GGACTCACGG 
TGATGGGACA 
CCGTGTCCAA 
GGAGTTTGAC 
TAAGTGGGAA 
CCTGCATTTT 
CGGGCCGCAG 
CAAAGTGGAA 
GGCCCAGGGG 
CACCAAGCTG 



21 
I 

TCGGGGT CGG 
CCTGTGCCCG 
CCCGCCGCGA 
AGCGTCCTGC 
CCGTCCTCCC 
CCGGGCCCGG 
CGTTTCGCCC 
ACCTTCCAGT 
TCCTTCAACA 
GCTGAGATCC 
CCCACCTACC 
GTCAGCAGCA 
TATTCCTGCT 
AGCATTGCTG 
AGGTATGAGG 
TGGCTCTTTG 
GCCACAGTGT 
ATCTACCGCT 
CACCTAGCAG 
GAGGAGCGTG 
CACGCGGGAG 
TTGGCCAATA 
GGTCAGCGGA 
CCCCAAGACA 
ACACCAAAAC 
TTCGAGGTCT 
TGGTACCGTT 
GTGCTGGAAA 
AAGGAGGCCA 
CGGGCAGATG 
GCCCGGGTGA 
GGCCAGATTC 
CCAGAGCGTA 
GACCCCAAGC 
GGACCCAGGA 



31 
I 

GCTCCGGCTG 
CCGCGGAGCA 
TGGGAGCTGC 
TGCTGCCGCT 
AGGATGCACT 
TACATGTGTA 
AGGGCAGCAG 
GTGTGGCTCG 
TCAAATGGAT 
AGCCACAGAC 
AATGGTTCCG 
AGGAGCGGAA 
GCGCCCACAG 
ATGAAAGCTT 
AGGCCATGTT 
AGGATGAGAC 
TTGCCAACGG 
GCATTGGCCA 
AGATTGAAGA 
TGACCTGCCT 
TCCGGCTGCC 
TTGCTGAAAG 
GACAGGATGT 
GCCAGCTGGA 
CTACAGTTGT 
TCAAGAATGG 
GTATGAGCAG 
AGCTCAAGTT 
CGGTGCCCTG 
GGAGCAGCCT 
CTCGAGATGA 
GTGCCCATGT 
CGACTGTGTA 
CGCTGATTCA 
TGCACATCTT 



41 
1 

CGGCTGCTGC 
GTCTGCGGCC 
GCGGGGATCC 
GCTGGGCGGT 
GCAGGGGCGC 
CTGGCTGCTC 
CCTGAGCTTT 
GGATGATGTC 
TGAGGCAGGT 
CCAGGTCACA 
AGATGGGACC 
CCTGACGCTC 
TGCTTTTGGC 
TGCCAGGGTG 
CCATTGCCAG 
TCCCATCACT 
GTCTCTGCTG 
GGGGCAGAGG 
CATGCCGCTA 
TCCCCCCAAG 
CACCCATGGC 
TGATGCTGGT 
CAACATCACT 
GGAGGGCAAA 
CTGGTACAGA 
GACCTTGCGC 
CACCCCAGCC 
CACACCACCA 
TTCAGCCACA 
CCCAGAGTGG 
CGCTGGCAAC 
CCAGCTCACT 
CCAGGGCCAC 
GTGGAAAGGC 
CCAGAATGGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



51 
I 

VIAAVIKNRK 
SSLTASLTNL 
CNISACSSLA 
RRTPMKLMKT 
IIYSYKDEDM 
STS 



51 
I 

TGCGGCGCCC 
CGCCGTGCGC 
CCGGCCAGAC 
ACCCAGACAG 
CGGGCGCTGC 
GATGGGGCCC 
GCAGCTGTGG 
AC TGGAGAAG 
CCTGTGGTCC 
CTTCGTTGCC 
CCCCTTTCTG 
CGGCCAGCTG 
CAGGCTTGCA 
GTGCTGGCAC 
TTCTCAGCCC 
AACCGCAGTC 
CTGACCCAGG 
GGCCCACCCA 
TTTGAGCCAC 
GGTCTGCCAG 
AGGGTCTACC 
GTCTACACCT 
GTGGCCACTG 
CCCGGCTACT 
AACCAGATGC 
ATCAACAGCG 
GGCAGCATCG 
CCCCAGCCAC 
GGCCGAGAGA 
GTGACAGACA 
TACACTTGCA 
GTGGCAGTTT 
ACAGCCCTAC 
AAGGACCGCA 
TCCCTGGTGA 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 



440 



WO 02/086443 

TCCATGACGT GGCCCCTGAG GACTCAGGCC GCTACACCTG CATTGCAGGC AACAGCTGCA 2160 

ACATCAAGCA CACGGAGGCC CCCCTCTATG TCGTGGACAA GCCTGTGCCG GAGGAGTCGG 222 0 

AGGGCCCTGG CAGCCCTCCC CCCTACAAGA TG AT CCAGAC CATTGGGTTG TCGGTGGGTG 2280 

CCGCTGTGGC CTACATCATT GCCGTGCTGG GCCTCATGTT CTACTGCAAG AAGCGCTGCA 2340 

5 AAGCCAAGCG GCTGCAGAAG CAGCCCGAGG GCGAGGAGCC AGAGATGGAA TGCCTCAACG 2400 

GAGGGCCTTT GCAGAACGGG CAGCCCTCAG CAGAGATCCA AGAAGAAGTG GCCTTGACCA 2460 

GCTTGGGCTC CGGCCCCGCG GCCACCAACA AACGCCACAG CACAAGTGAT AAGATGCACT 2 52 0 

TCCCACGGTC TAGCCTGCAG CCCATCACCA CGCTGGGGAA GAGTGAGTTT GGGGAGGTGT 2580 

TCCTGGCAAA GGCTCAGGGC TTGGAGGAGG GAGTGGCAGA GACCCTGGTA CTTGTGAAGA 2 640 

10 GCCTGCAGAC GAAGGATGAG CAGCAGCAGC TGGACTTCCG GAGGGAGTTG GAGATGTTTG 2700 

GGAAGCTGAA CCACGCCAAC GTGGTGCGGC TCCTGGGGCT GTGCCGGGAG GCTGAGCCCC 2760 

ACTACATGGT GCTGGAATAT GTGGATCTGG GAGACCTCAA GCAGTTCCTG AGGATTTCCA 2820 

AGAGCAAGGA TGAAAAATTG AAGTCACAGC CCCTCAGCAC CAAGCAGAAG GTGGCCCTAT 2880 

GCACCCAGGT AGCCCTGGGC ATGGAGCACC TGTCCAACAA CCGCTTTGTG CATAAGGACT 2 940 

15 TGGCTGCGCG TAACTGCCTG GTCAGTGCCC AGAGACAAGT GAAGGTGTCT GCCCTGGGCC 3000 

TCAGCAAGGA TGTGTACAAC AGTGAGTACT ACCACTTCCG CCAGGCCTGG GTGCCGCTGC 3060 

GCTGGATGTC CCCCGAGGCC AT CCTGGAGG GTGACTTCTC TACCAAGTCT GATGTCTGGG 3120 

CCTTCGGTGT GCTGATGTGG GAAGTGTTTA CACATGGAGA GATGCCCCAT GGTGGGCAGG 3180 

CAGATGATGA AGTACTGGCA GATTTGCAGG CTGGGAAGGC TAGACTTCCT CAGCCCGAGG 3240 

20 GCTGCCCTTC CAAACTCTAT CGGCTGATGC AGCGCTGCTG GGCCCTCAGC CCCAAGGACC 3300 

GGCCCTCCTT CAGTGAGATT GCCAGCGCCC TGGGAGACAG CACCGTGGAC AGCAAGCCGT 3360 

GAGGAGGGAG CCCGCTCAGG ATGGCCTGGG CAGGGGAGGA CATCTCTAGA GGGAAGCTCA 3420 

CAGCATGATG GGCAAGATCC CTGTCCTCCT GGGCCCTGAG GTGCCCTAGT GCAACAGGCA 3480 

TTGCTGAGGT CTGAGCAGGG CCTGGCCTTT CCTCCTCTTC CTCACCCTCA TCCTTTGGGA 3 540 

25 GGCTGACTTG GACCCAAACT GGGCGACTAG GGCTTTGAGC TGGGCAGTTT CCCCTGCCAC 3600 

CTCTTCCTCT AT CAGGGACA GTGTGGGTGC CACAGGTAAC CCCAATTTCT GGCCTTCAAC 3660 

TTCTCCCCTT GACCGGGTCC AACTCTGCCA CTCATCTGCC AACTTTGCCT GGGGAGGGCT 3720 

AGGCTTGGGA TGAGCTGGGT TTGTGGGGAG TTCCTTAATA TTCTCAAGTT CTGGGCACAC 37 80 

AGGGTTAATG AGTCTCTTGC CCACTGGTCC ACTTGGGGGT CT AG AC CAGG ATTATAGAGG 3 840 

30 ACACAGCAAG TGAGTCCTCC CCACTCTGGG CTTGTGCACA CTGACCCAGA CCCACGTCTT 3 900 

CCCCACCCTT CTCTCCTTTC CT CATCCTAA GTGCCTGGCA GATGAAGGAG TTTTCAGGAG 3 960 

CTTTTGACAC TATATAAACC GCCCTTTTTG TATGCACCAC GGGCGGCTTT TATATGTAAT 4020 

TGCAGCGTGG GGTGGGTGGG CATGGGAGGT AGGGGTGGGC CCTGGAGATG AGGAGGGTGG 4080 

GCCATCCTTA CCCCACACTT TTATTGTTGT CGTTTTTTGT TTGTTTTGTT TTTTTGTTTT 4140 

35 TGTTTTTGTT TTTACACTCG CTGCTCTCAA TAAATAAGCC TTTTTTA 



PCT/US02/12476 



Seq ID NO: 667 Protein sequence 
Protein Accession #: NP 002812 
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MGAARGSPAR 
VHVYWLLDGA 
IKWIEAGPW 
KERNLTLRPA 
EAMPHCQPSA 
CIGQGQRGPP 
VRLPTHGRVY 
SQLEEGKPGY 
CMSSTPAGSI 
GSSLPEWVTD 
TTVYQGHTAL 
RYTCIAGNSC 
GLMFYCKKRC 
KRHSTSDKMH 
LDFRRELEMF 
PLSTKQKVAL 
YHFRQAWVPL 
AGKARIiPQPE 



11 

I 

PRRLPLLSVL 
PVQDTERRFA 
LKHPASEAEI 
GPEHSGLYSC 
QPPPSLQWLF 
IILEATLHLA 
QKGHELVLAN 
LDCLTQATPK 
EAQARVQVLE 
NAGTLHFARV 
LQCEAQGDPK 
NIKHTEAPLY 
KAKRLQKQPE 
FPRSSLQPIT 
GKLNHANWR 
CTQVALGMEH 
RWMSPEAILE 
GCPSKLYRLM 



21 
I 

LLPLLGGTQT 
QGSSLSFAAV 
QPQTQVTLRC 
CAHSAFGQAC 
EDETPITNRS 
EIEDMPLFEP 
IAESDAGVYT 
PTWWYRNQM 
KLKFTPPPQP 
TRDDAGNYTC 
PLIQWKGKDR 
WDKPVPEES 
GEEPEMECLN 
TLGKSEFGEV 
LLGLCREAEP 
LSNNRFVHKD 
GDFSTKSDVW 
QRCWALSPKD 



31 
I 

AIVFIKQPSS 
DRLQDSGTFQ 
HIDGHPRPTY 
SSQNFTLSIA 
RPPHLRRATV 
RVFTAGSEER 
CHAANLAGQR 
LISEDSRFEV 
QQCMEFDKEA 
IASNGPQGQI 
ILDPTKLGPR 
EGPGSPPPYK 
GGPLQNGQPS 
FLAKAQGLEE 
HYMVLEYVDL 
L AARNCLV S A 
AFGVLMWEVF 
RPSFSEIASA 



41 
I 

QDALQGRRAL 
CVARDDVTGE 
QWFRDGTPLS 
DESFARWLA 
FANGShhLTQ 
VTCLPPKGLP 
RQDVNITVAT 
FKNGTLRINS 
TVPCSATGRE 
RAHVQLTVAV 
MHIFQNGSLV 
M1QTXGLSVG 
AEIQEEVALT 
GVAETLVLVK 
GDLKQFLRIS 
QRQVKVSALG 
THGEMPHGGQ 
LGDSTVDSKP 



Seq ID NO: 668 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1389 



ATGGGCTACC 
ACCCTTGTTT 
GTTGTCAACT 
GGGTTTCCTT 
GTTTTATTGA 
AAAACTTTCG 
ATAGCAATGA 
ATCCCAGGAG 
ACAGTTACCT 
TCCCTCATCT 
TCACTGGGTC 
ATTCAAGCGG 
TACAGTTCTC 
GTGATTTCTG 
TTCACCCAAG 
AGATTTTGTT 
GAGGTAATTG 
ACAGTGATGG 
GTT CTAGAAC 
TGTTATCTGA 
ATGCTTCCCA 



11 

I 

AGAGGCAGGA 
CTGAACATGA 
CGATTATAGG 
TGGGAATATT 
TAAAAGGAGG 
GCTTTCCAGG 
TAAGTTACAA 
TTGATCCTGA 
TTACTCTGCC 
CTACAGGTTT 
CACACATACC 
TCGGGGTTAT 
TAGAAGAACC 
TATTTATCTG 
GGGACTTATT 
ATGGTGTCAC 
CCAATGTGTT 
TCATCACTGT 
T CAATGGTGT 
AACTGTCTGA 
TTGGTGCTGT 



21 
I 

GCCTGTCATC 
GTATAAAGAG 
ATCTGGTATA 
GCTTTTATTC 
GGCCCTCTCT 
GTATCTGCTC 
TATAATAGCT 
AAACGTGTTT 
TTTATCCTTG 
AACAACT CTG 
AAAAACAGAA 
GTCTTTTGCA 
CACAGTAGCT 
TATATTCTTT 
TGAAAATTAC 
TGTCATTTTG 
TTTTGGTGGG 
AGCCACGCTT 
GCTCTGTGCA 
AGAACCAAGG 
GGTGATGGTT 



31 
! 

CCGCCGCAGA 
AAAACCTGTC 
ATAGGATTGC 
TGGGTTTCAT 
GGAACAGATA 
CTCTCTGTTC 
GGAGATACTT 
AT TGGTCGCC 
TACCGAAATA 
ATTCTTGGAA 
GACGCTTGGG 
TTTATTTGCC 
AAGTGGTCCC 
GCTACATGTG 
TGCAGAAATG 
ACATACCCTA 
AATCTTTCAT 
GTGTCATTGC 
ACTCCCCTCA 
ACACACTCCG 
TTTGGATTCG 



41 
I 

GAGATTTAGA 
AGTCTGCTGC 
CTTATTCAAT 
ATGTTACGGA 
CCTACCAGTC 
TTCAGTTTTT 
TGAGCAAAGT 
ACTTCATTAT 
TAGCAAAGCT 
TTGTAATGGC 
TATTTGCAAA 
ACCATAACTC 
GCCTTATCCA 
GATACTTGAC 
ATGACCTGGT 
TGGAATGCTT 
CGGTTTTCCA 
TGATTGATTG 
TTTTTATCAT 
ATAAGATTAT 
TCATGGCTAT 



51 

I 

LRCEVEAPGP 
EAR S ANAS FN 
DGQSNHTVSS 
PQDWVARYE 
VRPRNAGIYR 
EPSVWWEHAG 
VPSWLKKPQD 
VEVYDGTWYR 
KPTIKWERAD 
FITFKVEPER 
IHDVAPEDSG 
AAVAYI IAVLt 
SLGSGPAATN 
SLQTKDEQQQ 
KSKDEKLKSQ 
LSKDVYNSEY 
ADDEVLADLQ 



51 
I 

TGACAGAGAA 
TCTTTTTAAT 
GAAGCAAGCT 
CTTTTCCCTT 
TTTGGTCAAT 
GTATCCTTTT 
TTTTCAAAGA 
TGGACTTTCC 
TGGAAAGGTC 
AAGGGCAATT 
GCCCAATGCC 
CTTCTTAGTT 
TATGTCCATC 
ATTTACTGGC 
AACATTTGGA 
TGTGACAAGA 
CATTGTTGTA 
CCTCGGGATA 
TCCATCAGCC 
GTCTTGTGTC 
TACAAATACT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
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WO 02/086443 

CAAGACTGCA CCCATGGGCA GGAAATGTTC TACTGCTTTC CTGACAATTT CTCTCTCACA 
AATACCTCAG AGTCTCATGT TCAGCAGACA ACACAACTTT CTACTTTAAA TATTAGTATC 
TTTCAATGA 

Seq ID WO: 669 Protein sequence 
Protein Accession #: Eos sequence 



1320 
1380 



PCT/US02/12476 



MGYQRQEPVI 
GFPLGILLLF 
IAMISYNIIA 
SLISTGLTTL 
YSSLEEPTVA 
RFCYGVTVIL 
VLELNGVLCA 
QDCTHGQEMF 



11 
I 

PPQRDLDDRE 
WVSYVTDFSL 
GDTLSKVFQR 
ILGIVMARAI 
KWSRLIHMSI 
TYPMECFVTR 
TPLIFIIPSA 
YCFPDNFSLT 



21 

1 

TLVSEHEYKE 
VLLIKGGALS 
IPGVDPENVF 
SLGPHIPKTE 
VISVFICIFF 
EVIANVFFGG 
CYLKLSEEPR 
NTSESHVQQT 



31 
I 

KTCQSAALFN 
GTDTYQSLVN 
IGRHFIIGLS 
DAWVFAKPNA 
ATCGYLTFTG 
NLSSVFHIW 
THSDKIMSCV 
TQLSTLNISI 



Seq ID NO: 670 DNA sequence 
20 Nucleic Acid Accession #: Eos sequence 
Coding sequence: 1..12 84 



ATGGGCTACC 
AAGCAAGCTG 
TTTTCCCTTG 
TTGGTCAATA 
TATCCTTTTA 
TTTCAAAGAA 
GGACTTTCCA 
GGAAAGGTCT 
AGGGCAATTT 
CCCAATGCCA 
TTCTTAGTTT 
ATGTCCATCG 
TTTACTGGCT 
ACATTTGGAA 
GTGACAAGAG 
ATTGTTGTAA 
CTCGGGATAG 
CCATCAGCCT 
TCTTGTGTCA 
ACAAATACTC 
TCTCTCACAA 
ATTAGTATCT 



11 
I 

AGAGGCAGGA 
GGTTTCCTTT 
TTTTATTGAT 
AAACTTTCGG 
TAGCAATGAT 
TCCCAGGAGT 
CAGTTACCTT 
CCCTCATCTC 
CACTGGGTCC 
TTCAAGCGGT 
ACAGTTCTCT 
TGATTTCTGT 
TCACCCAAGG 
GATTTTGTTA 
AGGTAATTGC 
CAGTGATGGT 
TTCTAGAACT 
GTTATCTGAA 
TGCTTCCCAT 
AAGACTGCAC 
ATACCTCAGA 
TTCAACTCGA 



21 
1 

GCCTGTCATC 
GGGAATATTG 
AAAAGGAGGG 
CTTTCCAGGG 
AAGTTACAAT 
TGAT CCTGAA 
TACTCTGCCT 
TACAGGTTTA 
ACACATACCA 
CGGGGTTATG 
AGAAGAACCC 
ATTTATCTGT 
GGACTTATTT 
TGGTGTCACT 
CAATGTGTTT 
CATCACTGTA 
CAATGGTGTG 
ACTGTCTGAA 
TGGTGCTGTG 
CCATGGGCAG 
GTCTCATGTT 
GTAA 



31 
I 

CCGCCGCAGA 
CTTTTATTCT 
GCCCTCTCTG 
TATCTGCTCC 
ATAATAGCTG 
AACGTGTTTA 
TTATCCTTGT 
ACAACTCTGA 
AAAACAGAAG 
TCTTTTGCAT 
ACAGTAGCTA 
ATATTCTTTG 
GAAAATTACT 
GTCATTTTGA 
TTTGGTGGGA 
GCCACGCTTG 
CTCTGTGCAA 
GAACCAAGGA 
GTGATGGTTT 
GAAATGTTCT 
CAGCAGACAA 



41 
I 

WNSIIGSGI 
KTFGFPGYLL 
TVTFTLPLSIj 
IQAVGVMSFA 
FTQGDLFENY 
TVMVITVATL 
MLPIGAWMV 
FQ 



41 

1 

GAGGATTGCC 
GGGTTTCATA 
GAACAGATAC 
TCTCTGTTCT 
GAGATACTTT 
TTGGTCGCCA 
ACCGAAATAT 
TTCTTGGAAT 
ACGCTTGGGT 
TTATTTGCCA 
AGTGGTCCCG 
CTACATGTGG 
GCAGAAATGA 
CATACCCTAT 
ATCTTTCATC 
TGTCATTGCT 
CTCCCCTCAT 
CACACTCCGA 
TTGGATTCGT 
ACTGCTTTCC 
CACAACTTTC 



51 
I 

IGI/PYSMKQA 
LSVLQFLYPF 
YRN I AKLGKV 
FICHHNSFLV 
CRNDDLVTFG 
VSLLIDCLGI 
FGFVMAITNT 



51 
I 

TTATTCAATG 
TGTTACAGAC 
CTACCAGTCT 
TCAGTTTTTG 
GAGCAAAGTT 
CTT CATTATT 
AGCAAAGCTT 
TGTAATGGCA 
ATTTGCAAAG 
CCATAACTCC 
CCTTATCCAT 
AT ACT TGAC A 
TGACCTGGTA 
GGAATGCTTT 
GGTTTTCCAC 
GATTGATTGC 
TTTTATCATT 
TAAGATTATG 
CATGGCTATT 
TGACAATTTC 
TACTTTAAAT 



Seq ID NO: 671 Protein sequence 
Protein Accession #: Eos sequence 



MGYQRQEPVI 
LVNKTFGFPG 
GLSTVTFTLP 
PNAIQAVGVM 
FTGFTQGDLF 
IWTVMVITV 
SCVMLPIGAV 
ISIFQLE 



11 

I 

PPQRGLPYSM 
YLLLSVLQFL 
LSLYRNIAKL 
SFAFICHHNS 
ENYCRNDDLV 
ATLVSLLIDC 
VMVFGFVMAI 



21 

I 

KQAGFPLGIL 
YPFIAMISYN 
GKVSLISTGL 
FLVYSSLEEP 
TFGRFCYGVT 
LGIVLELNGV 
TNTQDCTHGQ 



31 
I 

LLFWVSYVTD 
I I AGDTLSKV 
TTLILGIVMA 
TVAKWSRLIH 
VILTYPMECF 
LCATPLIFII 
EMFYCFPDNF 



41 

1 

FSLVLLIKGG 
FQRIPGVDPE 
RAISLGPHIP 
MSIVISVFIC 
VTREVIANVF 
PSACYLKLSE 
SLTNTSESHV 



51 
I 

ALSGTDTYQS 
NVFIGRHFII 
KTEDAWVFAK 
IFFATCGYLT 
FGGNLSSVFH 
EPRTHSDKIM 
QQTTQLSTLN 



Seq ID NO: 672 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1203 



ATGGGCTACC 
AAAGGAGGGG 
TTTCCAGGGT 
AGTTACAATA 
GATCCTGAAA 
ACTCTGCCTT 
ACAGGTTTAA 
CACATACCAA 
GGGGTTATGT 
GAAGAACCCA 
TTTATCTGTA 
GACTTATTTG 
GGTGTCACTG 
AATGTGTTTT 
ATCACTGTAG 
AATGGTGTGC 
CTGTCTGAAG 
GGTGCTGTGG 
CATGGGCAGG 
TCTCATGTTC 



11 
I 

AGAGGCAGGA 
CCCTCTCTGG 
ATCTGCTCCT 
TAATAGCTGG 
ACGTGTTTAT 
TATCCTTGTA 
CAACTCTGAT 
AAACAGAAGA 
CTTTTGCATT 
CAGTAGCTAA 
TATTCTTTGC 
AAAATTACTG 
TCATTTTGAC 
TTGGTGGGAA 
CCACGCTTGT 
TCTGTGCAAC 
AACCAAGGAC 
TGATGGTTTT 
AAATGTTCTA 
AGCAGACAAC 



21 
I 

GCCTGTCATC 
AACAGATACC 
CTCTGTTCTT 
AGATACTTTG 
TGGTCGCCAC 
CCGAAATATA 
TCTTGGAATT 
CGCTTGGGTA 
TATTTGCCAC 
GTGGTCCCGC 
TACATGTGGA 
CAGAAATGAT 
ATACCCTATG 
TCTTTCATCG 
GTCATTGCTG 
TCCCCTCATT 
ACACTCCGAT 
TGGATTCGTC 
CTGCTTTCCT 
ACAACTTTCT 



31 

i 

CCGCCGCAGT 
TACCAGTCTT 
CAGTTTTTGT 
AGCAAAGTTT 
TTCATTATTG 
GCAAAGCTTG 
GTAATGGCAA 
TTTGCAAAGC 
CATAACTCCT 
CTTATCCATA 
TACTTGACAT 
GACCTGGTAA 
GAATGCTTTG 
GTTTTCCACA 
ATTGATTGCC 
TTTATCATTC 
AAGATTATGT 
ATGGCTATTA 
GACAATTTCT 
ACTTTAAATA 



41 
I 

TTTCCCTTGT 
TGGTCAATAA 
ATCCTTTTAT 
TTCAAAGAAT 
GACTTTCCAC 
GAAAGGTCTC 
GGGCAATTTC 
CCAATGCCAT 
TCTTAGTTTA 
TGTCCATCGT 
TTACTGGCTT 
CATTTGGAAG 
TGACAAGAGA 
TTGTTGTAAC 
TCGGGATAGT 
CATCAGCCTG 
CTTGTGTCAT 
CAAATACTCA 
CTCTCACAAA 
TTAGTATCTT 



51 

I 

TTTATTGATA 
AACTTTCGGC 
AGCAATGATA 
CCCAGGAGTT 
AGTTACCTTT 
CCTCATCTCT 
ACTGGGTCCA 
TCAAGCGGTC 
CAGTTCTCTA 
GATTTCTGTA 
CACCCAAGGG 
ATTTTGTTAT 
GGTAATTGCC 
AGTGATGGTC 
TCTAGAACTC 
TTATCTGAAA 
GCTTCCCATT 
AGACTGCACC 
TACCTCAGAG 
TCAACTCGAG 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



60 
120 
180 
240 
300 
360 
420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



442 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



WO 02/086443 

TAA 

Seq ID NO: 673 Protein sequence 
Protein Accession : Eos sequence 



PCT/US02/12476 



MGYQRQEPVI 
SYNI I AGDTL 
TGLTTLILGI 
EEPTVAKWSR 
GVTVILTYPM 
NGVL CATPL I 
HGQEMFYCFP 



11 
I 

PPQFSLVLLI 
SKVFQRIPGV 
VMARAISLGP 
LIHMSIVISV 
ECFVTREVIA 
FIIPSACYLK 
DNFSLTNTSE 



21 

! 

KGGALSGTDT 
DPENVFIGRH 
HIPKTEDAWV 
FICIFFATCG 
NVFFGGNLSS 
LSEEPRTHSD 
SHVQQTTQLS 



31 

I 

YQSLVNKTFG 
FIIGLSTVTF 
FAKPNAIQAV 
YLTFTGFTQG 
VFHIWTVMV 
KIMSCVMLPI 
TLNISIFQLE 



41 
i 

FPGYLLLSVL 
TDPLSLYRMI 
GVMSFAFICH 
DLFENYCRND 
ITVATLVSLL 
GAWMVFGFV 



Seq ID NO: 674 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..1140 



11 



21 



31 



41 



51 

i 

QFLYPFIAMI 
AKLGKVSLIS 
HNSFLVYSSL 
DLVTFGRFCY 
IDCLGIVLEL 
MAI TNTQDCT 



51 



ATGGGCTACC 
CCAGGGTATC 
TACAATATAA 
CCTGAAAACG 
CTGCCTTTAT 
GGTTTAACAA 
ATACCAAAAA 
GTTATGTCTT 
GAACCCACAG 
ATCTGTATAT 
TTATTTGAAA 
GTCACTGTCA 
GTGTTTTTTG 
ACTGTAGCCA 
GGTGTGCTCT 
TCTGAAGAAC 
GCTGTGGTGA 
GGGCAGGAAA 
CATGTTCAGC 



AGAGGCAGGA 
TGCTCCTCTC 
TAGCTGGAGA 
TGTTTATTGG 
CCTTGTACCG 
CTCTGATTCT 
CAGAAGACGC 
TTGCATTTAT 
TAGCTAAGTG 
TCTTTGCTAC 
AT TACTGCAG 
TTTTGACATA 
GTGGGAATCT 
CGCTTGTGTC 
GTGCAACTCC 
CAAGGACACA 
TGGTTTTTGG 
TGTTCTACTG 
AGACAACACA 



GCCTGTCATC 
TGTTCTTCAG 
TACTTTGAGC 
TCGCCACTTC 
AAATATAGCA 
TGGAATTGTA 
TTGGGTATTT 
TTGCCACCAT 
GTCCCGCCTT 
ATGTGGATAC 
AAATGATGAC 
CCCTATGGAA 
TTCATCGGTT 
ATTGCTGATT 
CCTCATTTTT 
CTCCGATAAG 
ATTCGTCATG 
CTTTCCTGAC 
ACTTTCTACT 



CCGCCGCAGG 
TTTTTGTATC 
AAAGTTTTTC 
ATTATTGGAC 
AAGCTTGGAA 
ATGGCAAGGG 
GCAAAGCCCA 
AACTCCTTCT 
ATCCATATGT 
TTGACATTTA 
CTGGTAACAT 
TGCTTTGTGA 
TTCCACATTG 
GATTGCCTCG 
ATCATTCCAT 
ATTATGTCTT 
GCTATTACAA 
AATTTCTCTC 
TTAAATATTA 



TCAATAAAAC 
CTTTTATAGC 
AAAGAATCCC 
TTTCCACAGT 
AGGTCTCCCT 
CAATTT C ACT 
ATGCCATTCA 
TAGTTTACAG 
CCATCGTGAT 
CTGGCTTCAC 
TTGGAAGATT 
CAAGAGAGGT 
TTGTAACAGT 
GGATAGTTCT 
CAGCCTGTTA 
GTGTCATGCT 
ATACTCAAGA 
TCACAAATAC 
GTATCTTTCA 



TTTCGGCTTT 
AATGATAAGT 
AGGAGTTGAT 
TACCTTTACT 
CATCTCTACA 
GGGTCCACAC 
AGCGGTCGGG 
TTCTCTAGAA 
TTCTGTATTT 
CCAAGGGGAC 
TTGTTATGGT 
AATTGCCAAT 
GATGGT CATC 
AGAACTCAAT 
TCTGAAACTG 
TCCCATTGGT 
CTGCACCCAT 
CTCAGAGTCT 
ACTCGAGTAA 



Seq ID NO: 675 Protein sequence 
Protein Accession #: Eos sequence 



MGYQRQEPVI 
PENVFIGRHF 
IPKTEDAWVF 
ICIFFATCGY 
VFFGGNL S3V 
SEEPRTHSDK 
HVQQTTQLST 



11 
I 

PPQVNKTFGF 
IIGLSTVTFT 
AKPNAI QAVG 
LTFTGFTQGD 
FHIWTVMVI 
IMSCVMLPIG 
LNISIFQLE 



21 
I 

PGYLLLSVLQ 
LPLSLYRNIA 
VMSFAFI CHH 
LFENYCRNDD 
TVATLVSLLI 
AWMVFGFVM 



31 

FLYPFIAMIS 
KLGKVSLIST 
NSFLVYSSLE 
LVTFGRFCYG 
DCLGIVLELN 
AITNTQDCTH 



41 
I 

YNI I AGDTL S 
GLTTLILGIV 
EPTVAKWSRL 
VTVILTYPME 
GVIiCATPLIF 
GQEMFYCFPD 



51 
I 

KVFQRIPGVD 
MARAISLGPH 
IHMSIVISVF 
CFVTREVIAN 
IIPSACYLKIi 
NFSLTNTSES 



Seq ID NO: 676 DNA sequence 

Nucleic Acid Accession #; NM_ 006853.1 

Coding sequence: 26.. 874 



AGGAATCTGC 
ATCGGGCAGA 
CATGAGGATT 
CAGGAT CAT C 
CGAGAAGACG 
AGCCCACTGC 
GGAGGGCTGT 
CAGCCTCCCC 
CTCCATCACC 
CAGCTGCCTC 
CTTGCGATGC 
CAACATCACA 
GGGTGACTCC 
CCAGGATCCG 
GGACTGGATC 
ACCCTCCATT 
CAAGACCCTC 
AATCAACCTG 
G ACT CTGGGA 
TCCTGGCCAT 



11 

I 

GCTCGGGTTC 
GGTCTCACAG 
CTGCAGTTAA 
AAGGGGTTCG 
CGGCTACTCT 
CTCAAGCCCC 
GAGCAGACCC 
AACAAAGACC 
TGGGCTGTGC 
ATTTCCGGCT 
GCCAACATCA 
GACACCATGG 
GGGGGCCCTC 
TGTGCGATCA 
CAGGAGACGA 
TCCACTTGGT 
TACGAACATT 
GGGTTCGAAA 
ATGACAACAC 
ATATCAAGGT 



21 
I 

CGCAGATGCA 
CAGCCAAGGA 
TCCTG CTTGC 
AGTGCAAGCC 
GTGGGGCGAC 
GCTACATAGT 
GGACAGCCAC 
ACCGCAATGA 
GACCCCTCAC 
GGGGCAGCAC 
CCATCATTGA 
TGTGTGCCAG 
TGGTCTGTAA 
CCCGAAAGCC 
TGAAGAACAA 
GTTTGGTTCC 
CTTTGGGCCT 
TCAGTGAGAC 
CTGGTTTGTT 
TTCAATAAAT 



31 
I 

GAGGTTGAGG 
ACCTGGGGCC 
TCTGG CAACA 
TCACTCCCAG 
GCTCATCGCC 
TCACCTGGGG 
TGAGTCCTTC 
CATCATGCTG 
CCTCTCCTCA 
GTCCAGCCCC 
GCACCAGAAG 
CGTGCAGGAA 
CCAGTCTCTT 
TGGTGTCTAC 
TTAGACTGGA 
TGTTCACTCT 
CCTGGACTAC 
CTGGATTCAA 
CTCTGTTGTA 
ATTTGCTAAA 



41 
] 

TGGCTGCGGG 
CGCTCCTCCC 
GGGCTTGTAG 
CCCTGGCAGG 
CCCAGATGGC 
CAGCACAACC 
CCCCACCCCG 
GTGAAGATGG 
CGCTGTGTCA 
CAGTTACGCC 
TGTGAGAACG 
GGGGGCAAGG 
CAAGG CA T TA 
ACGAAAGTCT 
CCCACCCACC 
GTTAATAAGA 
AGGAGATGCT 
ATTCTGCCTT 
TCCCCAGCCC 
TGAGTG 



51 

i 

ACTGGAAGTC 
CCCTCCAGGC 
GGGGAGAGAC 
CAGCCCTGTT 
TCCTG AC AGC 
TCCAGAAGGA 
GCTTCAACAA 
CATCGCCAGT 
CTGCTGGCAC 
TGCCTCACAC 
CCTACCCCGG 
ACTCCTGCCA 
TCTCCTGGGG 
GCAAATATGT 
ACAGCCCATC 
AACCCTAAGC 
GTCACTTAAT 
GAAATATTGT 
CAAAGACAGC 



Seq ID NO: 677 Protein sequence 
Protein Accession #: NP_006844.1 



60 
120 
180 
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



60 
120 
180 
240 
300 
360 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



11 



21 



31 41 51 

1 1 I I I I 

MRILQLILLA LATGLVGGET RIIKGFECKP HSQPWQAALF EKTRLLCGAT LIAPRWLIjTA 



443 



WO 02/086443 

AHCLKPRYIV HLGQHNLQKE EGCEQTRTAT ESFPHPGFNN SLPNKDHRND IMLVKMASPV 
SITWAVRPLT LSSRCVTAGT SCLISGWGST SSPQLRLPHT LRCANITIIE HQKCENAYPG 
NITDTMVCAS VQEGGKDSCQ GDSGGPLVCN QSLQGIISWG QDPCAITRKP GVYTKVCKYV 
DWIQETMKNN 

Seq ID NO: 67 8 DNA sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1..933 



ATGTGCAGCA 
TTCGACAAGA 
TTCCCCTGTG 
GACTGTCCCG 
GCCCGCTACC 
AATAACTGTC 
GGGCAGGTGT 
ATCAT CGGCA 
CACCAGCGGA 
CTGCTGTCCC 
AATAATGGCA 
CCACCCTCCT 
CCGCCCTACT 
CGGTCCGGGA 
GACACCAGCC 
TCTGAGCCGA 



11 
I 

ATGGACGGTG 
GTGATGAGAA 
CCAGCGGCAT 
ATGGCAGCGA 
ACTGCAAGAA 
AAGACAACAG 
TTGTGACTTC 
GCTCCGTCAT 
AGCGGAACAA 
GCCTGGTGGT 
TCCAGTATGT 
ACTCCGAGGC 
CTTCTGACAC 
GTGCCAACAG 
ACAGCCCGGG 
GCCAGGGCAC 



21 
I 

CATCCCGGGC 
GGAGTGCCCC 
CCATTGCATC 
TGAAGAGAAC 
CGGCCTCTGT 
TGATGAGGAA 
AGAGAACCAA 
TTTTGTGCTG 
CCTCATGACG 
CCTGGACCAC 
GGCCAGCCAG 
CTTGCTGGAC 
GGAAT CTCTG 
TGCCAGCTCC 
GCAGCCTGGC 
TGAAGAAGTA 



31 
I 

GCCTGGCAGT 
AAGGCTAAGT 
ATTGGTCGCT 
TGCACAGCAA 
ATTGACAAGA 
AGCTGTGAAA 
CTTGTGTATT 
GTGGTGGCCC 
CTGCCCGTGC 
CCCCACCACT 
GCGGAGCAGA 
CAGAGGCCTG 
AACCAAGCCG 
CAGGCAGCCA 
CCCCAGGAGG 
TAA 



41 

i 

GTGACGGGCT 
CGAAATGTGG 
TCCGGTGCAA 
ACCCTCTGCT 
GCTTCATCTG 
GTTCTCAAGA 
ACCCCAGCAT 
TGCTGGCACT 
ACCGGCTGCA 
GCAACGT CAC 
ATGCGTCGGA 
CGTGGTATGA 
ACCTGCCCCC 
GCAGCCTCCT 
GCACTGCTGA 



51 

r 

GCCTGACTGC 
CCCGACCTTC 
TGGGTTTGAG 
TTGCTCCACC 
CGATGGACAG 
ACCCGGCAGT 
CACCTATGCC 
GGTCTTGCAC 
GCACCCTGTG 
CTACAACGTC 
AGTAGGCTCC 
CCTTCCTCCA 
CTACCGCTCC 
GAGCGTGGAA 
GCCCAGGGAC 



120 
180 
240 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



Seq ID NO: 679 Protein sequence 
Protein Accession #: Eos sequence 



l 

MCSNGRCIPG 
DCPDGSDEEN 
GQVFVTSENQ 
LLSRLWLDH 
PPYSSDTESL 
SEPSQGTEEV 



11 

I 

AWQCDGLPDC 
CTANPLLCST 
LVYYPSITYA 
PHHCNVTYNV 
NQADLPPYRS 



21 

I 

FDKSDEKECP 
ARYHCKNGLC 
IIGSSVIFVL 
NNGIQYVASQ 
RSGSANSASS 



31 

I 

KAKSKCGPTF 
IDKSFICDGQ 
WALDALVLH 
AEQNASEVGS 
QAASSLLSVE 



41 
I 

FPCASGIHCI 
NNCQDNSDEE 
HQRKRNNLMT 
PPSYSEALLD 
DTSHSPGQPG 



51 
! 

IGRFRCNGFE 
SCESSQEPGS 
LPVHRLQHPV 
QRPAWYDLPP 
PQEGTAEPRD 



60 
120 
180 
240 
300 



Seq ID NO: 680 DNA sequence 
Nucleic Acid Accession #: S78203.1 
Coding sequence: 1..2190 



ATGAATCCTT 
GAGGTACCAC 
AACTATCCAC 
TATGGAATGA 
ACCTCCACAT 
GCAGCCATTG 
TATGTGCTTG 
GTACACACAG 
AAACCCTGTG 
ACTAGATACT 
ATCACACCCA 
TTTGGAGTTC 
ATATACAATA 
TTTGCTATTT 
CTAGACTGGG 
AGGGTACTAT 
TCACGATGGA 
CCGGACCAGA 
TTTGTCATTT 
GCTGTTGGTA 
ATAAATGAAA 
CTGGCAGATG 
GAGTCCATCA 
AGCCAGGATT 
GTGCAGGAGA 
ATGATGGTAA 
AACACTTTGC 
GAAGACTATG 
TGTAGAACAG 
TATCTGTTTG 
ATTCCAGCCA 
GGGGAGGTCA 
ATGAAATCTG 
CTTGTTGTGG 
CTCCTGCTGG 
ACAGAGGATA 
AAACTAGAGA 



11 
I . 

TCCAGAAAAA 
CTCGACCACC 
TGAGCATTGC 
AAGCTGTGCT 
CTATATACCA 
CTGACTCGTG 
GCCATGTGAT 
TCCTATCATT 
TGGCAGCTTT 
TCTCAGTCTT 
TGCTGAGAGG 
CAGGACTGCT 
AACCACCCCC 
CCAATCGTTT 
CAGCTGAGAA 
TCCTTTATAT 
CTTTGCAAGC 
TGCAGGTTCT 
ATCGTCTGGT 
TGATCCTAGC 
TGGCCCCAGC 
ATGAGGTGAA 
AATCCTTTCA 
TTCACTTCCA 
AGAACTGGTA 
AGGATACAGA 
ATAAAGATGT 
GTGTGTCTGC 
AAGATAAGAA 
TTATTACTAA 
ACAAAATGTC 
TGTTCTCTGT 
TGCTCCAGGC 
CACAGTTCAG 
TGATCTGCCT 
TGCGGGGTCC 
CCAAGAAGAC 



21 
I 

TGAGTCCAAG 
TAGCCCTCCA 
CTTCATTGTG 
GAT CCTGT AT 
TGCCTTCAGC 
GTTGGGAAAA 
CAAGTCCTTG 
GATCGGCCTG 
TGGTGGAGAC 
CTACCTGTCC 
AGATGTGCAA 
CATGGTAATT 
TGAAGGAAAC 
CAAGAACCGT 
AT AT CCAAAG 
CCCATTGCCC 
CAT CAGGATG 
AAATCCCTTT 
CTCCAAGTGT 
GTGCCTGGCA 
CCAGTCAGGT 
GGTGACAGTG 
GAAAACACCA 
CCTGAAATAT 
CAGTCTTGTC 
AAGCAAAACA 
CAACATCTCC 
TTATAGAACT 
CTTTTCTCTG 
TAACACCAAT 
CATTGCGTGG 
CACAGGTCTT 
AGCTTGGCTA 
TGGCCTGGTA 
GATCTTCTCC 
AGCAGATAAG 
AAAACT CTGA 



31 

! 

GAAACTCTTT 
AAGAAGCCAT 
GTGAATGAAT 
TT CCTGT ATT 
AGCCTCTGTT 
TTCAAGACAA 
GGTGCCTTAC 
AGTCTAATAG 
CAGTTTGAAG 
ATCAATGCAG 
TGTTTTGGAG 
GCACTTGTTG 
ATAGTGGCTC 
TCTGGAGACA 
CAGCTCATTA 
ATGTTCTGGG 
AATAGGAATT 
CTGGTTCTTA 
GGAATTAACT 
TTTGCAGTTG 
CCCCAGGAGG 
GTGGGAAATG 
CACTATTCCA 
CACAATTTGT 
ATTCGTGAAG 
ACCAATGGGA 
CTGAGTACAG 
GTGCAAAGAG 
AATTTGGGTC 
CAGGGTCTTC 
CAGCTACCAC 
GAGTTTTCTT 
TTGACAATTG 
CAGTGGGCCG 
AT CATGGGCT 
CACATTCCTC 



41 



51 



TTTCACCTGT 
CTCCGACAAT 
TCTGCGAGCG 
TCCTGCACTG 
ATTTTACTCC 
TCATCTATCT 
CAATACTGGG 
CTTTGGGGAC 
AAAAACATGC 
GGAGCTTGAT 
AAGACTGCTA 
TGTTTGCAAT 
AAGTTTTCAA 
TTCCAAAGCG 
TGGATGTAAA 
CTCTTTTGGA 
TGGGGTTTTT 
TCTTCATCCC 
TCTCATCACT 
CGGCAGCTGT 
TTTTCCTACA 
AAAACAATTC 
AACTGCACCT 
CTCTCTACAC 
ATGGGAACAG 
TGACAACCGT 
ATACCTCTCT 
GAGAATACCC 
TTCTAGACTT 
AGGCCTGGAA 
AATATG CCCT 
ATTCTCAGGC 
CAGTTGGGAA 
AATTCATTTT 
ACTACTATGT 
ACATCCAGGG 



CTCCATTGAA 
CTGTGGCTCC 
CTTTTCCTAT 
GAATGAAGAT 
CATCCTGGGA 
CTCCTTGGTG 
AGGACAAGTG 
AGGAGGCATC 
AGAGGAACGG 
TTCTACATTT 
TGCATTGGCT 
GGGAAGCAAA 
ATGTATCTGG 
ACAGCACTGG 
GGCACTGACC 
TCAGCAGGGT 
TGTGCTTCAG 
GTTGTTTGAC 
TAGGAAAATG 
AGAGATAAAA 
AGTCTTGAAT 
TCTGTTGATA 
GAAAACAAAA 
TGAG CATTCT 
TATCTCCAGC 
GAGGTTTGTT 
CAATGTTGGT 
TGCAGTGCAC 
TGGTGCAGCA 
GATTGAAGAC 
GGTTACAGCT 
TCCCTCTAGC 
TATCATCGTG 
GTTTTCCTGC 
TCCTGTAAAG 
GAACATGATC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
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10 



15 



1 
I 

MNPFQKNESK 
YGMKAVLILY 
YVLGHVTKSIi 
TRYFSVFYLS 
IYNKPPPEGN 
RVLFLYIPLP 
FVIYRLVSKC 
LADDEVKVTV 
VQEKNWYSLV 
EDYGVSAYRT 
IPANKMSIAW 
LWAQFSGLV 
KLETKKTKL 



11 
I 

ETLFSPVSIE 
FLYFLHWNED 
GALPILGGQV 
INAGSLISTF 
IVAQVFKCIW 
MFWALLDQQG 
GINFSSLRKM 
VGWENNSLLI 
IREDGNSISS 
VQRGEYPAVH 
QLPQYALVTA 
QWAEFILFSC 



21 

I 

EVPPRPPSPP 
TSTSIYHAFS 
VHTVLSLIGL 
ITPMLRGDVQ 
FAISNRFKNR 
SRWTLQAIRM 
AVGMILACLA 
ESIKSFOKTP 
MMVKDTESKT 
CRTEDKNFSL 
GEVMFSVTGL 
LLLVICLIFS 



31 
1 

KKPSPTICGS 
SLCYFTPILG 
SLIALGTGGI 
CFGEDCYALA 
SGDIPKRQHW 
NRNLGFFVLQ 
FAVAAAVEIK 
HYSKLHLKTK 
TNGMTTVRFV 
NLGLLDFGAA 
EFSYSQAPSS 
IMGYYYVPVK 



41 
I 

NYPLSIAFIV 
AAIADSWLGK 
KPCVAAFGGD 
FGVPGLLMVI 
LDWAAEKYPK 
PDQMQVLNPF 
INEMAPAQSG 
SQDFHFHLKY 
NTLHKDVNIS 
YLFVITNNTN 
MKSVLQAAWL 
TEDMRGPADK 



51 

1 

VNEFCERFSY 
FKTIIYLSLV 
QFEEKHAEER 
ALWFAMGSK 
QLIMDVKALT 
LVLIFIPLFD 
PQEVFLQVLN 
HNLSLYTEHS 
LSTDTSLNVG 
QGLQAWKIED 
LTIAVGNIIV 
HIPHIQGNMI 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 



Seq ID NO: 682 DNA sequence 

Nucleic Acid Accession #: NM_016077.1 

Coding sequence: 128.. 667 



11 



21 



31 



41 



51 



TCGCTTTGTG 
CGCGATAGAA 
ACTGTAGATG 
CTTGGCTGTT 
GATGCTCCCC 
CTTGGGAGAC 
AAAAGGGAAA 
AAGAAGAAAT 
CAAAGCTCCT 
GACTGTAAGT 
CCTAGGGATT 
TTACTAGGTG 
GATTCTAACA 
AAACCTATTC 



ATTCTTGATC 
ACGTGTTCGC 
CCCTCCAAAT 
GGAGTTGCTT 
AAAAGCAAGA 
AGCGGGGAGT 
GTGGCTGCCC 
CCTGAAATGC 
GATGAAGAAA 
TTAATTCAAG 
GGGCCAGGAC 
GACTTTGATA 
ACAAAAG CTG 
CCATGTTCTA 



CGGAACTTTG 
TTGCCCAGAA 
CCTTGGTTAT 
GTGGCATGTG 
CGAGCAAGAC 
ACAAGATGAT 
AGTGCTCTCA 
TCAAACAATG 
CCCTGATTGC 
ATGCTGGACG 
CAGCAGACCT 
TGACAACAAC 
AATTTCTTCA 
AAAAAA 



TCACCCAGGA 
GAAGGGAAGG 
GGAATATTTG 
CCTGGGCTGG 
ACACACAGAT 
TCTTGTGGTT 
TGCTGCTGTT 
GGAATACTGT 
ATTATTGGCC 
TACTCAGATT 
AATTGACAAA 
CCCTCCATCA 
CCCAACTTAA 



ACCCCGGAAG 
CGCGAGTGAG 
GCTCATCCCA 
AGCCTTCGAG 
ACTGAAAGTG 
CGAAATGACT 
TCAGCCTACA 
GGCCAGCCCA 
CATGCAAAAA 
GCACCAGGCT 
GTCACTGGTC 
CAAGTGTTTG 
ATGTTCTTGA 



AGGTAGCTCA 
GAAAGGAGGT 
GTACACTCGG 
TATGCTTTGG 
AAGCAAGCAT 
TAAAGATGGG 
AGCAGATTCA 
AGGTGGTGGT 
TGCTGGGACT 
CTCAAACTGT 
ACCTAAAACT 
AAGCCTGTCA 
GATGAAAATA 



Seq ID NO: 683 Protein sequence 
Protein Accession #: NP_0 57161.1 

1 11 21 31 41 51 

I I I 1 I i 

MPSKSLVMEY LAHPSTLGLA VGVACGMCLG WSLRVCFGML PKSKTSKTHT DTESEASILG 
DSGEYKMILV VRNDLKMGKG KVAAQCSHAA VSAYKQIQRR NPEMLKQWEY CGQPKVWKA 
PDEETLIALL AHAKMLGLTV SLIQDAGRTQ IAPGSQTVLG IGPGPADLID KVTGHLKLY 

Seq ID NO: 684 DNA sequence 

Nucleic Acid Accession #; NM_004864.1 

Coding sequence: 2 6.. 952 



CGGAACGAGG 
TCAGATGCTC 
GGCCGAGGCG 
ATTCCGAGAG 
CTGGGAAGAT 
AGTGCGGCTG 
GGGGCTCCCC 
AAGGTCGTGG 
GCCCGCGCTG 
ATCTTCGTCC 
CCGCAGAGCG 
TCTGCACACG 
ACGGGAGGTG 
CATGCACGCG 
CTGCTGCGTG 
GTCGCTCCAG 
GGTCCTTCCA 
GGGCTCAAGG 
TTATTTATTA 
ACTGTGTATT 
AAAA 



11 
I 

GCAACCTGCA 
CTGGTGTTGC 
AGCCGCGCAA 
TTGCGGAAAC 
TCGAACACCG 
GGATCCGGCG 
GAGGCCTCCC 
GACGTGACAC 
CACCTGCGAC 
GCACGGCCCC 
CGTGCGCGCA 
GTCCGCGCGT 
CAAGTGACCA 
CAGATCAAGA 
CCCGCCAGCT 
ACCTATGATG 
CTGTGCACCT 
TTCCTGAGAC 
TTAATTTATT 
TATTTAAAAC 



21 
I 

CAGCCATGCC 
TGGTGCTCTC 
GTTTCCCGGG 
GCTACGAGGA 
ACCTCGTCCC 
GCCACCTGCA 
GCCTTCACCG 
GACCGCTGCG 
TGTCGCCGCC 
AGCTGGAGTT 
ACGGGGACGA 
CGCTGGAAGA 
TGTGCATCGG 
CGAGCCTGCA 
ACAATCCCAT 
ACTTGTTAGC 
GCGCGGGGGA 
ACCCGATTCC 
GGGGTGACCT 
TCTGGTGATA 



31 
I 

CGGGCAAGAA 
GTGGCTGCCG 
ACCCTCAGAG 
CCTGCTAACC 
GGCCCCTGCA 
CCTGCGTATC 
GGCTCTGTTC 
GCGTCAGCTC 
GCCGTCGCAG 
GCACTTGCGG 
CTGTCCGCTC 
CCTGGGCTGG 
CGCGTGCCCG 
CCGCCTGAAG 
GGTGCTCATT 
CAAAGACTGC 
GGCGACCTCA 
TGCCCAAACA 
TCTTGGGGAC 
AAAATAAAGC 



41 
I 

CTCAGGACGG 
CATGGGGGCG 
TTGCACTCCG 
AGGCTGCGGG 
GT CCGGAT AC 
TCTCGGGCCG 
CGGCTGTCCC 
AGCCTTGCAA 
TCGGACCAAC 
CCGCAAGCCG 
GGGCCCGGGC 
GCCGATTGGG 
AGCCAGTTCC 
CCCGACACGG 
CAAAAGACCG 
CACTGCATAT 
GTTGTCCTGC 
GCTGTATTTA 
TCGGGGGCTG 
TGTCTGAACT 



51 
I 

TGAATGGCTC 
CCCTGTCTCT 
AAGACTCCAG 
CCAACCAGAG 
TCACGCCAGA 
CCCTTCCCGA 
CGACGGCGTC 
GACCCCAAGC 
TGCTGGCAGA 
CCAGGGGGCG 
GTTGCTGCCG 
TGCTGTCGCC 
GGGCGGCAAA 
AGCCAGCGCC 
ACACCGGGGT 
GAGCAGTCCT 
CCTGTGGAAT 
TATAAGTCTG 
GTCTGATGGA 
GTTAAAAAAA 



Seq ID NO: 685 Protein sequence 
Protein Accession #: NP__004855.1 



MPGQELRTVN 
EDLLTRLRAN 
HRALFRLSPT 
ELHLRPQAAR 
IGACPSQFRA 
LAKDCHCI 



11 
I 

GSQMLLVLLV 
QSWEDSNTDL 
ASRSWDVTRP 
GRRRARARNG 
ANMHAQIKTS 



21 
I 

LSWLPHGGAL 
VPAPAVRILT 
LRRQLSLARP 
DDCPLGPGRC 
LHRLKPDTEP 



31 



41 



SLAEASRASF PGPSELHSED 
PEVRLGSGGH LHLRISRAAL 
QAPALHLRLS PPPSQSDQLL 
CRLHTVRASL EDLGWADWVL 
APCCVPASYN PMVLIQKTDT 



51 
I 

SRFRELRKRY 
PEGLPEASRL 
AESSSARPQL 
SPREVQVTMC 
GVSLQTYDDL 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



60 
120 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



60 
120 
180 
240 
300 



Seq ID NO: 686 DNA sequence 
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Nucleic Acid Accession #: NM_002423.2 
Coding sequence; 4 8.. 851 

1 11 21 31 41 51 

5 1 1 | I I I 

ACCAAATCAA CCATAGGTCC AAGAACAATT GTCTCTGGAC GGCAGCTATG CGACTCACCG 60 

TGCTGTGTGC TGTGTGCCTG CTGCCTGGCA GCCTGGCCCT GCCGCTGCCT CAGGAGGCGG 120 

GAGGCATGAG TGAGCTACAG TGGGAACAGG CTCAGGACTA TCTCAAGAGA TTTTATCTCT 180 

ATGACTCAGA AACAAAAAAT GCCAACAGTT TAGAAGCCAA ACT CAAGGAG ATGCAAAAAT 240 

10 TCTTTGGCCT ACCTATAACT GGAATGTTAA ACTCCCGCGT CATAGAAATA ATGCAGAAGC 300 

CCAGATGTGG AGTGCCAGAT GTTGCAGAAT ACTCACTATT TCCAAATAGC CCAAAATGGA 360 

CTTCCAAAGT GGTCACCTAC AGGATCGTAT CATATACTCG AGACTTACCG CATATTACAG 420 

TGGATCGATT AGTGTCAAAG GCTTTAAACA TGTGGGGCAA AGAGATCCCC CTGCATTTCA 480 

GGAAAGTTGT ATGGGGAACT GCTGACATCA TGATTGGCTT TGCGCGAGGA GCTCATGGGG 540 

15 ACTCCTACCC ATTTGATGGG CCAGGAAACA CGCTGGCTCA TGCCTTTGCG CCTGGGACAG 600 

GTCTCGGAGG AGATGCTCAC TTCGATGAGG ATGAACGCTG GACGGATGGT AGCAGTCTAG 660 

GGATTAACTT CCTGTATGCT GCAACTCATG AACTTGGCCA TTCTTTGGGT ATGGGACATT 720 

CCTCTGATCC TAATGCAGTG ATGTATCCAA CCTATGGAAA TGGAGATCCC CAAAATTTTA 780 

AACTTTCCCA GGATGATATT AAAGGCATTC AGAAACTATA TGGAAAGAGA AGTAATTCAA 840 

20 GAAAGAAATA GAAACTTCAG GCAGAACATC CATTCATTCA TTCATTGGAT TGTATATCAT 900 

TGTTGCACAA TCAGAATTGA TAAGCACTGT TCCTCCACTC CATTTAGCAA TTATGTCACC 960 

CTTTTTTATT GCAGTTGGTT TTTGAATGTC TTTCACTCCT TTTATTGGTT AAACTCCTTT 1020 

ATGGTGTGAC TGTGTCTTAT TCCATCTATG AGCTTTGTCA GTGCGCGTAG ATGTCAATAA 10 80 
ATGTTACATA CACAAATAAA TAAAATGTTT ATTCCATGGT AAATTTA 



25 



40 



Seq ID NO: 687 Protein sequence 
Protein Accession #: NP 002414.1 



a 11 21 31 41 51 

30 i | i | | | 

MRLTVLCAVC LLPGSLALPIi PQEAGGMSEL QWEQAQDYLK RFYLYDSETK NANSLEAKLK 60 

EMQKFFGLPI TGMLNSRVIE IMQKPRCGVP DVAEYSLFPN SPKWTSKWT YRIVSYTRDL 12 0 

PHITVDRLVS KALNMWGKE I PLHFRKWWG TADIMIGFAR GAHGDSYPFD GPGNTLAHAF 180 

APGTGLGGDA HFDEDERWTD GSSLGINFLY AATHELGHSIi GMGHSSDPNA VMYPTYGNGD 240 

35 PQNFKLSQDD IKGIQKIjYGK RSNSRKK 



Seq ID NO: 688 DNA sequence 

Nucleic Acid Accession #: NM_005221.3 

Coding sequence : 1 . . 870 



1 11 21 31 41 51 

I I I I 1 I 

ATGACAGGAG TGTTTGACAG AAGGGTCCCC AGCATCCGAT CCGGCGACTT CCAAGCTCCG 60 

TTCCAGACGT CCGCAGCTAT GCACCATCCG TCT CAGGAAT CGCCAACTTT GCCCGAGTCT 120 

45 TCAGCTACCG ATTCTGACTA CTACAGCCCT ACGGGGGGAG CCCCGCACGG CTACTGCTCT 180 

CCTACCTCGG CTTCCTATGG CAAAGCTCTC AACCCCTACC AGTATCAGTA TCACGGCGTG 240 

AACGGCTCCG CCGGGAGCTA CCCAGCCAAA GCTTATGCCG ACTATAGCTA CGCTAGCTCC 300 

TACCACCAGT ACGGCGGCGC CTACAACCGC GTCCCAAGCG CCACCAACCA GCCAGAGAAA 360 

GAAGTGACCG AGCCCGAGGT GAGAATGGTG AATGGCAAAC CAAAGAAAGT TCGTAAACCC 420 

50 AGGACTATTT ATTCCAGCTT TCAGCTGGCC GCATTACAGA GAAGGTTTCA GAAGACTCAG 480 

TACCTCGCCT TGCCGGAACG CGCCGAGCTG GCCGCCTCGC TGGGATTGAC ACAAACACAG 540 

GTGAAAATCT GGTTTCAGAA C AAAAG AT C C AAGATCAAGA AGATCATGAA AAACGGGGAG 600 

ATGCCCCCGG AGCACAGTCC CAGCTCCAGC GACCCAATGG CGTGTAACTC GCCGCAGTCT 660 

CCAGCGGTGT GGGAGCCCCA GGGCTCGTCC CGCTCGCTCA GCCACCACCC TCATGCCCAC 720 

55 CCTCCGACCT CCAACCAGTC CCCAGCGTCC AGCTACCTGG AGAACTCTGC ATCCTGGTAC 780 

ACAAGTGCAG CCAGCTCAAT CAATTCCCAC CTGCCGCCGC CGGGCTCCTT ACAGCACCCG 840. 
CTGGCGCTGG CCTCCGGGAC ACTCTATTAG 

Seq ID NO; 689 Protein sequence 
60 Protein Accession #: NP_005212,1 

1 11 21 31 41 51 

I I I i I i 

MTGVFDRRVP SIRSGDFQAP FQTSAAMHHP SQESPTLPES SATDSDYYSP TGGAPHGYCS 60 

65 PTSASYGKAL NPYQYQYHGV NGSAGSYPAK AYADYSYASS YHQYGGAYNR VPSATNQPEK 120 

EVTEPEVRMV NGKPKKVRKP RTIYSSFQLA ALQRRFQKTQ YLALPERAEL AASLGLTQTQ 180 

VKIWFQNKRS KIKKIMKNGE MPPEHSPSSS DPMACNSPQS PAVWEPQGSS RSLSHHPHAH 240 
PPTSNQSPAS SYLENSASWY TSAASSINSH LPPPGSLQHP LALASGTLY 



446 
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It is understood that the examples described above in no way serve to limit the true 
scope of this invention, but rather are presented for illustrative purposes. All publications, 
sequences of accession numbers, and patent applications cited in this specification are herein 
ncorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. 
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WHAT IS CLAIMED IS: 

1 LA method of detecting a lung cancer-associated transcript in a cell 

2 from a patient, the method comprising contacting a biological sample from the patient with a 

3 polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 

4 as shown in Tables 1A-16. 

1 2. The method of claim 1, wherein the polynucleotide selectively 

2 hybridizes to a sequence at least 95% identical to a sequence as shown in Tables 1A-16. 

1 3. The method of claim 1 , wherein the biological sample is a tissue 

2 sample. 

1 4. The method of claim 1 , wherein the biological sample comprises 

2 isolated nucleic acids. 

1 5. The method of claim 4, wherein the nucleic acids are mRNA. 

1 6. The method of claim 4, further comprising the step of amplifying 

2 nucleic acids before the step of contacting the biological sample with the polynucleotide. 

1 7. The method of claim 1 , wherein the polynucleotide comprises a 

2 sequence as shown in Tables 1 A-16. 

1 8. The method of claim 1 , wherein the polynucleotide is labeled. 

1 9. The method of claim 8, wherein the label is a fluorescent label. 

1 10. The method of claim 1 , wherein the polynucleotide is immobilized on 

2 a solid surface. 

1 11. The method of claim 1, wherein the patient is undergoing a therapeutic 

2 regimen to treat lung cancer. 

1 12. The method of claim 1 , wherein the patient is suspected of having lung 

2 cancer. 

1 13. A method of monitoring the efficacy of a therapeutic treatment of lung 

2 cancer, the method comprising the steps of: 
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3 (i) providing a biological sample from a patient undergoing the therapeutic 

4 treatment; and 

5 (ii) determining the level of a lung cancer-associated transcript in the 

6 biological sample by contacting the biological sample with a polynucleotide that selectively 

7 hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1 A- 16, 

8 thereby monitoring the efficacy of the therapy. 

1 14, The method of claim 13, further comprising the step of: (iii) comparing 

2 the level of the lung cancer-associated transcript to a level of the lung cancer-associated 

3 transcript in a biological sample from the patient prior to, or earlier in, the therapeutic 

4 treatment. 

1 15. The method of claim 13, wherein the patient is a human. 

1 16. A method of monitoring the efficacy of a therapeutic treatment of lung 

2 cancer, the method comprising the steps of: 

3 (i) providing a biological sample from a patient undergoing the therapeutic 

4 treatment; and 

5 (ii) determining the level of a lung cancer-associated antibody in the biological 

6 sample by contacting the biological sample with a polypeptide encoded by a polynucleotide 

7 that selectively hybridizes to a sequence at least 80% identical to a sequence as shown in 

8 Tables 1 A- 16, wherein the polypeptide specifically binds to the lung cancer-associated 

9 antibody, thereby monitoring the efficacy of the therapy. 

1 17. The method of claim 16, further comprising the step of: (iii) comparing 

2 the level of the lung cancer-associated antibody to a level of the lung cancer-associated 

3 antibody in a biological sample from the patient prior to, or earlier in, the therapeutic 

4 treatment. 

1 18. The method of claim 16, wherein the patient is a human. 

1 1 9. A method of monitoring the efficacy of a therapeutic treatment of lung 

2 cancer, the method comprising the steps of: 

3 (i) providing a biological sample from a patient undergoing the therapeutic 

4 treatment; and 
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5 (ii) determining the level of a lung cancer-associated polypeptide in the 

6 biological sample by contacting the biological sample with an antibody, wherein the antibody 

7 specifically binds to a polypeptide encoded by a polynucleotide that selectively hybridizes to 

8 a sequence at least 80% identical to a sequence as shown in Tables 1A-16, thereby 

9 monitoring the efficacy of the therapy. 

1 20. The method of claim 19, further comprising the step of: (iii) comparing 

2 the level of the lung cancer-associated polypeptide to a level of the lung cancer-associated 

3 polypeptide in a biological sample from the patient prior to, or earlier in, the therapeutic 

4 treatment. 

2 1 . The method of claim 1 9, wherein the patient is a human. 

22. An isolated nucleic acid molecule consisting of a polynucleotide 
sequence as shown in Tables 1A-16. 

23. The nucleic acid molecule of claim 22, which is labeled. 

24. The nucleic acid of claim 23, wherein the label is a fluorescent label 

25. An expression vector comprising the nucleic acid of claim 22. 

26. A host cell comprising the expression vector of claim 25. 

27. An isolated polypeptide which is encoded by a nucleic acid molecule 
having polynucleotide sequence as shown in Tables 1A-16. 

28. An antibody that specifically binds a polypeptide of claim 27. 

29. The antibody of claim 28, further conjugated to an effector component. 

30. The antibody of claim 29, wherein the effector component is a 
fluorescent label. 

3 1 . The antibody of claim 29, wherein the effector component is a 
radioisotope or a cytotoxic chemical. 

32. The antibody of claim 29, which is an antibody fragment. 
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1 33 . The antibody of claim 29, which is a humanized antibody 

1 34. A method of detecting a lung cancer cell in a biological sample from a 

2 patient, the method comprising contacting the biological sample with an antibody of claim 

3 28. 

1 35. The method of claim 34, wherein the antibody is further conjugated to 

2 an effector component. 

1 36. The method of claim 35, wherein the effector component is a 

2 fluorescent label. 

1 37. A method of detecting antibodies specific to lung cancer in a patient, 

2 the method comprising contacting a biological sample from the patient with a polypeptide 

3 encoded by a nucleic acid comprises a sequence from Tables 1 A-16. 

1 38. A method for identifying a compound that modulates a lung cancer- 

2 associated polypeptide, the method comprising the steps of: 

3 (i) contacting the compound with a lung cancer-associated polypeptide, the 

4 polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least 

5 80% identical to a sequence as shown in Tables 1 A-16; and 

6 (ii) determining the functional effect of the compound upon the polypeptide. 

1 39. The method of claim 38, wherein the functional effect is a physical 

2 effect. 

1 40. The method of claim 38, wherein the functional effect is a chemical 

2 effect. 

1 41 . The method of claim 38, wherein the polypeptide is expressed in a 

2 eukaryotic host cell or cell membrane. 

1 42. The method of claim 38, wherein the functional effect is determined by 

2 measuring ligand binding to the polypeptide. 

1 43. The method of claim 38, wherein the polypeptide is recombinant. 
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1 44. A method of inhibiting proliferation of a lung cancer-associated cell to 

2 treat lung cancer in a patient, the method comprising the step of administering to the subject a 

3 therapeutically effective amount of a compound identified using the method of claim 38. 

1 45. The method of claim 44, wherein the compound is an antibody. 

1 46. The method of claim 45, wherein the patient is a human. 

1 47. A drug screening assay comprising the steps of 

2 (i) administering a test compound to a mammal having lung cancer or a cell 

3 isolated therefrom; 

4 (ii) comparing the level of gene expression of a polynucleotide that selectively 

5 hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1A-16 in a 

6 treated cell or mammal with the level of gene expression of the polynucleotide in a control 

7 cell or mammal, wherein a test compound that modulates the level of expression of the 

8 polynucleotide is a candidate for the treatment of lung cancer. 

1 48. The assay of claim 47, wherein the control is a mammal with lung 

2 cancer or a cell therefrom that has not been treated with the test compound. 

1 49. The assay of claim 47, wherein the control is a normal cell or mammal. 

1 50. A method for treating a mammal having lung cancer comprising 

2 administering a compound identified by the assay of claim 47. 

1 5 1 . A pharmaceutiPcal composition for treating a mammal having lung 

2 cancer, the composition comprising a compound identified by the assay of claim 47 and a 

3 physiologically acceptable excipient. 
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